Error using single gpu for training #21
Comments
Hello @Adnan-Khan7, could you share the error details, such as the traceback and the line of code that fails?
Sure, please have a look at the traceback: Traceback (most recent call last):
Have you changed any default arguments? Is it the same with the command
I didn't change any other default arguments. Adding --world-size 1 now raises a ZeroDivisionError. This is the command I am running:
python train.py --world-size 1 --rank 0 --overwrite
train.py:40: UserWarning: You have chosen to seed training. This will turn on the CUDNN deterministic setting, which can slow down your training considerably! You may see unexpected behavior when restarting from checkpoints.
With --gpu 0 added:
train.py:40: UserWarning: You have chosen to seed training. This will turn on the CUDNN deterministic setting, which can slow down your training considerably! You may see unexpected behavior when restarting from checkpoints.
Dear Lee, any comments on the above-stated error?
Thanks for the work you have done.
I encounter the following error when training on a single GPU:
ValueError: num_samples should be a positive integer value, but got num_samples=-67108864
The command I am using is: python train.py --rank 0 --gpu 0
Can you please assist?
Thanks
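In case it helps with debugging, here is a minimal sketch of the kind of check that produces this message. As far as I can tell the wording matches the positive-integer check in PyTorch's `torch.utils.data.RandomSampler`, but that is an assumption. The helpers `check_num_samples` and `describe` are hypothetical names written for illustration and are not taken from train.py. The sketch only shows that the reported value is rejected and that it is exactly -(2**26).

```python
def check_num_samples(num_samples):
    """Mirror a positive-integer check like the one behind the message above.

    Hypothetical helper for illustration; not code from train.py.
    """
    is_int = isinstance(num_samples, int) and not isinstance(num_samples, bool)
    if not is_int or num_samples <= 0:
        raise ValueError(
            "num_samples should be a positive integer value, "
            f"but got num_samples={num_samples}"
        )
    return num_samples


def describe(value):
    """Split an integer into (sign, odd part, power-of-two exponent)."""
    sign = -1 if value < 0 else 1
    magnitude = abs(value)
    exponent = 0
    while magnitude and magnitude % 2 == 0:
        magnitude //= 2
        exponent += 1
    return sign, magnitude, exponent


reported = -67108864  # value copied from the error message above
print(describe(reported))  # (-1, 1, 26), i.e. -(2**26)

try:
    check_num_samples(reported)
except ValueError as err:
    print(err)
```

Because the magnitude is a clean power of two, one thing worth trying is printing the values that feed the sampler's num_samples in the data-loading code and looking for a single negative factor, rather than assuming the dataset size itself is wrong.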