-
Notifications
You must be signed in to change notification settings - Fork 44
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
spikes #1
Comments
I just set |
Okay, thanks |
I did this following the guides of the origial paper, as in the paper |
Big thanks |
What batch size and lr do you use? You can try different batch size or lr, perhaps it can solve your problem. |
Batch size is 16.Lr's are 1e-10 and 2e-6 (on screenshot). Well, as you can see, decreasing lr only delays time till spikes appear |
I used batch-size 16 and lr 0.00002 at the first several epochs. The spike did not appear. You can try the following commands:
If this problem still appears, you may check the network parameters in |
Hi, do you have any suggestions on the next problem:
While training sdu(nadam,lr=0.00025), this is the loss on validation test:
Different model on the same training data was fine
Also, while training, lossvalue=nan starts to appear
The text was updated successfully, but these errors were encountered: