Resume training #35
That shouldn't happen. Will look into it.
Unfortunately I think this is still the case. When I reload saved parameters I get ... I've printed out the ...

Edit: I just ran another test, saving and loading models to see whether they were corrupted, but couldn't find any such thing. That points the finger back at the QNetworks having some exploding gradient problem or something similar.
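For anyone wanting to repeat the save/load corruption test described above, here is a minimal sketch of one way to do it. The `critic` network is a hypothetical stand-in, not the repository's QNetwork, and the helper names are my own; it round-trips a state dict through an in-memory buffer and checks for non-finite weights.

```python
import io

import torch
import torch.nn as nn


def state_dicts_equal(a, b):
    """Return True if both state dicts have the same keys and identical tensors."""
    if set(a) != set(b):
        return False
    return all(torch.equal(a[k], b[k]) for k in a)


def has_non_finite(module):
    """Return True if any parameter or buffer contains NaN or Inf."""
    tensors = list(module.parameters()) + list(module.buffers())
    return any(not bool(torch.isfinite(t).all()) for t in tensors)


def make_critic():
    # Hypothetical stand-in for a Q-network; sizes are arbitrary.
    return nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 1))


critic = make_critic()

# Save to an in-memory buffer instead of a file to keep the example self-contained.
buffer = io.BytesIO()
torch.save(critic.state_dict(), buffer)
buffer.seek(0)

# Load into a freshly constructed network, as one would when resuming.
restored = make_critic()
restored.load_state_dict(torch.load(buffer))

round_trip_ok = state_dicts_equal(critic.state_dict(), restored.state_dict())
weights_finite = not has_non_finite(restored)
```

If both flags are true, the checkpoint itself is probably not the culprit, which is consistent with the conclusion in the edit above.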
Sorry, I'm back without an answer, but is it possible that one of the issues is that the alpha optimizer is not saved/loaded via checkpoints?
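If that turns out to be the case, something along these lines might help. This is only a sketch, assuming automatic entropy tuning with a learnable `log_alpha` optimized by Adam; the names `log_alpha`, `alpha_optim`, and the checkpoint keys are assumptions for illustration and may not match the repository.

```python
import io

import torch
from torch.optim import Adam

# Assumed setup: a learnable log-temperature with its own optimizer.
log_alpha = torch.zeros(1, requires_grad=True)
alpha_optim = Adam([log_alpha], lr=3e-4)

# One dummy update so the optimizer has moment estimates worth saving.
dummy_loss = (log_alpha * 2.0).sum()
alpha_optim.zero_grad()
dummy_loss.backward()
alpha_optim.step()

# Save both the value and the optimizer state (in memory for this example).
buffer = io.BytesIO()
torch.save(
    {
        "log_alpha": log_alpha.detach().clone(),
        "alpha_optim": alpha_optim.state_dict(),
    },
    buffer,
)
buffer.seek(0)

# Resume: rebuild the tensor and optimizer, then restore both.
checkpoint = torch.load(buffer)
new_log_alpha = torch.zeros(1, requires_grad=True)
with torch.no_grad():
    new_log_alpha.copy_(checkpoint["log_alpha"])
new_alpha_optim = Adam([new_log_alpha], lr=3e-4)
new_alpha_optim.load_state_dict(checkpoint["alpha_optim"])
```

Without this, a resumed run would restart alpha and its Adam moments from their initial values, which changes the scale of the entropy term relative to what the critics were trained against.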
I have a bit more time to look into this. Interestingly, the critic/Q Networks are what are filling up with ...
Did you reload the state-dicts of target networks? If not, this might be the reason for exploding loss.
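To illustrate the point about target networks, here is a rough sketch that saves and restores the online critic and its target separately. The network, the names `critic` and `critic_target`, and the checkpoint keys are assumptions for illustration, not the repository's actual code.

```python
import copy
import io

import torch
import torch.nn as nn


def make_critic():
    # Hypothetical stand-in for a Q-network; sizes are arbitrary.
    return nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 1))


critic = make_critic()
critic_target = copy.deepcopy(critic)

# Simulate training having moved the online critic away from its target.
with torch.no_grad():
    for p in critic.parameters():
        p.add_(0.1)

# Save both networks under separate keys (in memory for this example).
buffer = io.BytesIO()
torch.save(
    {
        "critic": critic.state_dict(),
        "critic_target": critic_target.state_dict(),
    },
    buffer,
)
buffer.seek(0)

# Resume: restore the target from its own state dict rather than
# leaving it at a fresh random initialization.
checkpoint = torch.load(buffer)
resumed_critic = make_critic()
resumed_target = make_critic()
resumed_critic.load_state_dict(checkpoint["critic"])
resumed_target.load_state_dict(checkpoint["critic_target"])
```

If only the online critic is restored, the target network starts from unrelated weights, so the bootstrapped Q targets can be far off and the critic loss may jump right after resuming.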
Hello, I am trying to use the SAC agent and resume training. To do that I do:
Is this correct? The loss explodes after resuming which is very strange.