Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training collapses after 72k iterations #24

Open
Lauenburg opened this issue Oct 16, 2021 · 2 comments
Open

Training collapses after 72k iterations #24

Lauenburg opened this issue Oct 16, 2021 · 2 comments

Comments

@Lauenburg
Copy link

Lauenburg commented Oct 16, 2021

I am currently trying to test the model on the maps data set (satellite and card images from google maps). The training went well until the 72k iteration. Afterward, the training collapses or rather no result is returned:

Images for A2B until 72000 iterations:
A2B_0072000

Images for A2B after 72000 iterations:
A2B_0073000

Images for B2A until 72000 iterations:
B2A_0072000

Images for B2A until 72000 iterations:
B2A_0073000

Any idea what could have happened here?

@Q-Zhang98
Copy link

I am currently trying to test the model on the maps data set (satellite and card images from google maps). The training went well until the 72k iteration. Afterward, the training collapses or rather no result is returned:

Images for A2B until 72000 iterations: A2B_0072000

Images for A2B after 72000 iterations: A2B_0073000

Images for B2A until 72000 iterations: B2A_0072000

Images for B2A until 72000 iterations: B2A_0073000

Any idea what could have happened here?

Hello, I have the same problem. How did you solve it?

@alpc91
Copy link
Owner

alpc91 commented Dec 2, 2021

It may be that there is a phenomenon of model collapse. The training schedule can be adjusted according to the dataset, and the checkpoint can be saved for manual inspection or FID for model selection.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants