Skip to content

NaN in training #72

Answered by fredzzhang
weiyana asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @weiyana,

Please refer to #71. The issue is the batch size. The number of GPUs you use times the per-GPU batch size should be at least 16.

Fred.

Replies: 2 comments

Comment options

You must be logged in to vote
0 replies
Answer selected by weiyana
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants