
Loss is "nan" while training gpt2 #238

Closed
sankethgadadinni opened this issue Jul 27, 2023 · 2 comments

Comments


sankethgadadinni commented Jul 27, 2023

from xturing.datasets.instruction_dataset import InstructionDataset
from xturing.models import BaseModel

model = BaseModel.create("gpt2")
instruction_dataset = InstructionDataset("/content/alpaca_data")
model.finetune(dataset=instruction_dataset)

Am I doing something wrong?


tushar2407 (Contributor) commented

Hi @sankethgadadinni, no, you are not doing anything wrong. It may simply be that the loss in your case is so high or so low that it exceeds the representable range, and hence you are getting NaN. This happens in many cases and is normal.
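For background on why an extreme loss shows up as NaN, here is a minimal, self-contained sketch in plain Python (an illustration of floating-point overflow, not xTuring's internals): once a value overflows the representable range to inf, ordinary arithmetic on it, such as the subtractions that occur in gradient computations, produces NaN.

```python
import math

# A value past float64's maximum (~1.8e308) overflows to inf
loss = 1e308 * 10
print(loss)              # inf

# inf - inf is mathematically undefined, so the result is nan;
# the same thing happens inside a backward pass once any
# intermediate value has overflowed.
grad = loss - loss
print(math.isnan(grad))  # True
```

Common mitigations are lowering the learning rate, clipping gradients, or training in a wider floating-point precision.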

tushar2407 (Contributor) commented

@sankethgadadinni were you able to load the model after fine-tuning? Does the model work after fine-tuning?
I can walk you through the process. I am holding an all-hands session this coming Friday, and you are welcome to join!
For more details, please head to the Discord channel: https://discord.gg/xj5j3VJC
