do_sample is True, nan appears in "eager" mode #14

sunyanguomt · 2024-11-21T12:34:17Z

Hi,
When I run demo.py with attn_implementation="eager", the following error occurs.

Traceback (most recent call last):
File "demo.py", line 44, in <module>
output_ids = model.generate(input_ids, images=[video_tensor], modalities=["video"],beacon_skip_first=beacon_skip_first,beacon_skip_last=beacon_skip_last, **gen_kwargs)
File "/opt/conda/envs/py38/lib/python3.8/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return func(*args, **kwargs)
File "/opt/conda/envs/py38/lib/python3.8/site-packages/videoxl/model/language_model/llava_qwen.py", line 1830, in generate
return super().generate(position_ids=position_ids, attention_mask=attention_mask,inputs=inputs,beacon_skip_first=beacon_skip_first, beacon_skip_last= beacon_skip_last, **kwargs)
File "/opt/conda/envs/py38/lib/python3.8/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return func(*args, **kwargs)
File "/opt/conda/envs/py38/lib/python3.8/site-packages/transformers/generation/utils.py", line 2024, in generate
result = self._sample(
File "/opt/conda/envs/py38/lib/python3.8/site-packages/transformers/generation/utils.py", line 3020, in _sample
next_tokens = torch.multinomial(probs.to("cpu"), num_samples=1).squeeze(1).to("musa")
RuntimeError: probability tensor contains either inf, nan or element < 0
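
The last frame shows `torch.multinomial` rejecting the probability tensor, but the message does not say which kind of invalid value is present. A minimal sketch of a pre-sampling check follows, written in plain Python over a list so it runs without a GPU. The function name `describe_invalid_probs` is hypothetical and is not part of transformers or Video-XL.

```python
import math


def describe_invalid_probs(probs):
    """Return the kinds of invalid values found in probs (empty list if none)."""
    problems = []
    if any(math.isnan(p) for p in probs):
        problems.append("nan")
    if any(math.isinf(p) for p in probs):
        problems.append("inf")
    # NaN compares False against everything, so this only flags real negatives.
    if any(p < 0 for p in probs):
        problems.append("negative")
    return problems


print(describe_invalid_probs([0.2, 0.8]))           # []
print(describe_invalid_probs([float("nan"), 0.5]))  # ['nan']
```

Running the same kind of check on the real tensor just before sampling would show whether the failure is NaN, inf, or a negative entry.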

I then traced it to this location: this softmax produces NaN.

This problem does not occur when using "sdpa" mode.
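
If the softmax in question is the one in the eager attention path, one known way it can produce NaN is a row in which every score is masked to -inf: subtracting the row maximum gives -inf - (-inf), which is NaN. Whether that is the cause here is an assumption; the thread does not confirm it, and half-precision overflow is another possibility. The sketch below reproduces the mechanism in plain Python. `naive_softmax` and `safe_softmax` are hypothetical names, not functions from the repository.

```python
import math

NEG_INF = float("-inf")


def naive_softmax(scores):
    """Max-subtracted softmax with no handling for fully masked rows."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]  # -inf - (-inf) evaluates to nan
    total = sum(exps)
    return [e / total for e in exps]


def safe_softmax(scores):
    """Same softmax, but a fully masked row returns zeros instead of nan."""
    m = max(scores)
    if m == NEG_INF:
        return [0.0 for _ in scores]
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


print(naive_softmax([0.0, NEG_INF]))      # [1.0, 0.0]
print(naive_softmax([NEG_INF, NEG_INF]))  # [nan, nan]
print(safe_softmax([NEG_INF, NEG_INF]))   # [0.0, 0.0]
```

A partially masked row is harmless; only the row with no unmasked position turns into NaN, which then propagates through the remaining layers into the sampling probabilities.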

shuyansy · 2024-11-28T19:49:13Z

In my implementation, the model runs in "flash-attention" mode. It may not work in other modes.
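
In Hugging Face transformers, the attention backend is selected with the `attn_implementation` argument of `from_pretrained`, which accepts `"eager"`, `"sdpa"`, and `"flash_attention_2"`. A minimal sketch of choosing a value depending on whether the `flash_attn` package is importable is shown below. The helper name `pick_attn_implementation` is hypothetical, and how demo.py forwards the argument to the Video-XL loader is an assumption that should be checked against the script.

```python
import importlib.util


def pick_attn_implementation(prefer_flash=True):
    """Return an attn_implementation string for transformers' from_pretrained."""
    # "flash_attention_2" requires the separate flash_attn package.
    if prefer_flash and importlib.util.find_spec("flash_attn") is not None:
        return "flash_attention_2"
    # Otherwise fall back to PyTorch's scaled_dot_product_attention backend.
    return "sdpa"


attn_impl = pick_attn_implementation()
print(attn_impl)

# Hypothetical usage, assuming the loader passes the keyword through:
# model = AutoModelForCausalLM.from_pretrained(path, attn_implementation=attn_impl)
```

The traceback above shows tensors on a `musa` device; the thread does not say whether a flash-attention build is available for that backend, so the "sdpa" fallback may be the practical choice there.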

ApolloRay · 2024-12-18T02:42:12Z

I have the same problem. Where can I change the attention implementation to flash-attention?
