Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

为什么4090单卡可以进行推理,而多卡会爆显存呢(More than 4090 cards burst video memory) #191

Open
3214894586 opened this issue Mar 6, 2025 · 1 comment

Comments

@3214894586
Copy link

我使用4090进行单GPU推理并用qwen-3b拓展提示词,正常生成了视频。但是4张4090使用提示词拓展就爆显存了。这是什么问题?明明显存多了那么多,求解答!
I used the 4090 for single GPU inference and extended the prompt words with qwen-3b, and the video was generated normally. But 4 4090 cards use prompt words to expand and explode the video storage. What is the problem? Obviously, there are so many savings, ask for answers!
40901:
Image
4090
4:

Image

@yizhixu
Copy link

yizhixu commented Mar 7, 2025

现在还没有开源模型并行的代码吧?
数据并行的话多卡通信有显存开销的

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants