We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hi, I made some changes #1 to get it to work on larger models.
My 2x3090 did not help so I had to opt to do per-layer-loading.
Thanks for your work.
I also gave a few ideas for MoE there :
Parameter-Efficient-MoE Mergekit
Thanks again and keep it up !
Great to see that you publish papers AND code !
The text was updated successfully, but these errors were encountered:
Thank you so much for your feedback and contributions!
Sorry, something went wrong.
No branches or pull requests
Hi, I made some changes #1 to get it to work on larger models.
My 2x3090 did not help so I had to opt to do per-layer-loading.
Thanks for your work.
I also gave a few ideas for MoE there :
Parameter-Efficient-MoE
Mergekit
Thanks again and keep it up !
Great to see that you publish papers AND code !
The text was updated successfully, but these errors were encountered: