Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Note about Mistral models #67

Open
inflatebot opened this issue Sep 20, 2024 · 0 comments
Open

Note about Mistral models #67

inflatebot opened this issue Sep 20, 2024 · 0 comments

Comments

@inflatebot
Copy link

inflatebot commented Sep 20, 2024

IDK if you guys have been using mistral-common to test Mistral's models, but if you haven't, there's a chance you haven't been forming your requests properly. The templates used by a lot of tooling have been subtly broken for a long time. It'd be worth checking the new document from Mistral's Cookbook and possibly reimplementing the tokenizer/templates used for those tests if they need it, and redoing the tests.

(We're all having a collective panic attack about this in the RP scene right now, because it means a huge chunk of our finetunes and merges are probably also broken!)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant