Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add HF model support inc. DS-R1-Distill, Qwen needs yarn support #17421

Merged
merged 52 commits into from
Feb 7, 2025

Conversation

yieldthought
Copy link
Contributor

@yieldthought yieldthought commented Jan 31, 2025

Problem description

Existing codebase loads the meta checkpoint format but many derivative models are only available on huggingface.

What's changed

Add support for loading HuggingFace model formats, paving the way for full Qwen support (pending yarn rope implementation) and adding DeepSeek-R1-Distill-Llama-70B support.

Checklist

All passing locally.

Copy link
Contributor

@cglagovichTT cglagovichTT left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

bravo sir

models/demos/llama3/requirements.txt Show resolved Hide resolved
models/demos/llama3/tests/test_ref.py Outdated Show resolved Hide resolved
models/demos/llama3/tt/llama_attention.py Outdated Show resolved Hide resolved
@mtairum
Copy link
Contributor

mtairum commented Jan 31, 2025

@yieldthought Re-generated all Llama3 cache files in CI for N150 / N300 / T3K.

TG will need to be regenerated at a later date through CI.

Re-running all pipelines. The T3K old one was not even building correctly.

models/demos/llama3/tt/llama_attention.py Outdated Show resolved Hide resolved
@yieldthought
Copy link
Contributor Author

Ready to merge when tests pass

@mtairum
Copy link
Contributor

mtairum commented Feb 4, 2025

All passing locally. Running the latest CI pipelines here.

If they pass we're good to merge.

@mtairum
Copy link
Contributor

mtairum commented Feb 6, 2025

Updated the issues on the description. Investigating the remaining ones that consistently fail.

@yieldthought yieldthought merged commit d0b59bd into main Feb 7, 2025
11 checks passed
@yieldthought yieldthought deleted the hf-llama branch February 7, 2025 12:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants