Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Request for Assistance with Replication #15

Open
WentaoTan opened this issue Jan 15, 2025 · 3 comments
Open

Request for Assistance with Replication #15

WentaoTan opened this issue Jan 15, 2025 · 3 comments
Assignees
Labels
help wanted Extra attention is needed question Further information is requested

Comments

@WentaoTan
Copy link

Hello! Our team has been working on reproducing SkyThought, which we find to be groundbreaking and insightful. We have successfully replicated the results using the Qwen2.5-32B model based on the llama-factory code, achieving significant performance improvements.

However, we encountered some challenges when attempting the same training with the llama3.3-70B model. Contrary to our expectations, we did not observe a notable performance boost, and there was even a slight decline in performance on the math500 benchmark.

We greatly appreciate any guidance or insights. Thank you!

@hxdtest
Copy link

hxdtest commented Jan 15, 2025

Have you tested eval scripts? issue I can't reproduce Qwen/QwQ-32B-Preview accuracy on AIME with eval scripts? Can you reproduce Qwen/QwQ-32B-Preview accuracy on AIME with eval scripts?

@WentaoTan
Copy link
Author

We tried to replicate the performance of QwQ but failed because our resources could not support very long output and QwQ output was very long.

@richardliaw
Copy link

richardliaw commented Jan 15, 2025

Hi @WentaoTan, we're currently working on more extensive experiments on different model sizes and architectures, and have not tested on 70b yet.

Can you share some of the numbers you've gotten so far?

@caoshiyi caoshiyi added help wanted Extra attention is needed question Further information is requested labels Jan 17, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help wanted Extra attention is needed question Further information is requested
Projects
None yet
Development

No branches or pull requests

5 participants