
How to run Qwen using Executorch? #7467

Open
Arya-Hari opened this issue Jan 2, 2025 · 3 comments
Assignees
SS-JIA
Labels
module: llm LLM examples and apps, and the extensions/llm libraries

Comments

@Arya-Hari

📚 The doc issue

Hi! I just wanted to ask how I would go about running Qwen using ExecuTorch. I was able to create the .pte file for Qwen. The example for Llama has a step 'Create a llama runner for Android'. Do we have to do something similar for Qwen by creating a custom runner? Also, the Qwen repository on the Hugging Face Hub does not have a 'tokenizer.model' file, but the Llama example requires it for running inference using the adb shell. How do I work around this?
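
On the tokenizer point: Qwen's Hugging Face repos ship a byte-level BPE tokenizer (tokenizer.json plus vocab/merges files) rather than a sentencepiece tokenizer.model, so the tokenizer itself loads fine through transformers. A minimal sketch, assuming any Qwen repo id (the one below is just an example, not a repo named in this issue):

```python
# Minimal sketch: load Qwen's tokenizer from the Hugging Face Hub.
# "Qwen/Qwen2.5-0.5B-Instruct" is an example repo id, not one from this thread.
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-0.5B-Instruct")
ids = tok.encode("Hello, Qwen!")  # token ids for the prompt
print(ids)
print(tok.decode(ids))            # round-trip back to text
```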

Suggest a potential alternative/fix

No response

@kimishpatel
Contributor

I don't know the details of how to run Qwen, or whether there is any significant difference compared to Llama as far as the model's interface is concerned.

Also, when you say you were able to export the model, can you detail the steps you took? If you can run the exported Qwen model using https://github.com/pytorch/executorch/blob/main/examples/models/llama/runner/eager.py#L103, then it is highly likely that you can run it via the C++ runner. But you do need a tokenizer, so I am not sure how HF runs this model.
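
As a hedged sanity check (an assumption, not a step from this thread): once you have a .pte, you can try loading and executing it with ExecuTorch's Python runtime bindings before touching any C++ runner. The file name and input shape below are hypothetical, and the exact input signature depends on how the model was exported (e.g. with or without a KV cache):

```python
# Minimal sketch: load an exported .pte with ExecuTorch's Python runtime
# and run a single forward call. "qwen.pte" is a hypothetical path.
import torch
from executorch.runtime import Runtime

runtime = Runtime.get()
program = runtime.load_program("qwen.pte")
method = program.load_method("forward")

# A single prefill step: token ids shaped (batch, seq_len). The real input
# signature depends on the export configuration (KV cache, dtypes, etc.).
tokens = torch.tensor([[1, 2, 3]], dtype=torch.long)
outputs = method.execute([tokens])
print(outputs[0].shape)  # logits, if the export matches this signature
```

If this executes and produces logits of the expected shape, what remains is tokenization and the decode loop, which is what the llama_runner binary provides for Llama.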

@kimishpatel kimishpatel added the module: llm LLM examples and apps, and the extensions/llm libraries label Jan 3, 2025
@SS-JIA SS-JIA moved this to To triage in ExecuTorch DevX improvements Jan 6, 2025
@SS-JIA SS-JIA self-assigned this Jan 6, 2025
@SS-JIA
Contributor

SS-JIA commented Jan 6, 2025

@Arya-Hari for some more context, the llama_runner binary used in our examples is heavily tailored to the Llama model architecture. So, as Kimish mentioned, depending on how Qwen's interface compares to Llama's, you may not be able to re-use the llama_runner binary. If you are familiar with the interface of the model, the best way would be to fork or modify the llama_runner binary for the Qwen model, essentially creating a custom runner as you mentioned.

@mergennachin
Contributor

@guangy10, are there guidelines on how to leverage the recent Hugging Face (huggingface/transformers#32253, huggingface/transformers#34102) and Optimum integrations (https://huggingface.co/docs/optimum/main/en/exporters/executorch/usage_guides/export_a_model)?
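
For reference, the Optimum export guide linked above sketches a path roughly like the following. This is a hedged sketch based on that guide, not an instruction from this thread: the model id and "xnnpack" recipe are example values, and whether Qwen was supported by optimum-executorch at the time is an assumption.

```python
# Sketch of the optimum-executorch path from the linked usage guide.
# The model id is an example; Qwen support here is an assumption.
from transformers import AutoTokenizer
from optimum.executorch import ExecuTorchModelForCausalLM

model_id = "Qwen/Qwen2.5-0.5B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)

# from_pretrained exports the model to ExecuTorch on the fly when no .pte
# is already present; "xnnpack" is the CPU recipe shown in the guide.
model = ExecuTorchModelForCausalLM.from_pretrained(model_id, recipe="xnnpack")

generated = model.text_generation(
    tokenizer=tokenizer,
    prompt="Simply put, the theory of relativity states that",
    max_seq_len=64,
)
print(generated)
```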
