
Inference Speed #47

Open
vefalun opened this issue Mar 5, 2025 · 1 comment

vefalun commented Mar 5, 2025

Hello, I would like to ask: roughly what inference speed can the model reach after deployment?

Thanks for your great work!

XMHZZ2018 (Contributor) commented

@vefalun

Thanks for your interest in our work! The inference speed largely depends on the data (e.g., image resolution) and the available computing resources (for batching the inputs). Additionally, we have several versions of VLM2Vec, each with a different number of parameters. For reference, on MMEB-eval, a 7B model takes about 10 ~ 20 GPU hours on an H100. Our model is also integrated into vLLM, which I believe further enhances inference speed. Let me know if this answers your question!
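For anyone benchmarking this themselves, here is a minimal sketch of batched embedding inference through vLLM's pooling API. It assumes a recent vLLM release that supports `task="embed"` and uses a placeholder checkpoint ID (`TIGER-Lab/VLM2Vec-Full`); it also only shows text-side inputs, so check the repo README for the exact model ID, prompt format, and how to pass images:

```python
# Minimal sketch: batched embedding inference with vLLM's pooling API.
# Assumptions: a recent vLLM version with task="embed" support, and a
# placeholder VLM2Vec checkpoint ID -- adjust both to your setup.
from vllm import LLM

llm = LLM(model="TIGER-Lab/VLM2Vec-Full", task="embed")  # assumed checkpoint ID

prompts = [
    "Represent the given caption for retrieval: a dog running on the beach",
    "Represent the given caption for retrieval: a bowl of fresh fruit",
]

# vLLM batches the requests internally; each output carries one embedding vector.
outputs = llm.embed(prompts)
for prompt, out in zip(prompts, outputs):
    vec = out.outputs.embedding
    print(f"{prompt[:40]}... -> {len(vec)}-dim embedding")
```

Throughput will still depend on image resolution, batch size, and the GPU, as noted above, so this sketch is only a starting point for measuring speed on your own data.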
