[Feat] Add support for llava_hf video, better loading logic for llava_hf ckpt #260

kcz358 · 2024-09-16T06:51:53Z

This PR update the llava_hf to enable the evaluation of the llava_hf on new series of llava model such as llava-onevision and llava-next(stronger). This PR also enable the video evaluation using llava_hf.

Noted that the video evaluation is only supported using llava onevision hf and would possibly failed if you other version of llava. Since the newest transformers version has not released you have to do

pip install git+https://github.com/huggingface/transformers.git

to install the transformers version from source if you want to use llava onevision hf.

However, after experiment, I think the performance still has some significance difference compare to the original llava. Thus, you are not recommended to use this model to get results of original llava or llava-onevision. This model is only recommended to those that wish to have a quick baseline or have finetuned their model using llava-hf

kcz358 · 2024-09-16T07:11:24Z

Video result aligned for 0.5-ov

Add support for llava_hf video, better loading logic for llava_hf ckpt

2203f1a

Luodian approved these changes Sep 17, 2024

View reviewed changes

Luodian merged commit 9f8d1b4 into main Sep 17, 2024
2 checks passed

kcz358 deleted the dev/llava_hf branch October 22, 2024 07:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feat] Add support for llava_hf video, better loading logic for llava_hf ckpt #260

[Feat] Add support for llava_hf video, better loading logic for llava_hf ckpt #260

kcz358 commented Sep 16, 2024 •

edited

Loading

kcz358 commented Sep 16, 2024

[Feat] Add support for llava_hf video, better loading logic for llava_hf ckpt #260

[Feat] Add support for llava_hf video, better loading logic for llava_hf ckpt #260

Conversation

kcz358 commented Sep 16, 2024 • edited Loading

kcz358 commented Sep 16, 2024

kcz358 commented Sep 16, 2024 •

edited

Loading