Wrong aspect_ratio param in llava-onevision evaluation #5

Luodian · 2024-12-16T09:38:22Z

I think the param is not set correctly.

Line 68 in ee2e9f3

overwrite_config["image_aspect_ratio"] = "pad"

It should be "anyres_max_9", or be read directly from config.json.
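A minimal sketch of the "read it from config.json" option, in case it helps. The helper name `build_overwrite_config` and the `fallback` argument are hypothetical, and this assumes the checkpoint directory contains a config.json that stores the value under the `image_aspect_ratio` key; it is not the repo's actual loading code.

```python
import json
from pathlib import Path


def build_overwrite_config(model_dir, fallback="pad"):
    """Build overwrite_config, taking image_aspect_ratio from config.json if present.

    Hypothetical helper: assumes <model_dir>/config.json may define the
    "image_aspect_ratio" key; otherwise the fallback value is used.
    """
    config_path = Path(model_dir) / "config.json"
    aspect_ratio = fallback
    if config_path.is_file():
        with config_path.open(encoding="utf-8") as f:
            config = json.load(f)
        # Prefer the value shipped with the checkpoint over a hard-coded one.
        aspect_ratio = config.get("image_aspect_ratio", fallback)
    return {"image_aspect_ratio": aspect_ratio}
```

The returned dict could then stand in for the hard-coded assignment on line 68, so the evaluation follows whatever setting the checkpoint was released with.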

woodfrog · 2024-12-16T17:38:13Z

We followed your official tutorial for multi-image interleaved input. See the section on Image-Text Interleaved Input.

Do you have any insight into how this parameter setting affects performance? We can rerun with anyres_max_9 if the setting in the tutorial is not optimal.

woodfrog · 2024-12-16T17:42:31Z

I checked your git history. That padding config was introduced by this commit with the commit message "Fix tutorial error", so it seems to be an intentional change?
