
ValueError: Unknown vision tower: /share/junjie/shuyan/clip-vit-large-patch14-336 #16

Open

wwpu opened this issue Nov 27, 2024 · 6 comments

@wwpu commented Nov 27, 2024

Traceback (most recent call last):
  File "/home/LLM/videoxl/videoxl/infer.py", line 17, in <module>
    tokenizer, model, image_processor, _ = load_pretrained_model(model_path, None, "llava_qwen", device_map="cuda:0")
  File "/home/LLM/videoxl/videoxl/videoxl/model/builder.py", line 215, in load_pretrained_model
    model = LlavaQwenForCausalLM.from_pretrained(model_path, low_cpu_mem_usage=True, attn_implementation=attn_implementation, **kwargs)
  File "/home/LLM/videoxl/videoxl/videoxl/model/language_model/llava_qwen.py", line 1498, in from_pretrained
    model, loading_info = super().from_pretrained(*args, **kwargs)
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3404, in from_pretrained
    model = cls(config, *model_args, **model_kwargs)
  File "/home/LLM/videoxl/videoxl/videoxl/model/language_model/llava_qwen.py", line 1466, in __init__
    self.model = LlavaQwenModel(config)
  File "/home/LLM/videoxl/videoxl/videoxl/model/language_model/llava_qwen.py", line 1454, in __init__
    super(LlavaQwenModel, self).__init__(config)
  File "/home/LLM/videoxl/videoxl/videoxl/model/llava_arch.py", line 40, in __init__
    self.vision_tower = build_vision_tower(config, delay_load=delay_load)
  File "/home/LLM/videoxl/videoxl/videoxl/model/multimodal_encoder/builder.py", line 23, in build_vision_tower
    raise ValueError(f"Unknown vision tower: {vision_tower}")
ValueError: Unknown vision tower: /share/junjie/shuyan/clip-vit-large-patch14-336

@shuyansy (Collaborator) commented

Hi, you can download the weights here and change the path to yours:
https://huggingface.co/openai/clip-vit-large-patch14-336
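
For reference, a minimal sketch of fetching the weights with huggingface_hub and printing where they land (the local_dir value is a placeholder, not a required location):

```python
from huggingface_hub import snapshot_download

# Fetch the CLIP vision tower; the return value is the local directory
# containing the weights. Without local_dir, files go into the Hugging Face
# cache (~/.cache/huggingface/hub by default) and the returned path points there.
local_path = snapshot_download(
    repo_id="openai/clip-vit-large-patch14-336",
    local_dir="/path/to/clip-vit-large-patch14-336",  # placeholder path
)
print(local_path)  # point the vision tower config at this directory
```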

@wwpu (Author) commented Dec 24, 2024

> Hi, you can download the weights here and change the path to yours: https://huggingface.co/openai/clip-vit-large-patch14-336

Thank you. I have downloaded it, but I can't find where to set the path to this model.

@zzz130981 commented

Hello, did you find out how to change the path?

@zzz130981 commented

> Thank you. I have downloaded it, but I can't find where to set the path to this model.

Maybe you can create the /share/junjie/shuyan directory manually, although it is not elegant.
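
A rough sketch of that workaround (assumes the weights were already downloaded somewhere locally; both paths below are placeholders):

```python
import os

# Recreate the hard-coded directory and link the downloaded weights into it,
# so the baked-in /share/junjie/shuyan/clip-vit-large-patch14-336 path resolves.
os.makedirs("/share/junjie/shuyan", exist_ok=True)
os.symlink(
    "/path/to/your/clip-vit-large-patch14-336",  # where you downloaded the weights
    "/share/junjie/shuyan/clip-vit-large-patch14-336",
)
```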

@wwpu (Author) commented Dec 26, 2024

> Maybe you can create the /share/junjie/shuyan directory manually, although it is not elegant.

Thank you. I manually modified the vision tower path in "multimodal_encoder/builder.py".
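
For anyone landing here, a sketch of what such an edit might look like, assuming builder.py follows the usual LLaVA pattern (the actual file may differ; the import and the local path are placeholders):

```python
import os

from .clip_encoder import CLIPVisionTower  # import path as in the usual LLaVA layout


def build_vision_tower(vision_tower_cfg, **kwargs):
    # Hypothetical override: ignore the path baked into the checkpoint config
    # and use a local copy of the CLIP weights instead.
    vision_tower = "/path/to/clip-vit-large-patch14-336"  # placeholder path
    if os.path.exists(vision_tower) or vision_tower.startswith("openai"):
        return CLIPVisionTower(vision_tower, args=vision_tower_cfg, **kwargs)
    raise ValueError(f"Unknown vision tower: {vision_tower}")
```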

TITC added a commit to TITC/Video-XL that referenced this issue on Jan 3, 2025
@TITC mentioned this issue on Jan 3, 2025
@DaozeZhang commented

> Hello, did you find out how to change the path?

You can add config.mm_vision_tower = 'local_path' at line 1468 in llava_qwen.py.
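
As a sketch, that override would sit just before the inner model is built (placement inferred from the traceback above; the path is a placeholder, not a real location):

```python
# Inside LlavaQwenForCausalLM.__init__ in llava_qwen.py, around the lines the
# traceback points at: override the checkpoint's vision tower path before the
# inner model (and hence the vision tower) is constructed.
config.mm_vision_tower = "/path/to/clip-vit-large-patch14-336"  # placeholder
self.model = LlavaQwenModel(config)  # build_vision_tower now sees the local path
```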
