docs: update vision/clip.md
danbev committed Jan 20, 2025
1 parent 79f1694 commit 9c281cb
Showing 1 changed file with 4 additions and 0 deletions.
4 changes: 4 additions & 0 deletions notes/vision/clip.md
@@ -122,6 +122,10 @@ textual representations."
CLIP and ViT are not the same thing as I understand it. CLIP, which stands for
contrastive language-image pretraining, can use a vision transformer to process
the images, but CLIP itself is the complete concept of the training process.
At inference time, like what is being worked on in llama.cpp, it is the ViT
component of CLIP that is being used, and perhaps using clip as the prefix for
methods/structs/tensors can be a little misleading. Would this be better off
named something like `vit` or `vision_model`, `vision_layer`, etc.?
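
To make the distinction concrete for myself, here is a minimal sketch of the
contrastive objective that CLIP training applies to paired image/text
embeddings. This is not code from llama.cpp or from any CLIP implementation;
the function names are hypothetical and the temperature of 0.07 is an
assumption used only for illustration. The image embeddings would come from
the vision encoder (for example a ViT) and the text embeddings from a separate
text encoder.

```python
import math


def l2_normalize(v):
    # Scale a vector to unit length so dot products become cosine similarities.
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def cosine_logits(image_embs, text_embs, temperature=0.07):
    # logits[i][j] = cosine similarity of image i and text j, divided by temperature.
    imgs = [l2_normalize(v) for v in image_embs]
    txts = [l2_normalize(v) for v in text_embs]
    return [
        [sum(a * b for a, b in zip(img, txt)) / temperature for txt in txts]
        for img in imgs
    ]


def cross_entropy(row, target):
    # Numerically stable -log(softmax(row)[target]).
    m = max(row)
    log_sum_exp = m + math.log(sum(math.exp(x - m) for x in row))
    return log_sum_exp - row[target]


def clip_contrastive_loss(image_embs, text_embs, temperature=0.07):
    # Symmetric loss: image i should match text i, and text i should match image i.
    logits = cosine_logits(image_embs, text_embs, temperature)
    n = len(logits)
    image_to_text = sum(cross_entropy(logits[k], k) for k in range(n)) / n
    columns = [[logits[r][c] for r in range(n)] for c in range(n)]
    text_to_image = sum(cross_entropy(columns[k], k) for k in range(n)) / n
    return (image_to_text + text_to_image) / 2


# Toy embeddings (assumed values): pair k of images and texts belongs together.
aligned = clip_contrastive_loss([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
swapped = clip_contrastive_loss([[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]])
print(aligned < swapped)
```

Only the training objective needs both encoders. Once training is done, the
inference path described above only has to run the image encoder to produce
the image embeddings, which is the part that is really a ViT.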

### CLIP image preprocessing
