extract_lora : support tied embeddings #483

ngxson · 2025-01-08T16:48:50Z

Some small models like Llama 3.2 1B, 3B or Qwen 1.5B have tied embeddings, meaning token embeddings tensor and output tensor (lm_head) are the same.

Demo:

 mergekit-extract-lora \
  ngxson/MiniThinky-v2-1B-Llama-3.2 \
  meta-llama/Llama-3.2-1B-Instruct \
  lora_out --rank=16

cg123 · 2025-01-25T07:19:54Z

Thanks for the PR! I ended up implementing a more general solution that should also handle tied_names properly in #496 - hopefully this hits everything you need. If you have any more trouble with it let me know!

ngxson added 2 commits January 8, 2025 17:43

extract_lora : support tied embeddings

38e8cec

correct name

00243ac

jrruethe mentioned this pull request Jan 21, 2025

Handle weight aliases #490

Closed

8 tasks

cg123 closed this Jan 25, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

extract_lora : support tied embeddings #483

extract_lora : support tied embeddings #483

ngxson commented Jan 8, 2025 •

edited

Loading

cg123 commented Jan 25, 2025

extract_lora : support tied embeddings #483

extract_lora : support tied embeddings #483

Conversation

ngxson commented Jan 8, 2025 • edited Loading

cg123 commented Jan 25, 2025

ngxson commented Jan 8, 2025 •

edited

Loading