Fix QLoRA weights and bias initialisation #833
Conversation
Hi @caroteu,
I left some comments below. In general, I feel that setting strict=False when loading model checkpoints is a bit tricky (and error-prone). We should discuss the issues and find an elegant way to take care of this!
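One way to keep strict=False from silently swallowing checkpoint/model mismatches is to inspect the keys that did not match and fail loudly on anything unexpected. A minimal sketch (the model and checkpoint here are illustrative stand-ins, not the actual SAM/QLoRA code):

```python
import torch
import torch.nn as nn

model = nn.Linear(4, 2)  # stand-in for the real model
checkpoint = {"weight": torch.zeros(2, 4)}  # e.g. a checkpoint missing the bias

# load_state_dict with strict=False reports, rather than raises on, mismatches
result = model.load_state_dict(checkpoint, strict=False)

# Only tolerate parameters we explicitly expect to be absent; fail on the rest
allowed_missing = {"bias"}
assert set(result.missing_keys) <= allowed_missing, result.missing_keys
assert not result.unexpected_keys, result.unexpected_keys
```

This keeps the flexibility of a non-strict load while still surfacing genuinely wrong checkpoints.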
@anwai98
Hi @caroteu,
I merged the previous PR here. Can we briefly go over the comments below?
Does the inference for QLoRA-finetuned models work properly now?
I added docstrings to our new export function (and thanks to @caroteu for making an elegant solution out of this). This is good to go from my side. PS: @caroteu @constantinpape it would be nice to go over this and see if everything makes sense now!
This looks good to me. Feel free to merge.
@anwai98 This includes the QLoRA initialisation changes, and it also works for inference now. The changes that made the inference work are a bit hacky though, so we should think about a better/safer way to do this.
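For context, the step such an export typically performs is folding the low-rank update back into the frozen base weight, so the exported checkpoint is a plain state dict. A hedged sketch (function and variable names are illustrative, not the actual export API); note how the conventional zero initialisation of the LoRA up-projection means the merged weight equals the base weight at initialisation:

```python
import torch

def merge_lora_weight(base_weight, lora_a, lora_b, scale=1.0):
    """Return the dense weight with the low-rank update folded in:
    base_weight + scale * (lora_b @ lora_a)."""
    return base_weight + scale * (lora_b @ lora_a)

base = torch.eye(4)
lora_a = torch.randn(2, 4)  # rank-2 down-projection
lora_b = torch.zeros(4, 2)  # rank-2 up-projection, zero-initialised

merged = merge_lora_weight(base, lora_a, lora_b)
# With lora_b zero-initialised, the merge is a no-op at initialisation
assert torch.equal(merged, base)
```

After merging, the adapter tensors can be dropped and the result saved as an ordinary checkpoint, so inference needs no LoRA-aware loading code.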