Move quantization after LoRA surgery #828
Conversation
```python
linear_q4bit.bias = torch.nn.Parameter(bias_data)

# Replace the original linear layer with the quantized one
setattr(parent_module, layer_name, linear_q4bit)
```
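The snippet above replaces a float linear layer with a quantized one in place. A minimal sketch of what the full replacement could look like, assuming bitsandbytes as the 4-bit backend; the helper name `replace_with_4bit` and the `nf4` quantization type are our assumptions, not taken from this PR:

```python
import torch
import torch.nn as nn
import bitsandbytes as bnb

def replace_with_4bit(parent_module: nn.Module, layer_name: str) -> None:
    """Swap a float nn.Linear for a bitsandbytes 4-bit linear, in place."""
    linear = getattr(parent_module, layer_name)

    linear_q4bit = bnb.nn.Linear4bit(
        linear.in_features,
        linear.out_features,
        bias=linear.bias is not None,
        compute_dtype=torch.float16,
        quant_type="nf4",  # assumption: NF4 quantization, as in QLoRA
    )
    # Carry the pretrained weights over; bitsandbytes performs the actual
    # quantization when the module is moved to a CUDA device.
    linear_q4bit.weight = bnb.nn.Params4bit(
        linear.weight.data, requires_grad=False, quant_type="nf4"
    )
    if linear.bias is not None:
        linear_q4bit.bias = torch.nn.Parameter(linear.bias.data)

    # Replace the original linear layer with the quantized one.
    setattr(parent_module, layer_name, linear_q4bit)
```

Because the quantization only happens on the device move, the replacement itself is cheap and can be applied across the whole model before transferring it to the GPU.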
Hi @caroteu,

I see. I checked it out, and I realized there were some minor details I missed as well (e.g. quantizing only the linear layers for the A and B matrices). With the latest updates, training with mixed precision no longer works for me (let me know if that's not the case for you). Regarding the inference loading scripts, we should probably discuss them on Monday; I am a bit skeptical about the current additions!
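For the detail mentioned above (quantizing only the linear layers of the A and B matrices), a sketch of how those layers could be selected, assuming the LoRA surgery registers them as `nn.Linear` children under attribute names like `lora_a`/`lora_b` (hypothetical names; the actual implementation may register them differently):

```python
import torch.nn as nn

# Hypothetical attribute names for the LoRA factors; the actual surgery
# code may register them under different names.
LORA_ATTRS = ("lora_a", "lora_b")

def iter_lora_linears(model: nn.Module):
    """Yield (parent_module, attr_name) for every LoRA A/B linear layer."""
    for module in model.modules():
        for attr, child in module.named_children():
            if isinstance(child, nn.Linear) and attr.lower() in LORA_ATTRS:
                yield module, attr
```

Each yielded pair can then be handed to a replacement helper such as the `replace_with_4bit` sketch above.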
Okay, I think I figured it out. The issue is that the modules used in the forward and backward passes have to align with each other, and the safest way to ensure this is to use the right modules in the forward pass.

And thanks to @caroteu, we have a remarkable improvement in memory efficiency now! 🎉 I'll push my commits after confirming a few epochs!

PS: We should still discuss the inference changes you made, because I think we don't need them. As mentioned, let's talk about the details on Monday!
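A sketch of what this alignment constraint means in practice, under the assumption that LoRA is implemented as a wrapper module (class and attribute names here are illustrative, not from this PR): when the forward pass routes through the registered submodules, the graph autograd records for the backward pass uses exactly the same modules.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Illustrative LoRA wrapper; names are assumptions, not from this PR.

    Both branches of the forward pass go through registered submodules,
    so forward and backward traverse the same graph: the frozen (possibly
    quantized) base layer plus the trainable low-rank path.
    """

    def __init__(self, base: nn.Module, in_features: int, out_features: int, rank: int):
        super().__init__()
        self.base = base                                          # e.g. a 4-bit linear, frozen
        self.lora_a = nn.Linear(in_features, rank, bias=False)    # trainable
        self.lora_b = nn.Linear(rank, out_features, bias=False)   # trainable

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.lora_b(self.lora_a(x))
```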
I'll drop this, since we have cleared up a few points of confusion. We will finalize QLoRA in another PR!
@anwai98 I noticed that with the way we were doing quantization, we did not quantize the LoRA matrices (the A and B matrices). Was that intended? If not, this should be fixed by applying the quantization after the LoRA surgery.

I also added code to initialize the quantized model for inference.
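Putting the pieces together, a sketch of the proposed order of operations and of the inference-side initialization, reusing the hypothetical helpers from the snippets above (`add_lora_layers` is a placeholder for whatever the actual surgery function is):

```python
from typing import Optional

import torch
import torch.nn as nn

def build_qlora_model(
    base_model: nn.Module,
    rank: int = 8,
    checkpoint_path: Optional[str] = None,
    device: str = "cuda",
) -> nn.Module:
    """Hypothetical composition: LoRA surgery first, quantization second."""
    add_lora_layers(base_model, rank=rank)        # 1. LoRA surgery (placeholder name)
    for parent, attr in iter_lora_linears(base_model):
        replace_with_4bit(parent, attr)           # 2. quantize after the surgery
    base_model.to(device)                         # bitsandbytes quantizes on the device move

    if checkpoint_path is not None:
        # For inference: build the same quantized structure *before*
        # loading the trained state, so keys and shapes match.
        state = torch.load(checkpoint_path, map_location=device)
        base_model.load_state_dict(state)
    return base_model
```

Doing the surgery before quantization means the A/B linear layers already exist when the quantization pass walks the model, which is the fix proposed here; for inference, the same construction has to run before `load_state_dict` so the quantized parameter layout matches the checkpoint.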