[LoRA] Adds support for bias in LoRA #5733
base: main
Conversation
Could we add an argument to the engine, enable_lora_bias, and avoid initializing the bias tensors if it's false? If the user knows none of their LoRAs will have bias, we can save memory.
@Yard1 Thanks for reviewing the PR. I have added the enable_lora_bias flag (default set to false), which prevents the allocation of the LoRA bias tensors when it is false.
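For illustration, a minimal sketch of how the flag might be used from the offline entrypoint, assuming enable_lora_bias is plumbed through the engine arguments the same way as enable_lora (the model name and adapter path below are placeholders):

```python
from vllm import LLM
from vllm.lora.request import LoRARequest

# enable_lora_bias defaults to False, so no bias tensors are allocated
# unless the user opts in (flag name taken from this thread).
llm = LLM(
    model="meta-llama/Llama-2-7b-hf",  # assumption: any LoRA-capable base model
    enable_lora=True,
    enable_lora_bias=True,
)
outputs = llm.generate(
    "Hello, my name is",
    lora_request=LoRARequest("bias-adapter", 1, "/path/to/bias_adapter"),
)
print(outputs[0].outputs[0].text)
```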
Related: #5930
Looks good, can we also add an e2e test?
@Yard1 Thanks for reviewing. I've added an e2e test for the lora_bias support.
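A hedged sketch of what such an e2e test might look like; lora_bias_files is a hypothetical fixture pointing at an adapter tuned with bias, and enable_lora_bias is the flag discussed above:

```python
import vllm
from vllm.lora.request import LoRARequest

MODEL_PATH = "meta-llama/Llama-2-7b-hf"  # assumption: base model the adapter fits

def test_lora_bias(lora_bias_files):  # hypothetical fixture: path to a bias-tuned adapter
    llm = vllm.LLM(
        MODEL_PATH,
        enable_lora=True,
        enable_lora_bias=True,  # flag introduced by this PR
        max_lora_rank=8,
    )
    prompts = ["What is the capital of France?"]
    base_out = llm.generate(prompts)
    lora_out = llm.generate(
        prompts,
        lora_request=LoRARequest("bias-adapter", 1, lora_bias_files),
    )
    # With bias tensors applied, the adapter should change the generation
    # relative to the base model.
    assert base_out[0].outputs[0].text != lora_out[0].outputs[0].text
```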
@followumesh you need to run
@followumesh apologies, this needs another conflict resolution!
@@ -64,6 +64,64 @@ def dec(*args, **kwargs):
    return dec


def apply_bias(
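For context, a hedged reconstruction of what the new apply_bias helper might do; the signature and body below are illustrative guesses based on this thread, not the PR's actual code:

```python
import torch

def apply_bias(
    indices: torch.Tensor,       # hypothetical: per-token LoRA slot index
    output: torch.Tensor,        # hypothetical: (num_tokens, hidden) activations
    bias_stacked: torch.Tensor,  # hypothetical: (max_loras, hidden) stacked biases
) -> torch.Tensor:
    # Tokens without an active adapter are conventionally marked with -1;
    # mask them out so only adapter-served tokens receive a bias.
    valid = indices != -1
    output[valid] += bias_stacked[indices[valid]]
    return output
```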
Would it be possible to add this inside PunicaWrapper.add_lora()? There could be an optional bias argument in add_lora(), and then the logic of testing whether the bias is None and doing the index computation could be moved inside this function. It seems to me that it would eliminate repeated code lines in this file, but I don't know all the details. A sketch of the suggested shape is below.
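The signatures here are illustrative (the real add_lora takes more arguments, and the index tensor name is an assumption):

```python
from typing import Optional
import torch

class PunicaWrapper:  # sketch only; the real class lives in vllm/lora/punica.py
    def add_lora(
        self,
        y: torch.Tensor,             # output activations, updated in place
        x: torch.Tensor,             # input activations
        wa_t_all: torch.Tensor,      # stacked lora_A weights
        wb_t_all: torch.Tensor,      # stacked lora_B weights
        scale: float,
        bias_all: Optional[torch.Tensor] = None,  # proposed optional bias stack
    ) -> None:
        # ... existing shrink/expand LoRA matmuls ...
        if bias_all is not None:
            # The None check and index computation live here once, instead of
            # being repeated at every call site.
            indices = self.token_lora_indices   # assumed per-token index tensor
            mask = indices != -1
            y[mask] += bias_all[indices[mask]]
```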
@maxdebayser I don't have a strong opinion, but this was to keep the changes out of the punica wrapper, since they are not directly related to punica.
vllm/lora/utils.py (Outdated)
assert parts[0] == "base_model"
assert parts[1] == "model"
if parts[-1] == "weight":
    assert parts[-2] == "lora_A" or parts[-2] == "lora_B"
This assertion is failing for a couple of lora_modules that we have:
>>> lora_modules[0].split(".")
['base_model', 'model', 'lm_head', 'weight']
>>> lora_modules[1].split(".")
['base_model', 'model', 'model', 'embed_tokens', 'weight']
Still investigating if this would have thrown an error with the previous code as well...
@followumesh actually it looks like this was a bad merge; it's reverting a recent change that was made to improve the error message.
@prashantgupta24 Can you point to a lora module I can test with?
Let me try to find one. Also, reverting this change gives me the error:
ValueError: base_model.model.lm_head.weight is unsupported LoRA weight
To summarize: since vLLM expects only LoRA weights in the safetensors file, it was actually an error in our LoRA adapter that it contained lm_head. But I think the original error message is being reverted by this change. Ideally, the error should be
ValueError: base_model.model.lm_head.weight is unsupported LoRA weight
Instead, an AssertionError from assert parts[-2] == "lora_A" or parts[-2] == "lora_B" is now being raised, which doesn't provide the right detail.
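For reference, a hedged sketch of the error-reporting pattern being restored; parse_fine_tuned_lora_name is the real helper in vllm/lora/utils.py, but the body below is illustrative rather than the exact upstream code:

```python
def parse_fine_tuned_lora_name(name: str) -> tuple[str, bool]:
    """Map a checkpoint tensor name to (module_name, is_lora_a). Sketch only."""
    parts = name.split(".")
    if parts[-1] == "weight" and parts[-2] in ("lora_A", "lora_B"):
        # e.g. base_model.model.layers.0.self_attn.q_proj.lora_A.weight
        return ".".join(parts[2:-2]), parts[-2] == "lora_A"
    if parts[-1] in ("lora_embedding_A", "lora_embedding_B"):
        return ".".join(parts[2:-1]), parts[-1] == "lora_embedding_A"
    # Unsupported tensors (e.g. base_model.model.lm_head.weight) get a
    # descriptive error instead of a bare AssertionError.
    raise ValueError(f"{name} is unsupported LoRA weight")
```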
Yes, @followumesh, I think the changes here are not actually related to the main PR changes; could you revert them (and any other changes in the same category)?
@prashantgupta24 Can you check now?
Yeah this looks better. I'll let you know when I get a chance to test it!
Motivation
PEFT-based tooling such as https://github.com/foundation-model-stack/fms-hf-tuning includes support for tuning a LoRA bias. This PR enables bias for LoRA, so adapters trained with bias will work with vLLM.
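Concretely, a hedged sketch of the forward pass this enables; shapes and names are illustrative, not vLLM's actual kernels:

```python
import torch

def lora_forward(x, base_weight, lora_a, lora_b, lora_bias, scale=1.0):
    y = x @ base_weight.T                        # frozen base projection
    y = y + scale * ((x @ lora_a.T) @ lora_b.T)  # low-rank LoRA update
    if lora_bias is not None:
        y = y + lora_bias                        # per-adapter bias (this PR)
    return y
```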
Changes Included