Exllamav2 lora support #4229

Merged: 9 commits merged from the exllamav2_lora branch into oobabooga:main on Oct 14, 2023

Conversation

Ph0rk0z (Contributor) commented Oct 8, 2023:

Doesn't work for multi-GPU because of a kernel bug, but it does work on single-card models. Tested with Llama 7B.

Somehow multiple LoRAs are possible, but I'm not sure how to make that work.

Ph0rk0z (Contributor, Author) commented Oct 9, 2023:

It's working for 70B now.

oobabooga (Owner) commented:

Nice. I'm waiting for an exllamav2 0.0.6 release to merge this.

oobabooga (Owner) commented:

It should be working for multiple LoRAs now (if you find any issue, please let me know).
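
For context, the sketch below shows roughly how LoRA adapters are attached through the exllamav2 Python API, the library this PR builds on. It is a minimal sketch, not the PR's actual loader code: the model/adapter paths, the prompt, and the sampler values are placeholders, and the exact call signatures (e.g. the `loras=` keyword) are assumed from exllamav2 releases around 0.0.6 and may differ between versions.

```python
# Minimal sketch of attaching LoRA adapters via the exllamav2 API (~v0.0.6).
# Model/adapter paths, the prompt, and sampler settings are placeholders.
from exllamav2 import (
    ExLlamaV2,
    ExLlamaV2Config,
    ExLlamaV2Cache,
    ExLlamaV2Tokenizer,
    ExLlamaV2Lora,
)
from exllamav2.generator import ExLlamaV2BaseGenerator, ExLlamaV2Sampler

config = ExLlamaV2Config()
config.model_dir = "/models/llama2-70b-exl2"  # placeholder path
config.prepare()

model = ExLlamaV2(config)
model.load()

tokenizer = ExLlamaV2Tokenizer(config)
cache = ExLlamaV2Cache(model)
generator = ExLlamaV2BaseGenerator(model, cache, tokenizer)

# Load adapters that were trained against the same base model.
lora_a = ExLlamaV2Lora.from_directory(model, "/loras/adapter-a")  # placeholder path
lora_b = ExLlamaV2Lora.from_directory(model, "/loras/adapter-b")  # placeholder path

settings = ExLlamaV2Sampler.Settings()
settings.temperature = 0.7

# Adapters are passed per generation call; a list applies several LoRAs at once.
output = generator.generate_simple(
    "Hello, my name is", settings, 64, loras=[lora_a, lora_b]
)
print(output)
```

In text-generation-webui the adapters are selected through the UI's LoRA menu rather than in code; the sketch only illustrates what the underlying library expects.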

oobabooga merged commit 8cce1f1 into oobabooga:main on Oct 14, 2023.
noiraku commented Oct 14, 2023:

> It should be working for multiple LoRAs now (if you find any issue, please let me know)

Mistral + LoRA works with exllamav2. This fixes #4272, thank you!
(ExLlamaV1 still gives the same error, but there's no need for it now that v2 is faster.)

noiraku mentioned this pull request on Oct 14, 2023.
Ph0rk0z deleted the exllamav2_lora branch on October 16, 2023.