Fix: Adding an LLM param to fix generator producing only "#####" characters #1519

naveenk2022 · 2024-01-17T15:06:40Z

The newer versions of llama-cpp-python do not have the parameter offload_kqv set to True. Fixing this in privateGPT is pretty straightforward. Instead of installing a lower version of llama-cpp-python, adding "offload_kqv": True to model_kwargs enables KGV offloading, and also significantly boosts GPU performance.

imartinez

Thanks for the fix!

…-ai#1519)

Fix: Adding an LLM param to fix broken generator

827b1c2

naveenk2022 changed the title ~~Fix: Adding an LLM param to fix broken generator~~ Fix: Adding an LLM param to fix generator producing only "#####" characters Jan 17, 2024

imartinez approved these changes Jan 17, 2024

View reviewed changes

imartinez merged commit 869233f into zylon-ai:main Jan 17, 2024
6 checks passed

github-actions bot mentioned this pull request Jan 17, 2024

chore(main): release 0.3.0 #1413

Merged

simonbermudez pushed a commit to simonbermudez/saimon that referenced this pull request Feb 24, 2024

fix: Adding an LLM param to fix broken generator from llamacpp (zylon…

16e30c4

…-ai#1519)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: Adding an LLM param to fix generator producing only "#####" characters #1519

Fix: Adding an LLM param to fix generator producing only "#####" characters #1519

naveenk2022 commented Jan 17, 2024

imartinez left a comment

Fix: Adding an LLM param to fix generator producing only "#####" characters #1519

Fix: Adding an LLM param to fix generator producing only "#####" characters #1519

Conversation

naveenk2022 commented Jan 17, 2024

imartinez left a comment

Choose a reason for hiding this comment