fix: use singleton in llama_cpp #1013

dartpain · 2024-06-25T13:38:09Z

What kind of change does this PR introduce? (Bug fix, feature, docs update, ...)
Why was this change needed? (You can also link to an open issue here)
Other information:

vercel · 2024-06-25T13:38:17Z

The latest updates on your projects.

Name	Status	Preview	Comments	Updated (UTC)
docs-gpt	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Jun 25, 2024 1:41pm
nextra-docsgpt	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Jun 25, 2024 1:41pm

codecov · 2024-06-25T13:41:45Z

Codecov Report

Attention: Patch coverage is 38.09524% with 13 lines in your changes missing coverage. Please review.

Project coverage is 21.83%. Comparing base (651eb33) to head (5aa8871).
Report is 7 commits behind head on main.

Files	Patch %	Lines
application/llm/llama_cpp.py	38.09%	13 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1013      +/-   ##
==========================================
+ Coverage   21.69%   21.83%   +0.14%     
==========================================
  Files          80       80              
  Lines        3632     3645      +13     
==========================================
+ Hits          788      796       +8     
- Misses       2844     2849       +5

starkgate · 2024-06-25T15:17:36Z

@dartpain Looks good, thanks! Much cleaner than my proposal, too. This will be a great improvement for many people.

fix: use singleton

ce56a41

vercel bot deployed to Preview – nextra-docsgpt June 25, 2024 13:38

github-actions bot added the application label Jun 25, 2024

dartpain linked an issue Jun 25, 2024 that may be closed by this pull request

🐛 Bug Report: Local model is recreated for each request, causing delay and out of memory errors #945

Closed

2 tasks

refactor: Add thread lock

5aa8871
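
The diff for these two commits is not reproduced in this thread. As a rough illustration of the idea named in their titles (a singleton guarded by a thread lock, so the local model is not recreated for each request as reported in #945), here is a minimal sketch. `FakeModel`, `ModelSingleton`, `get_instance`, `handle_request`, and the `model.gguf` path are all hypothetical names chosen for the example; they are not taken from `application/llm/llama_cpp.py`, and the real change may be structured differently.

```python
import threading


class FakeModel:
    """Hypothetical stand-in for an expensive-to-load local model."""

    load_count = 0  # counts how many times the "model" was constructed

    def __init__(self, path):
        FakeModel.load_count += 1
        self.path = path


class ModelSingleton:
    """Holds at most one shared model instance per process (assumed design)."""

    _instance = None
    _lock = threading.Lock()

    @classmethod
    def get_instance(cls, path):
        # Fast path: skip the lock once the model already exists.
        if cls._instance is None:
            with cls._lock:
                # Re-check inside the lock so only one thread builds the model.
                if cls._instance is None:
                    cls._instance = FakeModel(path)
        return cls._instance


def handle_request(results, path="model.gguf"):
    # Each simulated request asks for the shared model instead of building one.
    results.append(ModelSingleton.get_instance(path))


# Simulate several concurrent requests hitting the same process.
results = []
threads = [threading.Thread(target=handle_request, args=(results,)) for _ in range(8)]
for t in threads:
    t.start()
for t in threads:
    t.join()
```

In this sketch the second `is None` check inside the lock is what keeps two threads that arrive at the same time from both constructing the model; without it, the lock would only serialize the duplicate loads rather than prevent them.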

vercel bot deployed to Preview – docs-gpt June 25, 2024 13:41

vercel bot deployed to Preview – nextra-docsgpt June 25, 2024 13:42

dartpain mentioned this pull request Jun 25, 2024 in issue #945 (🐛 Bug Report: Local model is recreated for each request, causing delay and out of memory errors; Closed)

dartpain requested a review from pabik June 25, 2024 16:39

pabik approved these changes Jun 25, 2024

dartpain merged commit 2985e3b into main Jun 25, 2024
16 checks passed

dartpain deleted the fix/singleton-llama-cpp branch June 25, 2024 17:25
