
Thread Safety in llama.cpp #596

Open
martindevans opened this issue Mar 12, 2024 · 1 comment
Labels: Upstream (Tracking an issue in llama.cpp)

Comments

@martindevans (Member)

Tracking issue for thread safety in llama.cpp. The global inference lock in LLamaSharp can be removed once this is resolved.

ggerganov/llama.cpp#3960
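For context, a minimal sketch of the kind of process-wide lock this refers to (not LLamaSharp's actual code; `guarded_decode` is a hypothetical wrapper around llama.cpp's C API): because llama.cpp could not safely be called from multiple threads, every inference call is serialized behind a single mutex.

```cpp
#include <mutex>
#include "llama.h"

// Process-wide lock serializing all inference calls (the "global inference lock").
static std::mutex g_inference_lock;

// Hypothetical wrapper: callers go through here instead of calling
// llama_decode directly, so only one decode can run at a time,
// regardless of how many contexts or threads exist.
int guarded_decode(llama_context * ctx, llama_batch batch) {
    std::lock_guard<std::mutex> guard(g_inference_lock);
    return llama_decode(ctx, batch);
}
```

Once upstream thread safety lands, callers could invoke `llama_decode` concurrently on separate contexts and this serialization point could be dropped.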

@martindevans added the Upstream label on Mar 12, 2024
@zsogitbe (Contributor)

llama.cpp : add pipeline parallelism support #6017. Good news: it seems to be high priority and will probably be ready soon. Once this and the CUDA memory release bug fix are ready, please make a quick intermediate LLamaSharp release that integrates them. This is important.

ggerganov/llama.cpp#6017
