GitHub - darshpanchal/llm-server: Serving open source models of your choice in as a docker container using llama-cpp-python's OpenAI compatible server

llm-server

Simple Docker script to create an OpenAI compatible server using llama-cpp-python on port 8000.

Build the Docker Container

docker build -t llm-server .

Run the Docker container

docker run -d --name llmserver -p 8000:8000 -v $PWD/config:/home/config -v $PWD/models:/home/models llm-server

You can change config.json to change port or model information. Supports multiple models, check llama-cpp-python documentation.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
config		config
models		models
.gitattributes		.gitattributes
Dockerfile		Dockerfile
README.md		README.md