ModelMesh

The ModelMesh framework is a mature, general-purpose model serving management and routing layer designed for high-scale, high-density, and frequently changing model use cases. It works with existing or custom-built model servers and acts as a distributed LRU cache for serving runtime models.
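To make the LRU idea concrete, here is a minimal single-process sketch in Java. It is not ModelMesh's API or implementation: the class `LruModelCacheSketch`, its methods, and the string "models" are all hypothetical, and the real framework distributes this behavior across a cluster of serving runtimes. The sketch only shows the core policy: load a model on first use, and when capacity is exceeded, unload the least recently used one.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

/** Hypothetical single-process sketch of LRU model caching; not ModelMesh's API. */
public class LruModelCacheSketch {
    private final Function<String, String> loader;
    private final List<String> evicted = new ArrayList<>();
    private final LinkedHashMap<String, String> loaded;

    public LruModelCacheSketch(final int maxLoadedModels, Function<String, String> loader) {
        this.loader = loader;
        // accessOrder=true keeps entries ordered from least to most recently used.
        this.loaded = new LinkedHashMap<String, String>(16, 0.75f, true) {
            @Override
            protected boolean removeEldestEntry(Map.Entry<String, String> eldest) {
                boolean overCapacity = size() > maxLoadedModels;
                if (overCapacity) {
                    evicted.add(eldest.getKey()); // record which model was "unloaded"
                }
                return overCapacity;
            }
        };
    }

    /** Returns the model, loading it on a miss; a hit refreshes its recency. */
    public String get(String modelId) {
        String model = loaded.get(modelId);
        if (model == null) {
            model = loader.apply(modelId); // stand-in for loading into a model server
            loaded.put(modelId, model);    // may evict the least recently used entry
        }
        return model;
    }

    /** Checks residency without affecting recency. */
    public boolean isLoaded(String modelId) {
        return loaded.containsKey(modelId);
    }

    public List<String> evictedModelIds() {
        return evicted;
    }

    public static void main(String[] args) {
        LruModelCacheSketch cache = new LruModelCacheSketch(2, id -> "model:" + id);
        cache.get("a");
        cache.get("b");
        cache.get("a"); // "a" is now more recently used than "b"
        cache.get("c"); // capacity exceeded, so "b" is evicted
        System.out.println(cache.evictedModelIds()); // prints [b]
        System.out.println(cache.isLoaded("a"));     // prints true
    }
}
```

In this sketch capacity is a simple count of loaded models; a real serving layer would more plausibly weigh models by memory footprint and coordinate placement across instances.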

For full Kubernetes-based deployment and management of ModelMesh clusters and models, see the ModelMesh Serving repo. It includes a separate controller and provides Kubernetes custom-resource-based management of ServingRuntimes and InferenceServices, along with common, abstracted handling of model repository storage and ready-to-use integrations with some existing OSS model servers.

For more information on supported features and design details, see these charts.

Get Started

To learn more about the ModelMesh framework and get started with it, check out the documentation.

Developer guide

Use the developer guide to learn about development practices for the project.

kserve/modelmesh
