Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature]: Model-as-a-Service with AWS Bedrock #1631

Open
10 tasks
jvstme opened this issue Aug 27, 2024 · 0 comments
Open
10 tasks

[Feature]: Model-as-a-Service with AWS Bedrock #1631

jvstme opened this issue Aug 27, 2024 · 0 comments
Labels

Comments

@jvstme
Copy link
Collaborator

jvstme commented Aug 27, 2024

Problem

Some users need to work with both custom and off-the-shelf language models. Deploying off-the-shelf models with dstack may be less convenient and financially viable than using platforms such as AWS Bedrock or Vertex AI, which provide models as a service. In addition, some proprietary models are only available through MaaS platforms. This results in users having to switch back and forth between dstack and MaaS platforms.

Solution

Add support for MaaS platforms starting with AWS Bedrock. Add new router configurations that will allow Bedrock models to be exposed through the dstack-gateway OpenAI-compatible API.

Workaround

Users can use MaaS platforms directly or use proxy solutions to bring MaaS models and models deployed by dstack into a single interface.

Implementation Steps

  • Storing data about models in dstack-gateway
  • Internal API for model management in dstack-gateway
  • Bedrock implementation in dstack-gateway
  • MaaS-specific settings in backend configuration
  • Storing data about models and routers in dstack-server
  • dstack-server routers API
  • Routers Python API
  • dstack apply router configurations
  • dstack router list and dstack router delete commands
  • Documentation
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant