🚀 Feature

We should enable serving a model through a spec, without having to implement it manually in decode_request and encode_response. A spec (there could be more than one) would:

- expose a route
- implement specific ways of decoding requests and encoding responses
- require the API to expose certain kinds of information (e.g. tokens used)

in a way that is pluggable at the LitServer level (spec=OpenAISpec) and independent from the API implementation itself. A sketch of what such a spec interface could look like follows.
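A minimal sketch of the pluggable interface this implies. The LitSpec name and its hooks are hypothetical, not an existing LitServe API; the point is that the route and codec logic travel together, separate from the model code:

```python
from abc import ABC, abstractmethod
from typing import Any


class LitSpec(ABC):
    """Hypothetical spec interface: bundles a route with its codec logic."""

    @property
    @abstractmethod
    def route(self) -> str:
        """Path this spec exposes, e.g. "/v1/chat/completions"."""

    @abstractmethod
    def decode_request(self, request: dict) -> Any:
        """Convert a protocol-specific request body into model inputs."""

    @abstractmethod
    def encode_response(self, output: Any) -> dict:
        """Convert model output into a protocol-specific response body."""
```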
Motivation
We want to make it seamless for users to expose a model using one or more standard specs.
Pitch
I define a LitAPI subclass, call LitServer(api, spec=OpenAISpec, ...), and get a v1/chat/completions endpoint that behaves like an OpenAI-compatible endpoint.
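As a sketch of that flow (the spec= argument and OpenAISpec are what this issue proposes, not shipped APIs; only the LitAPI subclassing and LitServer construction exist today):

```python
import litserve as ls


class ChatAPI(ls.LitAPI):
    def setup(self, device):
        # stand-in model; a real API would load weights here
        self.model = lambda prompt: f"echo: {prompt}"

    def predict(self, prompt):
        # no decode_request/encode_response needed: the spec supplies them
        return self.model(prompt)


if __name__ == "__main__":
    # spec=OpenAISpec is the proposed argument; it would register
    # v1/chat/completions and handle OpenAI-style decoding/encoding
    server = ls.LitServer(ChatAPI(), spec=OpenAISpec)  # OpenAISpec: proposed, not yet defined
    server.run(port=8000)
```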
Alternatives
We subclass LitServer and LitAPI, but this wouldn't compose cleanly down the road with other pieces we want to factor out (e.g. KV cache management).