Commit
feat: gpt-4 and gpt-4 32k services (#456)
* feat: gpt-4 and gpt-4 32k services
* fix: add to universal
* fix: add params
1 parent b6ad4e0, commit 1f3ff38
Showing 11 changed files with 193 additions and 7 deletions.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,28 @@ | ||
name: openai_api_gpt4 | ||
display_name: GPT-4 | ||
component_type: Generative | ||
model_type: NN-based | ||
is_customizable: false | ||
author: [email protected] | ||
description: A multilingual instruction-based large language model | ||
which is capable of code generation and other complex tasks. | ||
More capable than any GPT-3.5 model, able to do more complex tasks, | ||
and optimized for chat. Paid. | ||
You must provide your OpenAI API key to use the model. | ||
Your OpenAI account will be charged according to your usage. | ||
ram_usage: 100M | ||
gpu_usage: null | ||
group: services | ||
connector: | ||
protocol: http | ||
timeout: 20.0 | ||
url: http://openai-api-gpt4:8159/respond | ||
dialog_formatter: null | ||
response_formatter: null | ||
previous_services: null | ||
required_previous_services: null | ||
state_manager_method: null | ||
tags: null | ||
endpoint: respond | ||
service: services/openai_api_lm/service_configs/openai-api-gpt4 | ||
date_created: '2023-04-16T09:45:32' |
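
The connector URL in the card above packs the protocol, host, port, and endpoint into one string, while the card also lists `protocol` and `endpoint` as separate fields. The following is a minimal Python sketch, using only values shown in the card, of how those pieces could be cross-checked. The helper name `split_connector_url` is hypothetical and is not part of the repository.

```python
from urllib.parse import urlsplit

# Values copied from the GPT-4 component card above.
CONNECTOR_URL = "http://openai-api-gpt4:8159/respond"
CARD_PROTOCOL = "http"
CARD_ENDPOINT = "respond"


def split_connector_url(url: str) -> dict:
    """Hypothetical helper: split a connector URL into its parts."""
    parts = urlsplit(url)
    return {
        "protocol": parts.scheme,
        "host": parts.hostname,
        "port": parts.port,
        "endpoint": parts.path.lstrip("/"),
    }


info = split_connector_url(CONNECTOR_URL)
# The URL should agree with the card's own protocol and endpoint fields.
assert info["protocol"] == CARD_PROTOCOL
assert info["endpoint"] == CARD_ENDPOINT
print(info)
```

The same check applies to the GPT-4 32k card by swapping in its URL; only the host and port differ.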
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,27 @@ | ||
name: openai_api_gpt4_32k | ||
display_name: GPT-4 32k | ||
component_type: Generative | ||
model_type: NN-based | ||
is_customizable: false | ||
author: [email protected] | ||
description: A multilingual instruction-based large language model | ||
which is capable of code generation and other complex tasks. | ||
Same capabilities as the base gpt-4 model but with 4x the context length. | ||
Paid. You must provide your OpenAI API key to use the model. | ||
Your OpenAI account will be charged according to your usage. | ||
ram_usage: 100M | ||
gpu_usage: null | ||
group: services | ||
connector: | ||
protocol: http | ||
timeout: 20.0 | ||
url: http://openai-api-gpt4-32k:8160/respond | ||
dialog_formatter: null | ||
response_formatter: null | ||
previous_services: null | ||
required_previous_services: null | ||
state_manager_method: null | ||
tags: null | ||
endpoint: respond | ||
service: services/openai_api_lm/service_configs/openai-api-gpt4-32k | ||
date_created: '2023-04-16T09:45:32' |
5 changes: 5 additions & 0 deletions
services/openai_api_lm/service_configs/openai-api-gpt4-32k/environment.yml
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
SERVICE_PORT: 8160 | ||
SERVICE_NAME: openai_api_gpt4_32k | ||
PRETRAINED_MODEL_NAME_OR_PATH: gpt-4-32k | ||
CUDA_VISIBLE_DEVICES: '0' | ||
FLASK_APP: server |
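
The server code that consumes these variables is not part of this diff. As a rough sketch only, and assuming the service reads them from its process environment, a loader might look like the following. `ServiceSettings` and `load_settings` are hypothetical names introduced for illustration.

```python
import os
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class ServiceSettings:
    port: int
    name: str
    model: str


def load_settings(env: Mapping[str, str] = os.environ) -> ServiceSettings:
    """Hypothetical loader: read the three service-specific variables."""
    return ServiceSettings(
        port=int(env["SERVICE_PORT"]),
        name=env["SERVICE_NAME"],
        model=env["PRETRAINED_MODEL_NAME_OR_PATH"],
    )


# Values from the environment.yml above, passed as a plain mapping
# so the sketch runs without touching the real process environment.
settings = load_settings({
    "SERVICE_PORT": "8160",
    "SERVICE_NAME": "openai_api_gpt4_32k",
    "PRETRAINED_MODEL_NAME_OR_PATH": "gpt-4-32k",
})
print(settings)
```

Passing the mapping explicitly keeps the sketch testable; a missing variable raises `KeyError` rather than falling back silently to a default model.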
31 changes: 31 additions & 0 deletions
services/openai_api_lm/service_configs/openai-api-gpt4-32k/service.yml
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,31 @@ | ||
name: openai-api-gpt4-32k | ||
endpoints: | ||
- respond | ||
compose: | ||
env_file: | ||
- .env | ||
build: | ||
args: | ||
SERVICE_PORT: 8160 | ||
SERVICE_NAME: openai_api_gpt4_32k | ||
PRETRAINED_MODEL_NAME_OR_PATH: gpt-4-32k | ||
CUDA_VISIBLE_DEVICES: '0' | ||
FLASK_APP: server | ||
context: . | ||
dockerfile: ./services/openai_api_lm/Dockerfile | ||
command: flask run -h 0.0.0.0 -p 8160 | ||
environment: | ||
- CUDA_VISIBLE_DEVICES=0 | ||
- FLASK_APP=server | ||
deploy: | ||
resources: | ||
limits: | ||
memory: 100M | ||
reservations: | ||
memory: 100M | ||
volumes: | ||
- ./services/openai_api_lm:/src | ||
- ./common:/src/common | ||
ports: | ||
- 8160:8160 | ||
proxy: null |
5 changes: 5 additions & 0 deletions
services/openai_api_lm/service_configs/openai-api-gpt4/environment.yml
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
SERVICE_PORT: 8159 | ||
SERVICE_NAME: openai_api_gpt4 | ||
PRETRAINED_MODEL_NAME_OR_PATH: gpt-4 | ||
CUDA_VISIBLE_DEVICES: '0' | ||
FLASK_APP: server |
31 changes: 31 additions & 0 deletions
services/openai_api_lm/service_configs/openai-api-gpt4/service.yml
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,31 @@ | ||
name: openai-api-gpt4 | ||
endpoints: | ||
- respond | ||
compose: | ||
env_file: | ||
- .env | ||
build: | ||
args: | ||
SERVICE_PORT: 8159 | ||
SERVICE_NAME: openai_api_gpt4 | ||
PRETRAINED_MODEL_NAME_OR_PATH: gpt-4 | ||
CUDA_VISIBLE_DEVICES: '0' | ||
FLASK_APP: server | ||
context: . | ||
dockerfile: ./services/openai_api_lm/Dockerfile | ||
command: flask run -h 0.0.0.0 -p 8159 | ||
environment: | ||
- CUDA_VISIBLE_DEVICES=0 | ||
- FLASK_APP=server | ||
deploy: | ||
resources: | ||
limits: | ||
memory: 100M | ||
reservations: | ||
memory: 100M | ||
volumes: | ||
- ./services/openai_api_lm:/src | ||
- ./common:/src/common | ||
ports: | ||
- 8159:8159 | ||
proxy: null |
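
The service.yml above states the port in three places: the `SERVICE_PORT` build argument, the `flask run -p` command, and the `ports` mapping. A small Python sketch of a consistency check follows. The dictionary is a hand-copied subset of the file (the key `build_args` is a flattened stand-in for `compose.build.args`), and `declared_ports` is a hypothetical helper, not something the repository provides.

```python
import re

# Port-related values copied from the service.yml above.
service = {
    "name": "openai-api-gpt4",
    "build_args": {"SERVICE_PORT": 8159},
    "command": "flask run -h 0.0.0.0 -p 8159",
    "ports": ["8159:8159"],
}


def declared_ports(svc: dict) -> set:
    """Hypothetical check: collect every port number the config mentions."""
    ports = {int(svc["build_args"]["SERVICE_PORT"])}
    # Port passed to `flask run` via the -p option.
    match = re.search(r"-p\s+(\d+)", svc["command"])
    if match:
        ports.add(int(match.group(1)))
    # Each mapping has the form "host:container".
    for mapping in svc["ports"]:
        host, container = mapping.split(":")
        ports.update({int(host), int(container)})
    return ports


ports = declared_ports(service)
print(ports)
```

A result with exactly one element means all three places agree; more than one element would point to a copy-paste mismatch between the two service configs.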