
Bug Fix: chat completions API calls need model_id #114

Merged
merged 16 commits into opea-project:main on Jun 21, 2024

Conversation

tybrs
Contributor

@tybrs tybrs commented May 30, 2024

Description

Users must pass the `model` parameter explicitly to use the `/v1/chat/completions` API:

```bash
curl http://localhost:8088/v1/chat/completions \
    -X POST \
    -d '{"model": "meta-llama/Meta-Llama-Guard-2-8B", "messages": [{"role": "user", "content": "Say this is a test!"}]}' \
    -H 'Content-Type: application/json'
```
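The same request can be issued from Python; a minimal sketch using only the standard library (the endpoint and model name mirror the curl example, and `build_chat_request` is an illustrative helper, not part of the codebase):

```python
import json
import urllib.request

def build_chat_request(model, text):
    """Build the JSON payload for a /v1/chat/completions call."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": text}],
    }

payload = build_chat_request("meta-llama/Meta-Llama-Guard-2-8B", "Say this is a test!")
req = urllib.request.Request(
    "http://localhost:8088/v1/chat/completions",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
    method="POST",
)
# urllib.request.urlopen(req) would send the request to a running TGI service.
```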

Using `ChatHuggingFace` in LangChain, this looks like the following:

```python
llm_guard_chat = ChatHuggingFace(llm=llm_guard, model_id=safety_guard_model)
llm_guard_chat.invoke([{"role": "user", "content": input.text}]).content
```

If you do not pass a `model_id`, `ChatHuggingFace` falls back to its `_resolve_model_id` method to determine one, but this does not work for locally deployed TGI services: it relies on the `huggingface_hub` `list_inference_endpoints` function, which only lists the inference endpoints registered to your Hugging Face account. This issue is tracked in langchain-ai/langchain#17779. The current implementation errors as follows:

```
llm_engine_hf = ChatHuggingFace(llm=llm_guard)
                ^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/langchain_community/chat_models/huggingface.py", line 54, in __init__
    self._resolve_model_id()
  File "/usr/local/lib/python3.11/site-packages/langchain_community/chat_models/huggingface.py", line 158, in _resolve_model_id
    raise ValueError(
ValueError: Failed to resolve model_id Could not find model id for inference server provided: http://xx.xx.xx.xxx/
Make sure that your Hugging Face token has access to the endpoint.
```

This PR adds a workaround by introducing a `DEFAULT_MODEL` variable and by attempting to fetch the model id from the TGI server's `/info` endpoint. I also moved the `ChatHuggingFace` instantiation to server startup, which should reduce the latency of each API call by 1-2 seconds.
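The fallback logic can be sketched as follows (function and variable names here are illustrative, not necessarily the ones used in the PR): query the TGI server's `/info` endpoint for its `model_id`, and fall back to a default instead of raising when the server cannot be reached.

```python
import json
import urllib.request

DEFAULT_MODEL = "meta-llama/Meta-Llama-Guard-2-8B"  # placeholder default

def resolve_model_id(endpoint, default=DEFAULT_MODEL, timeout=2):
    """Try to read model_id from a TGI server's /info endpoint.

    Falls back to `default` when the server is unreachable or the
    response carries no model_id, instead of raising a ValueError
    like ChatHuggingFace._resolve_model_id does.
    """
    try:
        with urllib.request.urlopen(endpoint.rstrip("/") + "/info", timeout=timeout) as resp:
            info = json.load(resp)
        return info.get("model_id", default)
    except (OSError, ValueError):
        # URLError (unreachable host) is a subclass of OSError;
        # ValueError covers malformed JSON responses.
        return default
```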

Issues

n/a.

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)

Dependencies

langchain-community

Tests

  1. I ran two local TGI services for both LlamaGuard-7b and Meta-Llama-Guard-2-8B:

```bash
docker run -p 8087:80 -v $PWD/data:/data -e HTTPS_PROXY=$https_proxy -e HTTP_PROXY=$https_proxy ghcr.io/huggingface/text-generation-inference --model-id meta-llama/LlamaGuard-7b
docker run -p 8087:80 -v $PWD/data:/data -e HTTPS_PROXY=$https_proxy -e HTTP_PROXY=$https_proxy ghcr.io/huggingface/text-generation-inference --model-id meta-llama/Meta-Llama-Guard-2-8B
```

  2. Then I built and ran guardrails-tgi-server:

```bash
docker build -t opea/gen-ai-comps:guardrails-tgi-server --build-arg https_proxy=$https_proxy --build-arg http_proxy=$http_proxy -f comps/guardrails/langchain/docker/Dockerfile .
docker run -p 9090:9090 -e https_proxy=$https_proxy -e http_proxy=$http_proxy -e no_proxy=$no_proxy -e HUGGINGFACEHUB_API_TOKEN=$HUGGINGFACEHUB_API_TOKEN -e SAFETY_GUARD_ENDPOINT="http://localhost:8088" --network="host" opea/gen-ai-comps:guardrails-tgi-server
```

  3. Tested with the following curl:

```bash
$ curl http://localhost:9090/v1/guardrails -X POST -d '{"text": "i am going to kill you"}' -H 'Content-Type: application/json'
{"id":"e6a511430e2d5158d2923a4099502945","text":"Violated policies: Violent Crimes, please check your input."}
```

Tyler Wilbers and others added 2 commits May 29, 2024 17:38
@tybrs
Contributor Author

tybrs commented Jun 7, 2024

@lvliang-intel Thanks for the review. Should we assign another person for review?

@tybrs
Contributor Author

tybrs commented Jun 19, 2024

@dcmiddle and @lvliang-intel Comments addressed and conflict fixed. Thanks!

@tybrs tybrs requested a review from dcmiddle June 20, 2024 23:54
@lvliang-intel lvliang-intel requested review from chensuyue and zehao-intel and removed request for dcmiddle June 21, 2024 01:20
@lvliang-intel lvliang-intel merged commit 88a147d into opea-project:main Jun 21, 2024
7 checks passed
sharanshirodkar7 pushed a commit to sharanshirodkar7/GenAIComps that referenced this pull request Jul 9, 2024
* added default model

Signed-off-by: Tyler Wilbers <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* added instructions for enviornment variable

Signed-off-by: Tyler Wilbers <[email protected]>

* added bash to codeblock

Signed-off-by: Tyler Wilbers <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fixed typo

Signed-off-by: Tyler Wilbers <[email protected]>

---------

Signed-off-by: Tyler Wilbers <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: sharanshirodkar7 <[email protected]>
yogeshmpandey pushed a commit to yogeshmpandey/GenAIComps that referenced this pull request Jul 10, 2024
dwhitena pushed a commit to predictionguard/GenAIComps that referenced this pull request Jul 24, 2024
lkk12014402 pushed a commit that referenced this pull request Aug 8, 2024