
Add output evaluation for guardrails #332

Merged
merged 4 commits into opea-project:main on Aug 13, 2024

Conversation

tybrs
Contributor

@tybrs tybrs commented Jul 21, 2024

Description

This PR attempts to reduce latency by allowing the guardrails microservice to sit at the end of the DAG and evaluate the safety of both input and output with a single query. This means you can structure the DAG as follows to reduce the latency of the RAG flow:

        self.megaservice.add(embedding).add(retriever).add(rerank).add(llm).add(guardrails)
        self.megaservice.flow_to(embedding, retriever)
        self.megaservice.flow_to(retriever, rerank)
        self.megaservice.flow_to(rerank, llm)
        self.megaservice.flow_to(llm, guardrails)
        self.gateway = ChatQnAGateway(megaservice=self.megaservice, host="0.0.0.0", port=self.port)

Currently our implementation only supports safeguarding a single input. But the chat Messages API (v1/chat/completions) allows templating both "user" inputs and "assistant" outputs in a single query. This PR adds the ability to send a list of both "user" and "assistant" messages for safeguarding. Since the LLM outputs both "prompt" and "text" in a GeneratedDoc, you can feed its output directly into the guardrails microservice:

curl http://localhost:9090/v1/guardrails \
  -X POST \
  -d '{
    "prompt" : "How do you buy a tiger in the US",
    "text" : "Yes! Buy a tiger in the US.",
    "parameters":{"max_new_tokens":32}
  }' \
  -H 'Content-Type: application/json'
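Conceptually, the mapping from those two fields onto chat-style messages could look like the sketch below. The function name and exact field handling are illustrative, not the PR's actual code:

```python
def to_messages(doc: dict) -> list:
    """Map a guardrails request body onto chat-style messages.

    A body carrying both "prompt" and "text" (the shape of the LLM's
    GeneratedDoc) yields a user/assistant pair so the safety model can
    judge input and output in one query; a body with only "text" keeps
    the original input-only behavior.
    """
    if "prompt" in doc:
        return [
            {"role": "user", "content": doc["prompt"]},
            {"role": "assistant", "content": doc["text"]},
        ]
    return [{"role": "user", "content": doc["text"]}]
```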

However, it maintains backwards compatibility with the following DAG:

        self.megaservice.add(guardrail_in).add(embedding).add(retriever).add(rerank).add(llm).add(guardrail_out)
        self.megaservice.flow_to(guardrail_in, embedding)
        self.megaservice.flow_to(embedding, retriever)
        self.megaservice.flow_to(retriever, rerank)
        self.megaservice.flow_to(rerank, llm)
        self.megaservice.flow_to(llm, guardrail_out)
        self.gateway = ChatQnAGateway(megaservice=self.megaservice, host="0.0.0.0", port=self.port)

The guardrails microservice can also be queried with text alone:

curl http://localhost:9090/v1/guardrails \
  -X POST \
  -d '{
    "text" : "How do you buy a tiger in the US",
    "parameters":{"max_new_tokens":32}
  }' \
  -H 'Content-Type: application/json'

Issues

n/a.

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)
  • Others (enhancement, documentation, validation, etc.)

Dependencies

Pinned huggingface_hub<=0.24.0 because HuggingFaceEndpoint assigns either endpoint_url or repo_id to the InferenceClient.model attribute. This becomes a bug with huggingface_hub>0.24.0, which introduces a base_url kwarg on InferenceClient that should be used instead.
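The pin can be captured in the component's requirements file (path and surrounding entries illustrative):

```
huggingface_hub<=0.24.0
```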

Tests

Ran the following:

docker run -d --name="guardrails-tgi-server" -p 9090:9090 --ipc=host -e http_proxy=$http_proxy -e https_proxy=$https_proxy -e no_proxy=$no_proxy -e SAFETY_GUARD_ENDPOINT=${SAFETY_GUARD_ENDPOINT} -e HUGGINGFACEHUB_API_TOKEN=${HUGGINGFACEHUB_API_TOKEN} opea/guardrails-tgi:latest
curl http://localhost:9090/v1/guardrails   -X POST   -d '{
    "messages":[{"role": "user", "text" : "How do you buy a tiger in the US?"}],
    "parameters":{"max_new_tokens":32}
  }'   -H 'Content-Type: application/json'

Output:

{"downstream_black_list":[".*"],"id":"6e70dcfa9db13087fc390d84c7869e7a","text":"Violated policies: Violent Crimes, please check your input."}
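The "downstream_black_list" field appears to tell the caller which downstream nodes to skip when the input is flagged. A minimal sketch of how a megaservice might interpret it (helper and node names here are hypothetical, not the project's actual API):

```python
import re

def allowed_downstream(nodes: list, black_list: list) -> list:
    """Keep only downstream nodes whose names match no blacklist pattern.

    The guardrails response above returns [".*"], which matches every
    node name, so all downstream processing is skipped and the policy
    violation message is returned directly.
    """
    return [
        node for node in nodes
        if not any(re.fullmatch(pattern, node) for pattern in black_list)
    ]
```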


codecov bot commented Jul 21, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Files Coverage Δ
comps/cores/proto/docarray.py 100.00% <100.00%> (ø)

@ashahba ashahba self-requested a review July 23, 2024 17:05
@tybrs
Contributor Author

tybrs commented Aug 6, 2024

@lvliang-intel @letonghan This PR should be ready for review. Is there a benchmark for latency with guardrails? I would love to measure the potential effect.

Collaborator

@ashahba ashahba left a comment


LGTM!

comps/guardrails/langchain/guardrails_tgi_gaudi.py (review comment, outdated, resolved)
Collaborator

@ashahba ashahba left a comment


LGTM!

lkk12014402 pushed a commit that referenced this pull request Aug 8, 2024
* ChatQnA chinese version

Signed-off-by: Yue, Wenjiao <[email protected]>

* format chinese response

* update chinese format response

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Yue, Wenjiao <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
@ashahba ashahba merged commit 62ca5bc into opea-project:main Aug 13, 2024
7 checks passed
BaoHuiling pushed a commit to siddhivelankar23/GenAIComps that referenced this pull request Aug 15, 2024
* add single query input/output guardrails

Signed-off-by: Tyler Wilbers <[email protected]>

* removed comment

Signed-off-by: Tyler Wilbers <[email protected]>

---------

Signed-off-by: Tyler Wilbers <[email protected]>
BaoHuiling pushed a commit to siddhivelankar23/GenAIComps that referenced this pull request Aug 15, 2024
tileintel pushed a commit to siddhivelankar23/GenAIComps that referenced this pull request Aug 22, 2024
sharanshirodkar7 pushed a commit to predictionguard/pg-GenAIComps that referenced this pull request Sep 3, 2024
3 participants