Skip to content

Commit

Permalink
'ghcr.io/huggingface/text-generation-inference:sha-e4201f4-intel-cpu'…
Browse files Browse the repository at this point in the history
… is intel cpu optimized tgi image, we need to use this one for all xeon platform. (#444)

Signed-off-by: zhlsunshine <[email protected]>
  • Loading branch information
zhlsunshine committed Sep 20, 2024
1 parent 2517e79 commit c84ac4c
Show file tree
Hide file tree
Showing 3 changed files with 4 additions and 3 deletions.
3 changes: 2 additions & 1 deletion helm-charts/common/tgi/values.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -26,7 +26,8 @@ image:
repository: ghcr.io/huggingface/text-generation-inference
pullPolicy: IfNotPresent
# Overrides the image tag whose default is the chart appVersion.
tag: "2.2.0"
# `sha-e4201f4-intel-cpu` is the image tag for intel cpu optimized tgi image
tag: "sha-e4201f4-intel-cpu"

# empty for CPU
accelDevice: ""
Expand Down
2 changes: 1 addition & 1 deletion microservices-connector/config/manifests/tgi.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -87,7 +87,7 @@ spec:
optional: true
securityContext:
{}
image: "ghcr.io/huggingface/text-generation-inference:2.2.0"
image: "ghcr.io/huggingface/text-generation-inference:sha-e4201f4-intel-cpu"
imagePullPolicy: IfNotPresent
volumeMounts:
- mountPath: /data
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -19,7 +19,7 @@ The ChatQnA uses the below prebuilt images if you choose a Xeon deployment
- dataprep-redis: opea/dataprep-redis:latest
- tei_xeon_service: ghcr.io/huggingface/text-embeddings-inference:cpu-1.5
- tei_embedding_service: ghcr.io/huggingface/text-embeddings-inference:cpu-1.5
- tgi-service: ghcr.io/huggingface/text-generation-inference:2.2.0
- tgi-service: ghcr.io/huggingface/text-generation-inference:sha-e4201f4-intel-cpu
- redis-vector-db: redis/redis-stack:7.2.0-v9

Should you desire to use the Gaudi accelerator, two alternate images are used for the embedding and llm services.
Expand Down

0 comments on commit c84ac4c

Please sign in to comment.