[BUG] TGI versions inconsistency / use of old TGI versions #1563

eero-t · 2025-02-18T09:59:01Z

Currently latest used TGI versions in this repo are v2.3.1 (Gaudi) / v2.4.1 (CPU).

However there are several files where much older versions are used.

GenAIExamples, old CPU/rocm versions:

GenAIExamples$ git grep text-generation-inference: | grep -v -e github -e 2.[34].[01]
AudioQnA/kubernetes/gmc/README.md:- tgi-service: ghcr.io/huggingface/text-generation-inference:1.4
ChatQnA/docker_compose/nvidia/gpu/compose.yaml:    image: ghcr.io/huggingface/text-generation-inference:2.2.0
DBQnA/docker_compose/intel/cpu/xeon/README.md:docker run -d --name="test-text2sql-tgi-endpoint" --ipc=host -p $TGI_PORT:80 -v ./data:/data --shm-size 1g -e HUGGINGFACEHUB_API_TOKEN=${HUGGINGFACEHUB_API_TOKEN} -e HF_TOKEN=${HF_TOKEN} -e model=${model} ghcr.io/huggingface/text-generation-inference:2.1.0 --model-id $model
DBQnA/docker_compose/intel/cpu/xeon/compose.yaml:    image: ghcr.io/huggingface/text-generation-inference:2.1.0
DocSum/tests/test_compose_on_rocm.sh:    docker pull ghcr.io/huggingface/text-generation-inference:1.4
DocSum/tests/test_compose_on_xeon.sh:    docker pull ghcr.io/huggingface/text-generation-inference:1.4
FaqGen/tests/test_compose_on_xeon.sh:    docker pull ghcr.io/huggingface/text-generation-inference:1.4
MultimodalQnA/docker_compose/amd/gpu/rocm/compose.yaml:    image: ghcr.io/huggingface/text-generation-inference:3.0.1-rocm

GenAIComps, old CPU versions:

GenAIComps$ git grep text-generation-inference: | grep -v -e github -e 2.[34].[01]
comps/text2sql/src/README.md:docker run -d --name="text2sql-tgi-endpoint" --ipc=host -p $TGI_PORT:80 -v ./data:/data --shm-size 1g -e HF_TOKEN=${HUGGINGFACEHUB_API_TOKEN} -e model=${LLM_MODEL_ID} ghcr.io/huggingface/text-generation-inference:2.1.0 --model-id $LLM_MODEL_ID

GenAIExamples, old Gaudi versions (latest used version is 2.3.1):

$ git grep tgi-gaudi:2.0 | wc -l
40

PS. All TEI image references are for 1.5 version, i.e. consistent.

The text was updated successfully, but these errors were encountered:

xiguiw · 2025-02-25T15:23:53Z

@eero-t
Thanks! Good catch.

Do you have plan to submit PR to fix this?

eero-t · 2025-02-25T17:42:43Z

Do you have plan to submit PR to fix this?

@xiguiw No.

(Fixing this could be a good "beginner" / "first time" task PR.)

zhanmyz · 2025-03-06T06:26:23Z

OPEA_Team4 is working on this issue

- Update TGI CPU/rocm version to v2.4.1 - Update TGI Gaudi version to v2.3.1 Fixes opea-project#1563 Signed-off-by: xiaotia3 <xiaotian.chen@intel.com> Signed-off-by: Ma, YaZhan <yazhan.ma@intel.com> Signed-off-by: Gao, Fengqian <fengqian.gao@intel.com> Signed-off-by: Wang, Le3 <le3.wang@intel.com>

- Update TGI CPU/rocm version to v2.4.1 - Update TGI Gaudi version to v2.3.1 Fixes opea-project#1563 Signed-off-by: xiaotia3 <xiaotian.chen@intel.com> Signed-off-by: Ma, Yazhan <yazhan.ma@intel.com> Signed-off-by: Gao, Fengqian <fengqian.gao@intel.com> Signed-off-by: Wang, Le3 <le3.wang@intel.com>

yinghu5 · 2025-03-26T01:19:23Z

Hi @xiaotia3 thank you a lot for the contribution. will remind team to review. thanks

eero-t · 2025-03-31T08:24:16Z

PS. All TEI image references are for 1.5.0 version, i.e. consistent.

But somewhat out of date. 1.5.0 was released last summer, whereas latest tei-gaudi release is 1.5.3 (and "GenAIComps" project use 1.5.2): https://github.com/huggingface/tei-gaudi/releases

eero-t · 2025-04-01T09:53:38Z

GenAIComps CPU/rocm TGI is now consistent version, but this repo is not quite done yet, there's still lot of discrepancy.

While most are now on TGI 2.4.x, some references to older version still exist:

GenAIExamples$ git grep text-generation-inference: | wc -l
87
GenAIExamples$ git grep text-generation-inference: | grep -v 2.4.1 | wc -l
55
GenAIExamples$ git grep text-generation-inference: | grep -v 2.4 | wc -l
20

Same thing with Gaudi version:

GenAIExamples$ git grep /tgi-gaudi: | wc -l
51
GenAIExamples$ git grep /tgi-gaudi: | grep -v 2.3.1 | wc -l
8

Also in GenAIComps:

GenAIComps$ git grep /tgi-gaudi: | wc -l
10
GenAIComps$ git grep /tgi-gaudi: | grep -v 2.3.1 | wc -l
4

@chensuyue please re-open.

chensuyue · 2025-04-02T03:42:38Z

@xiaotia3 will you continue submit PR for this issue?

xiaotia3 · 2025-04-02T05:25:47Z

@xiaotia3 will you continue submit PR for this issue?

I will. The old version images that still exist were likely introduced during the period when the PR was trying to be merged. Let me update them.

And due to known issues, ChatQnA and AvatarChatbot may not be updated , is it ok?

eero-t · 2025-04-02T11:03:31Z

And due to known issues, ChatQnA and AvatarChatbot may not be updated , is it ok?

Those can be updated in a separate PR after their issues have been fixed.

xiguiw · 2025-04-16T08:00:22Z

OPEA_Team4 is working on this issue

@zhanmyz @xiaotia3
@1625 is merged.

Thanks for your contributions.
Found two TGI images.

Would you please help on this? Thanks!

GenAIExample
DocSum/tests/test_compose_tgi_on_xeon.sh: docker pull ghcr.io/huggingface/text-generation-inference:1.4

GenAIComp
comps/text2sql/src/README.md: docker run -d --name="text2sql-tgi-endpoint" --ipc=host -p $T G I_{P} O R T : 80 - v . / d a t a : / d a t a - - s h m - s i z e 1 g - e H F_{T} O K E N =$ {HUGGINGFACEHUB_API_TOKEN} -e model=${LLM_MODEL_ID} ghcr.io/huggingface/text-generation-inference:2.1.0 --model-id $LLM_MODEL_ID

yinghu5 added OPEAHack good first issue help wanted labels Feb 26, 2025

yinghu5 mentioned this issue Feb 28, 2025

Update DBQnA tgi docker image to latest tgi 2.4.0 #1593

Merged

yinghu5 added this to OPEA Feb 28, 2025

yinghu5 added this to the v1.3 milestone Feb 28, 2025

joshuayao added the Backlog label Mar 3, 2025

xiaotia3 mentioned this issue Mar 6, 2025

Update TGI image versions #1625

Merged

4 tasks

joshuayao moved this to In progress in OPEA Mar 24, 2025

joshuayao moved this from In progress to In review in OPEA Mar 24, 2025

xiaotia3 mentioned this issue Mar 25, 2025

Update TGI image versions partial #1718

Closed

4 tasks

yinghu5 added the A2 label Mar 26, 2025

chensuyue closed this as completed in #1625 Apr 1, 2025

github-project-automation bot moved this from In review to Done in OPEA Apr 1, 2025

chensuyue reopened this Apr 2, 2025

xiaotia3 linked a pull request Apr 3, 2025 that will close this issue

Update TGI image versions #1749

Open

4 tasks

joshuayao linked a pull request Apr 15, 2025 that will close this issue

Update TGI image versions #1749

Open

4 tasks

joshuayao moved this from Done to In review in OPEA Apr 15, 2025

joshuayao added the bug label Apr 16, 2025

joshuayao modified the milestones: v1.3, v1.4 Apr 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[BUG] TGI versions inconsistency / use of old TGI versions #1563

[BUG] TGI versions inconsistency / use of old TGI versions #1563

eero-t commented Feb 18, 2025

xiguiw commented Feb 25, 2025

Uh oh!

eero-t commented Feb 25, 2025

Uh oh!

zhanmyz commented Mar 6, 2025

Uh oh!

yinghu5 commented Mar 26, 2025

Uh oh!

eero-t commented Mar 31, 2025

Uh oh!

eero-t commented Apr 1, 2025

Uh oh!

chensuyue commented Apr 2, 2025

Uh oh!

xiaotia3 commented Apr 2, 2025

Uh oh!

eero-t commented Apr 2, 2025

Uh oh!

xiguiw commented Apr 16, 2025

Uh oh!

[BUG] TGI versions inconsistency / use of old TGI versions #1563

[BUG] TGI versions inconsistency / use of old TGI versions #1563

Comments

eero-t commented Feb 18, 2025

xiguiw commented Feb 25, 2025

Uh oh!

eero-t commented Feb 25, 2025

Uh oh!

zhanmyz commented Mar 6, 2025

Uh oh!

yinghu5 commented Mar 26, 2025

Uh oh!

eero-t commented Mar 31, 2025

Uh oh!

eero-t commented Apr 1, 2025

Uh oh!

chensuyue commented Apr 2, 2025

Uh oh!

xiaotia3 commented Apr 2, 2025

Uh oh!

eero-t commented Apr 2, 2025

Uh oh!

xiguiw commented Apr 16, 2025

Uh oh!