Skip to content

Actions: huggingface/text-generation-inference

Server Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,427 workflow runs
2,427 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Choosing input/total tokens automatically based on available VRAM?
Server Tests #3261: Pull request #2673 synchronize by Narsil
October 23, 2024 07:03 9m 39s auto_length
October 23, 2024 07:03 9m 39s
Add support for stop words in TRTLLM
Server Tests #3259: Pull request #2678 synchronize by mfuntowicz
October 22, 2024 21:06 8m 42s trtllm-stop-words
October 22, 2024 21:06 8m 42s
Add support for stop words in TRTLLM
Server Tests #3256: Pull request #2678 opened by mfuntowicz
October 22, 2024 08:15 6m 37s trtllm-stop-words
October 22, 2024 08:15 6m 37s
[TENSORRT-LLM] - Implement new looper thread based backend
Server Tests #3255: Pull request #2357 synchronize by mfuntowicz
October 22, 2024 07:51 8m 25s trtllm-executor-thread
October 22, 2024 07:51 8m 25s
[TENSORRT-LLM] - Implement new looper thread based backend
Server Tests #3254: Pull request #2357 synchronize by mfuntowicz
October 21, 2024 19:44 8m 33s trtllm-executor-thread
October 21, 2024 19:44 8m 33s
Add support for FP8 KV cache scales
Server Tests #3253: Pull request #2628 synchronize by danieldk
October 21, 2024 17:25 7m 31s feature/fp8-kv-cache-scale
October 21, 2024 17:25 7m 31s
Add support for FP8 KV cache scales
Server Tests #3252: Pull request #2628 synchronize by danieldk
October 21, 2024 17:21 4m 16s feature/fp8-kv-cache-scale
October 21, 2024 17:21 4m 16s
[TENSORRT-LLM] - Implement new looper thread based backend
Server Tests #3251: Pull request #2357 synchronize by mfuntowicz
October 21, 2024 15:03 8m 23s trtllm-executor-thread
October 21, 2024 15:03 8m 23s
Choosing input/total tokens automatically based on available VRAM?
Server Tests #3250: Pull request #2673 synchronize by Narsil
October 21, 2024 13:24 7m 22s auto_length
October 21, 2024 13:24 7m 22s
Choosing input/total tokens automatically based on available VRAM?
Server Tests #3249: Pull request #2673 synchronize by Narsil
October 21, 2024 13:07 5m 51s auto_length
October 21, 2024 13:07 5m 51s
[TENSORRT-LLM] - Implement new looper thread based backend
Server Tests #3248: Pull request #2357 synchronize by mfuntowicz
October 21, 2024 13:07 6m 32s trtllm-executor-thread
October 21, 2024 13:07 6m 32s
Choosing input/total tokens automatically based on available VRAM?
Server Tests #3247: Pull request #2673 synchronize by Narsil
October 21, 2024 12:57 5m 58s auto_length
October 21, 2024 12:57 5m 58s
[TENSORRT-LLM] - Implement new looper thread based backend
Server Tests #3246: Pull request #2357 synchronize by mfuntowicz
October 21, 2024 12:57 6m 5s trtllm-executor-thread
October 21, 2024 12:57 6m 5s
[TENSORRT-LLM] - Implement new looper thread based backend
Server Tests #3245: Pull request #2357 synchronize by mfuntowicz
October 21, 2024 12:52 3m 53s trtllm-executor-thread
October 21, 2024 12:52 3m 53s
Fixing performance degradation on Intel.
Server Tests #3244: Pull request #2674 synchronize by Narsil
October 21, 2024 12:49 7m 12s close_dl_thread
October 21, 2024 12:49 7m 12s
Fixing performance degradation on Intel.
Server Tests #3243: Pull request #2674 opened by Narsil
October 21, 2024 12:46 3m 30s close_dl_thread
October 21, 2024 12:46 3m 30s
Choosing input/total tokens automatically based on available VRAM?
Server Tests #3242: Pull request #2673 opened by Narsil
October 21, 2024 11:03 6m 15s auto_length
October 21, 2024 11:03 6m 15s
Add support for FP8 KV cache scales
Server Tests #3241: Pull request #2628 synchronize by danieldk
October 21, 2024 10:42 6m 25s feature/fp8-kv-cache-scale
October 21, 2024 10:42 6m 25s
[TENSORRT-LLM] - Implement new looper thread based backend
Server Tests #3240: Pull request #2357 synchronize by mfuntowicz
October 21, 2024 10:31 5m 40s trtllm-executor-thread
October 21, 2024 10:31 5m 40s
PR 2634 CI - Fix the tool_choice format for named choice by adapting OpenAIs scheme
Server Tests #3239: Pull request #2645 synchronize by drbh
October 20, 2024 21:57 8m 49s pr-2634-ci-branch
October 20, 2024 21:57 8m 49s
PR 2634 CI - Fix the tool_choice format for named choice by adapting OpenAIs scheme
Server Tests #3237: Pull request #2645 synchronize by drbh
October 18, 2024 16:12 6m 48s pr-2634-ci-branch
October 18, 2024 16:12 6m 48s
CI job. Gpt awq 4
Server Tests #3236: Pull request #2665 synchronize by Narsil
October 18, 2024 15:55 6m 45s gpt_awq_4
October 18, 2024 15:55 6m 45s