Skip to content

Actions: huggingface/text-generation-inference

Server Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,427 workflow runs
2,427 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Fixing auto bloom test.
Server Tests #3312: Pull request #2699 opened by Narsil
October 28, 2024 05:14 8m 51s fix_bloom_tests
October 28, 2024 05:14 8m 51s
Update poetry lock.
Server Tests #3311: Pull request #2698 opened by Narsil
October 28, 2024 04:12 7m 6s upgrade_poetry_lock
October 28, 2024 04:12 7m 6s
Upgrade outlines to 0.1.1
Server Tests #3310: Pull request #2690 synchronize by Narsil
October 28, 2024 04:11 3m 36s upgrade-outlines
October 28, 2024 04:11 3m 36s
Upgrade outlines to 0.1.1
Server Tests #3309: Pull request #2658 synchronize by Narsil
October 28, 2024 04:10 13s aW3st:upgrade-outlines
October 28, 2024 04:10 13s
Check if allowed tokens is None
Server Tests #3308: Pull request #2694 synchronize by Narsil
October 28, 2024 04:10 1s aW3st:upgrade-outlines
October 28, 2024 04:10 1s
Upgrade outlines to 0.1.1
Server Tests #3307: Pull request #2690 synchronize by Narsil
October 28, 2024 04:05 5m 15s upgrade-outlines
October 28, 2024 04:05 5m 15s
Support qwen2 vl
Server Tests #3306: Pull request #2689 synchronize by drbh
October 28, 2024 03:07 8m 9s support-qwen2-vl
October 28, 2024 03:07 8m 9s
Support qwen2 vl
Server Tests #3305: Pull request #2689 synchronize by drbh
October 28, 2024 02:20 6m 31s support-qwen2-vl
October 28, 2024 02:20 6m 31s
Support qwen2 vl
Server Tests #3304: Pull request #2689 synchronize by drbh
October 28, 2024 02:15 4m 26s support-qwen2-vl
October 28, 2024 02:15 4m 26s
chore: prepare 2.4.0 release
Server Tests #3303: Pull request #2695 opened by OlivierDehaene
October 25, 2024 20:41 8m 58s chore/prepare_2.4
October 25, 2024 20:41 8m 58s
feat: add triton kernels to decrease latency of large batches
Server Tests #3302: Pull request #2687 synchronize by OlivierDehaene
October 25, 2024 20:14 8m 59s feat/triton_prepare
October 25, 2024 20:14 8m 59s
feat: add triton kernels to decrease latency of large batches
Server Tests #3301: Pull request #2687 synchronize by OlivierDehaene
October 25, 2024 19:13 9m 29s feat/triton_prepare
October 25, 2024 19:13 9m 29s
Switch from fbgemm-gpu w8a8 scaled matmul to vLLM/marlin-kernels
Server Tests #3296: Pull request #2688 synchronize by danieldk
October 25, 2024 10:09 9m 3s feature/cc89-cutlass-w8a8
October 25, 2024 10:09 9m 3s
feat: add triton kernels to decrease latency of large batches
Server Tests #3295: Pull request #2687 synchronize by OlivierDehaene
October 25, 2024 09:34 7m 48s feat/triton_prepare
October 25, 2024 09:34 7m 48s
Upgrade outlines to 0.1.1
Server Tests #3294: Pull request #2690 synchronize by Narsil
October 25, 2024 08:49 6m 22s upgrade-outlines
October 25, 2024 08:49 6m 22s
feat: add triton kernels to decrease latency of large batches
Server Tests #3293: Pull request #2687 synchronize by OlivierDehaene
October 25, 2024 08:43 6m 33s feat/triton_prepare
October 25, 2024 08:43 6m 33s
feat: add triton kernels to decrease latency of large batches
Server Tests #3292: Pull request #2687 synchronize by OlivierDehaene
October 25, 2024 08:37 6m 21s feat/triton_prepare
October 25, 2024 08:37 6m 21s
Choosing input/total tokens automatically based on available VRAM?
Server Tests #3291: Pull request #2673 synchronize by Narsil
October 25, 2024 08:20 7m 18s auto_length
October 25, 2024 08:20 7m 18s
We can have a tokenizer anywhere.
Server Tests #3290: Pull request #2527 synchronize by Narsil
October 25, 2024 07:59 7m 28s omni_tokenizer
October 25, 2024 07:59 7m 28s
Fixing rocm gptq by using triton code too (renamed cuda into triton).
Server Tests #3289: Pull request #2691 opened by Narsil
October 25, 2024 05:27 6m 16s fix_rocm_ci
October 25, 2024 05:27 6m 16s
We can have a tokenizer anywhere.
Server Tests #3288: Pull request #2527 synchronize by Narsil
October 25, 2024 05:23 3m 4s omni_tokenizer
October 25, 2024 05:23 3m 4s
[TENSORRT-LLM] - Implement new looper thread based backend
Server Tests #3287: Pull request #2357 synchronize by Narsil
October 25, 2024 05:16 6m 51s trtllm-executor-thread
October 25, 2024 05:16 6m 51s