Actions: huggingface/optimum-benchmark

CLI CUDA vLLM Tests

519 workflow runs

Per token latency outliers (#225)
CLI CUDA vLLM Tests #69: Commit 7999050 pushed by IlyasMoutawwakil
July 3, 2024 15:28 4m 47s main
Per token latency outliers
CLI CUDA vLLM Tests #68: Pull request #225 opened by IlyasMoutawwakil
July 3, 2024 15:24 7m 7s better-per-token-latency
Patch release (#224)
CLI CUDA vLLM Tests #67: Commit 8ebe853 pushed by IlyasMoutawwakil
July 3, 2024 14:45 7m 23s main
Patch release
CLI CUDA vLLM Tests #66: Pull request #224 opened by IlyasMoutawwakil
July 3, 2024 14:45 4m 34s IlyasMoutawwakil-patch-2
Fix per token latency (#223)
CLI CUDA vLLM Tests #65: Commit 2a75c0b pushed by IlyasMoutawwakil
July 3, 2024 14:05 7m 9s main
Fix per token latency
CLI CUDA vLLM Tests #64: Pull request #223 opened by IlyasMoutawwakil
July 3, 2024 13:33 7m 8s fix-per-token-latency
bump version 0.3.0 (#221)
CLI CUDA vLLM Tests #63: Commit 19eeac5 pushed by IlyasMoutawwakil
July 2, 2024 10:15 10m 46s main
bump version 0.3.0
CLI CUDA vLLM Tests #62: Pull request #221 opened by IlyasMoutawwakil
July 2, 2024 10:15 7m 3s IlyasMoutawwakil-patch-1
Fix INC (#220)
CLI CUDA vLLM Tests #61: Commit 3731aa1 pushed by IlyasMoutawwakil
July 2, 2024 09:57 4m 27s main
Fix INC
CLI CUDA vLLM Tests #60: Pull request #220 synchronize by IlyasMoutawwakil
July 2, 2024 09:42 6m 48s fix-inc
Fix INC
CLI CUDA vLLM Tests #59: Pull request #220 synchronize by IlyasMoutawwakil
July 2, 2024 09:18 7m 13s fix-inc
Fix INC
CLI CUDA vLLM Tests #58: Pull request #220 opened by IlyasMoutawwakil
July 1, 2024 18:07 7m 5s fix-inc
Pin eager attn in torch-ort backend (#219)
CLI CUDA vLLM Tests #57: Commit dd02f26 pushed by IlyasMoutawwakil
July 1, 2024 16:40 4m 25s main
Pin eager attn in torch-ort backend
CLI CUDA vLLM Tests #56: Pull request #219 opened by IlyasMoutawwakil
July 1, 2024 16:22 7m 15s fix-torch-ort-attn
Fix PyTorchBackend TP vs DP inputs distribution across replicas and…
CLI CUDA vLLM Tests #55: Commit 156844a pushed by IlyasMoutawwakil
July 1, 2024 16:21 4m 27s main
Fix sentence transformers models (#212)
CLI CUDA vLLM Tests #47: Commit 347e13c pushed by IlyasMoutawwakil
May 23, 2024 12:31 6m 30s main
Fix sentence transformers models
CLI CUDA vLLM Tests #46: Pull request #212 opened by IlyasMoutawwakil
May 21, 2024 16:01 7m 19s fix-sentence-transformers-models
Numactl support (#211)
CLI CUDA vLLM Tests #45: Commit dc29eec pushed by IlyasMoutawwakil
May 20, 2024 18:18 6m 34s main