Skip to content

Actions: vllm-project/vllm

pre-commit

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,316 workflow runs
2,316 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[CI/Build][v1] vLLM v1 automatic benchmarking
pre-commit #2266: Pull request #12919 synchronize by Shaoting-Feng
February 7, 2025 21:59 4m 44s Shaoting-Feng:v1-benchmark
February 7, 2025 21:59 4m 44s
Add flag for enabling finer-grained cuda graph capture
pre-commit #2265: Pull request #12920 synchronize by benchislett
February 7, 2025 21:51 4m 34s CentML:extra-small-graphs
February 7, 2025 21:51 4m 34s
[CI/Build][v1] vLLM v1 automatic benchmarking
pre-commit #2263: Pull request #12919 opened by Shaoting-Feng
February 7, 2025 21:45 4m 35s Shaoting-Feng:v1-benchmark
February 7, 2025 21:45 4m 35s
[V1] Use msgpack for core request serialization
pre-commit #2260: Pull request #12918 synchronize by njhill
February 7, 2025 20:55 4m 43s njhill:v1-msgpack-reqs
February 7, 2025 20:55 4m 43s
[V1] Use msgpack for core request serialization
pre-commit #2259: Pull request #12918 opened by njhill
February 7, 2025 20:46 4m 46s njhill:v1-msgpack-reqs
February 7, 2025 20:46 4m 46s
[Bugfix] Fix multi-round chat error when mistral tokenizer is used
pre-commit #2258: Pull request #12859 synchronize by zifeitong
February 7, 2025 20:14 4m 37s zifeitong:fix_mistral
February 7, 2025 20:14 4m 37s
[Hardware][TPU] Multi-LoRA implementation for the TPU backend
pre-commit #2256: Pull request #12623 synchronize by Akshat-Tripathi
February 7, 2025 19:11 4m 31s krai:multi_lora_tpu
February 7, 2025 19:11 4m 31s
[Model] Ultravox Model: Support v0.5 Release
pre-commit #2255: Pull request #12912 synchronize by farzadab
February 7, 2025 19:09 4m 40s fixie-ai:farzad-ultravox-v05
February 7, 2025 19:09 4m 40s
[ROCm][V1] Add intial ROCm support to V1
pre-commit #2254: Pull request #12790 synchronize by SageMoore
February 7, 2025 19:09 4m 50s neuralmagic:sage/amd-v1
February 7, 2025 19:09 4m 50s
[V1][Metrics] Add GPU prefix cache hit rate % gauge
pre-commit #2253: Pull request #12592 synchronize by comaniac
February 7, 2025 19:08 4m 38s comaniac:v1-cache-metric-2
February 7, 2025 19:08 4m 38s
[V1][Metrics] Add GPU prefix cache hit rate % gauge
pre-commit #2252: Pull request #12592 synchronize by comaniac
February 7, 2025 19:07 4m 54s comaniac:v1-cache-metric-2
February 7, 2025 19:07 4m 54s
[Hardware][TPU] Multi-LoRA implementation for the TPU backend
pre-commit #2251: Pull request #12623 synchronize by Akshat-Tripathi
February 7, 2025 19:05 3m 59s krai:multi_lora_tpu
February 7, 2025 19:05 3m 59s
[FEATURE] Enables /score endpoint for embedding models
pre-commit #2250: Pull request #12846 synchronize by gmarinho2
February 7, 2025 18:59 2m 36s gmarinho2:scoring-openai
February 7, 2025 18:59 2m 36s
[Hardware][TPU] Multi-LoRA implementation for the TPU backend
pre-commit #2249: Pull request #12623 synchronize by Akshat-Tripathi
February 7, 2025 18:56 3m 56s krai:multi_lora_tpu
February 7, 2025 18:56 3m 56s
[FEATURE] Enables /score endpoint for embedding models
pre-commit #2248: Pull request #12846 synchronize by gmarinho2
February 7, 2025 18:53 2m 33s gmarinho2:scoring-openai
February 7, 2025 18:53 2m 33s
[Hardware][TPU] Multi-LoRA implementation for the TPU backend
pre-commit #2247: Pull request #12623 synchronize by Akshat-Tripathi
February 7, 2025 18:48 4m 1s krai:multi_lora_tpu
February 7, 2025 18:48 4m 1s
[FEATURE] Enables /score endpoint for embedding models
pre-commit #2246: Pull request #12846 synchronize by gmarinho2
February 7, 2025 18:47 2m 28s gmarinho2:scoring-openai
February 7, 2025 18:47 2m 28s
[FEATURE] Enables /score endpoint for embedding models
pre-commit #2245: Pull request #12846 synchronize by gmarinho2
February 7, 2025 18:43 2m 34s gmarinho2:scoring-openai
February 7, 2025 18:43 2m 34s
Add pipeline parallel support to TransformersModel
pre-commit #2244: Pull request #12832 synchronize by hmellor
February 7, 2025 18:35 4m 56s hmellor:pipeline-parallel
February 7, 2025 18:35 4m 56s
[FEATURE] Enables /score endpoint for embedding models
pre-commit #2243: Pull request #12846 synchronize by gmarinho2
February 7, 2025 18:33 2m 34s gmarinho2:scoring-openai
February 7, 2025 18:33 2m 34s
Add pipeline parallel support to TransformersModel
pre-commit #2242: Pull request #12832 synchronize by hmellor
February 7, 2025 18:26 4m 42s hmellor:pipeline-parallel
February 7, 2025 18:26 4m 42s