Skip to content

Actions: vllm-project/vllm

pre-commit

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,307 workflow runs
2,307 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[V1] Move KV block hashes from Request to KVCacheManager (#12922)
pre-commit #2307: Commit 3243158 pushed by WoosukKwon
February 8, 2025 03:14 4m 51s main
February 8, 2025 03:14 4m 51s
[V1][Minor] Remove outdated comment (#12928)
pre-commit #2306: Commit b21f0f9 pushed by WoosukKwon
February 8, 2025 03:07 4m 42s main
February 8, 2025 03:07 4m 42s
[Misc] Fix typo in the example file
pre-commit #2305: Pull request #12896 synchronize by DK-DARKmatter
February 8, 2025 03:06 4m 33s DK-DARKmatter:main
February 8, 2025 03:06 4m 33s
[Hardware][Gaudi][Bugfix] Fix error for guided decoding
pre-commit #2303: Pull request #12317 synchronize by zhouyu5
February 8, 2025 02:57 4m 41s zhouyu5:hpu-guided-decoding
February 8, 2025 02:57 4m 41s
[Misc][Kernel]: Add GPTQAllSpark Quantization
pre-commit #2302: Pull request #12931 opened by wyajieha
February 8, 2025 02:38 2m 31s wyajieha:github-yajie-as
February 8, 2025 02:38 2m 31s
[Misc] Log time consumption on weight downloading
pre-commit #2301: Pull request #12926 synchronize by waltforme
February 8, 2025 01:45 4m 39s waltforme:log-downloading
February 8, 2025 01:45 4m 39s
[V1][Minor] Remove outdated comment
pre-commit #2300: Pull request #12928 opened by WoosukKwon
February 8, 2025 01:41 4m 45s v1-minor-comment-fix
February 8, 2025 01:41 4m 45s
[V1][Metrics] Add GPU prefix cache hit rate % gauge
pre-commit #2299: Pull request #12592 synchronize by comaniac
February 8, 2025 01:40 4m 39s comaniac:v1-cache-metric-2
February 8, 2025 01:40 4m 39s
[Bugfix] Fix multi-round chat error when mistral tokenizer is used
pre-commit #2298: Pull request #12859 synchronize by zifeitong
February 8, 2025 01:19 4m 35s zifeitong:fix_mistral
February 8, 2025 01:19 4m 35s
Expert Parallelism (EP) Support for DeepSeek V2
pre-commit #2296: Pull request #12583 synchronize by cakeng
February 8, 2025 01:09 3m 34s cakeng:moe
February 8, 2025 01:09 3m 34s
[Bugfix] Fix multi-round chat error when mistral tokenizer is used
pre-commit #2295: Pull request #12859 synchronize by zifeitong
February 8, 2025 01:08 4m 44s zifeitong:fix_mistral
February 8, 2025 01:08 4m 44s
[CI] Resolve transformers-neuronx version conflict
pre-commit #2294: Pull request #12925 synchronize by liangfu
February 8, 2025 01:07 4m 40s liangfu:fix-neuron-ci-2
February 8, 2025 01:07 4m 40s
[CI] Resolve transformers-neuronx version conflict
pre-commit #2293: Pull request #12925 synchronize by liangfu
February 8, 2025 01:06 4m 50s liangfu:fix-neuron-ci-2
February 8, 2025 01:06 4m 50s
[Misc] Log time consumption on weight downloading
pre-commit #2292: Pull request #12926 opened by waltforme
February 8, 2025 01:05 4m 42s waltforme:log-downloading
February 8, 2025 01:05 4m 42s
[V1] Move KV block hashes from Request to KVCacheManager
pre-commit #2291: Pull request #12922 synchronize by WoosukKwon
February 8, 2025 01:03 4m 46s v1-mv-block-hashes
February 8, 2025 01:03 4m 46s
[CI] Resolve transformers-neuronx version conflict
pre-commit #2290: Pull request #12925 synchronize by liangfu
February 8, 2025 00:55 4m 52s liangfu:fix-neuron-ci-2
February 8, 2025 00:55 4m 52s
[CI] Resolve transformers-neuronx version conflict
pre-commit #2289: Pull request #12925 opened by liangfu
February 8, 2025 00:51 4m 33s liangfu:fix-neuron-ci-2
February 8, 2025 00:51 4m 33s
[Bugfix] Fix disagg hang caused by the prefill and decode communicati…
pre-commit #2288: Commit 45cbc49 pushed by simon-mo
February 8, 2025 00:39 4m 44s main
February 8, 2025 00:39 4m 44s
[V1] Use msgpack for core request serialization
pre-commit #2287: Pull request #12918 synchronize by njhill
February 8, 2025 00:16 4m 43s njhill:v1-msgpack-reqs
February 8, 2025 00:16 4m 43s
[V1] Move KV block hashes from Request to KVCacheManager
pre-commit #2286: Pull request #12922 synchronize by WoosukKwon
February 8, 2025 00:06 4m 41s v1-mv-block-hashes
February 8, 2025 00:06 4m 41s
[FEATURE] Enables /score endpoint for embedding models
pre-commit #2284: Pull request #12846 synchronize by gmarinho2
February 7, 2025 23:53 2m 32s gmarinho2:scoring-openai
February 7, 2025 23:53 2m 32s