Actions: vllm-project/vllm

pre-commit

1,463 workflow runs

[Attention] Deepseek v3 MLA support with FP8 compute
pre-commit #1360: Pull request #12601 synchronize by LucasWilkinson
January 31, 2025 20:45 4m 4s LucasWilkinson:mla-fp8
[Docs][V1] Prefix caching design (#12598)
pre-commit #1359: Commit 60bcef0 pushed by simon-mo
January 31, 2025 20:30 5m 39s main
[Git] Automatically sign-off commits (#12595)
pre-commit #1358: Commit 847f883 pushed by simon-mo
January 31, 2025 20:30 5m 40s main
[Feature] Fix guided decoding blocking bitmask memcpy
pre-commit #1357: Pull request #12563 synchronize by xpbowler
January 31, 2025 20:29 4m 35s xpbowler:main
[Feature] Fix guided decoding blocking bitmask memcpy
pre-commit #1356: Pull request #12563 synchronize by xpbowler
January 31, 2025 20:25 4m 43s xpbowler:main
[Bugfix] Revert MoE Triton Config Default
pre-commit #1355: Pull request #12629 opened by robertgshaw2-redhat
January 31, 2025 20:20 4m 37s fix-triton-configs
[BugFix] Fix Torch.Compile For DeepSeek (#12594)
pre-commit #1353: Commit 325f679 pushed by simon-mo
January 31, 2025 20:06 4m 47s main
[Docs][V1] Prefix caching design
pre-commit #1352: Pull request #12598 synchronize by comaniac
January 31, 2025 20:01 4m 33s comaniac:v1-apc
[Docs][V1] Prefix caching design
pre-commit #1351: Pull request #12598 synchronize by comaniac
January 31, 2025 19:57 4m 39s comaniac:v1-apc
[Docs][V1] Prefix caching design
pre-commit #1350: Pull request #12598 synchronize by comaniac
January 31, 2025 19:56 4m 38s comaniac:v1-apc
[V1][Metrics] Add GPU prefix cache hit rate % gauge
pre-commit #1347: Pull request #12592 synchronize by comaniac
January 31, 2025 19:32 4m 42s comaniac:v1-cache-metric-2
Fix quantization for chatglm
pre-commit #1346: Pull request #12586 synchronize by kylesayrs
January 31, 2025 19:26 4m 50s neuralmagic:chatglm-quant-fix
[Core] Add Additional Metrics to vLLM Server
pre-commit #1345: Pull request #12627 opened by sahelib25
January 31, 2025 19:26 4m 28s krai:add_metrics
[CI] Fix flaky CI test
pre-commit #1344: Pull request #12626 opened by NickLucche
January 31, 2025 19:23 4m 37s NickLucche:flaky-test
[V1][Metrics] Add GPU prefix cache hit rate % gauge
pre-commit #1343: Pull request #12592 synchronize by comaniac
January 31, 2025 19:16 4m 25s comaniac:v1-cache-metric-2
[Attention] Deepseek v3 MLA support with FP8 compute
pre-commit #1342: Pull request #12601 synchronize by LucasWilkinson
January 31, 2025 19:14 4m 13s LucasWilkinson:mla-fp8
[RFC][vllm-API] Support tokenizer registry for customized tokenizer in vLLM
pre-commit #1340: Pull request #12518 synchronize by youngkent
January 31, 2025 18:58 2m 22s youngkent:main