Actions: vllm-project/vllm

Add label on auto-merge enabled

1,424 workflow runs

[Kernel] port sgl moe_align_block_size kernels
Add label on auto-merge enabled #1424: Pull request #12574 auto_merge_enabled by robertgshaw2-redhat
February 1, 2025 20:47 12s
doc: fixing minor typo in readme.md
Add label on auto-merge enabled #1423: Pull request #12643 auto_merge_enabled by DarkLight1337
February 1, 2025 11:23 10s
[Model]: Add transformers backend support
Add label on auto-merge enabled #1422: Pull request #11330 auto_merge_enabled by DarkLight1337
February 1, 2025 06:22 11s
Disable chunked prefill and/or prefix caching when MLA is enabled
Add label on auto-merge enabled #1421: Pull request #12642 auto_merge_enabled by mgoin
February 1, 2025 06:02 11s
Disable chunked prefill and/or prefix caching when MLA is enabled
Add label on auto-merge enabled #1420: Pull request #12638 auto_merge_enabled by simon-mo
February 1, 2025 04:11 13s
Apply torch.compile to fused_moe/grouped_topk
Add label on auto-merge enabled #1419: Pull request #12637 auto_merge_enabled by simon-mo
February 1, 2025 03:20 12s
Fix device return is bytecode instead of str
Add label on auto-merge enabled #1418: Pull request #12635 auto_merge_enabled by robertgshaw2-redhat
February 1, 2025 00:56 10s
Fix target matching for fused layers with compressed-tensors
Add label on auto-merge enabled #1417: Pull request #12617 auto_merge_enabled by mgoin
February 1, 2025 00:38 14s
[Docs][V1] Prefix caching design
Add label on auto-merge enabled #1416: Pull request #12598 auto_merge_enabled by comaniac
January 31, 2025 20:04 11s
[Core] Improve hash collision avoidance in prefix caching
Add label on auto-merge enabled #1415: Pull request #12621 auto_merge_enabled by comaniac
January 31, 2025 17:25 12s
[V1] Bugfix: Validate Model Input Length
Add label on auto-merge enabled #1414: Pull request #12600 auto_merge_enabled by WoosukKwon
January 31, 2025 09:22 11s
[Git] Automatically sign-off commits
Add label on auto-merge enabled #1413: Pull request #12595 auto_merge_enabled by comaniac
January 31, 2025 08:50 13s
[v1][Bugfix] Add extra_keys to block_hash for prefix caching
Add label on auto-merge enabled #1412: Pull request #12603 auto_merge_enabled by comaniac
January 31, 2025 06:06 13s
[V1] Bugfix: Validate Model Input Length
Add label on auto-merge enabled #1411: Pull request #12600 auto_merge_enabled by comaniac
January 31, 2025 02:35 14s
[ci] Upgrade transformers to 4.48.2 in CI dependencies
Add label on auto-merge enabled #1410: Pull request #12599 auto_merge_enabled by tlrmchlsmth
January 31, 2025 01:47 13s
[BugFix] Fix Torch.Compile For DeepSeek
Add label on auto-merge enabled #1409: Pull request #12594 auto_merge_enabled by robertgshaw2-redhat
January 31, 2025 01:24 12s
[Feature] Fix guided decoding blocking bitmask memcpy
Add label on auto-merge enabled #1408: Pull request #12563 auto_merge_enabled by mgoin
January 30, 2025 22:16 12s
[Kernel] Update cutlass_scaled_mm to support 2d group (blockwise) scaling
Add label on auto-merge enabled #1407: Pull request #11868 auto_merge_enabled by tlrmchlsmth
January 30, 2025 19:14 14s
[Bugfix] Gracefully handle huggingface hub http error
Add label on auto-merge enabled #1406: Pull request #12571 auto_merge_enabled by comaniac
January 30, 2025 16:25 12s
[Kernel] Triton Configs for Fp8 Block Quantization
Add label on auto-merge enabled #1405: Pull request #11589 auto_merge_enabled by mgoin
January 30, 2025 03:28 14s
[V1][Log] Add max request concurrency log to V1
Add label on auto-merge enabled #1404: Pull request #12569 auto_merge_enabled by robertgshaw2-redhat
January 30, 2025 03:23 11s
[Misc] fix typo: add missing space in lora adapter error message
Add label on auto-merge enabled #1403: Pull request #12564 auto_merge_enabled by mgoin
January 30, 2025 00:30 12s
[Misc][MoE] add Deepseek-V3 moe tuning support
Add label on auto-merge enabled #1402: Pull request #12558 auto_merge_enabled by mgoin
January 29, 2025 22:58 10s
[V1][BugFix] Free encoder cache for aborted requests
Add label on auto-merge enabled #1401: Pull request #12545 auto_merge_enabled by WoosukKwon
January 29, 2025 20:19 15s
Revert "[Build/CI] Fix libcuda.so linkage"
Add label on auto-merge enabled #1400: Pull request #12552 auto_merge_enabled by tlrmchlsmth
January 29, 2025 19:11 12s