Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[Bugfix] fix moe_wna16 get_quant_method ready ONLY add when PR is ready to merge/full CI is needed
#12648 opened Feb 1, 2025 by jinzhen-lin Loading…
[V1][Metrics] Add several request timing histograms ready ONLY add when PR is ready to merge/full CI is needed v1
#12644 opened Feb 1, 2025 by markmc Draft
[Core] choice-based structured output with xgrammar ci/build ready ONLY add when PR is ready to merge/full CI is needed structured-output
#12632 opened Jan 31, 2025 by russellb Loading…
[Misc] Add SPDX-License-Identifier headers to python source files ci/build documentation Improvements or additions to documentation frontend
#12628 opened Jan 31, 2025 by russellb Loading…
[Core] Add Additional Metrics to vLLM Server
#12627 opened Jan 31, 2025 by sahelib25 Loading…
[CI] Fix flaky CI test ci/build
#12626 opened Jan 31, 2025 by NickLucche Loading…
[ROCm] Using a more precise memory profiling
#12624 opened Jan 31, 2025 by gshtras Loading…
[Core] Improve hash collision avoidance in prefix caching needs-rebase ready ONLY add when PR is ready to merge/full CI is needed v1
#12621 opened Jan 31, 2025 by russellb Loading…
[Core] Silence unnecessary deprecation warnings ready ONLY add when PR is ready to merge/full CI is needed
#12620 opened Jan 31, 2025 by russellb Loading…
Fix quark fp8 format loading
#12612 opened Jan 31, 2025 by fxmarty-amd Loading…
From Lora Tensors
#12609 opened Jan 31, 2025 by borisshapa Loading…
[Core][v1] Unify allocating slots in prefill and decode in KV cache manager ready ONLY add when PR is ready to merge/full CI is needed v1
#12608 opened Jan 31, 2025 by ShawnD200 Loading…
[Draft] Qwen2.5-VL frontend v1
#12604 opened Jan 31, 2025 by ywang96 Draft
2 of 4 tasks
Implement MLA for deepseek v3/r1
#12597 opened Jan 31, 2025 by yessenzhar Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.