Pull requests: NVIDIA/TensorRT-LLM

Pull requests list

[fix] improve fp4_block_scale_moe_runner type check (labels: Community Engagement, Community want to contribute)
#5681 opened Jul 2, 2025 by Alcanderian
refactor: decoding inputs
#5679 opened Jul 2, 2025 by Funatiq Draft
[Infra] - Waive failed cases on release/0.21
#5674 opened Jul 2, 2025 by EmmaQiaoCh
Fix rerun step
#5672 opened Jul 2, 2025 by yiqingy0 Draft
[Infra] - Waive failed tests for main 0702
#5671 opened Jul 2, 2025 by EmmaQiaoCh
chore: Remove unused isFullContextRequest method
#5666 opened Jul 2, 2025 by Funatiq
add supported models doc
#5662 opened Jul 2, 2025 by QiJune
fix: Set init value for moe expert id
#5660 opened Jul 2, 2025 by WeiHaocheng
[feat] Add TensorRT-Engine Qwen3 model support (labels: Community Engagement, Community want to contribute)
#5650 opened Jul 1, 2025 by gkswns0531
chore: bump version to 1.0.0rc2
#5645 opened Jul 1, 2025 by yiqingy0 Draft