-
Notifications
You must be signed in to change notification settings - Fork 1.5k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[fix] improve fp4_block_scale_moe_runner type check
Community Engagement
help/insights needed from community
Community want to contribute
PRs initiated from Community
#5681
opened Jul 2, 2025 by
Alcanderian
Loading…
chore: update doc by replacing use_cuda_graph with cuda_graph_config
#5680
opened Jul 2, 2025 by
nv-guomingz
Loading…
[TRTLLM-6100] fix: Nvbug 5356427: autotuned TRTLLM Gen fp8 block scale MoE illegal memory access
bug
Something isn't working
#5676
opened Jul 2, 2025 by
DomBrown
Loading…
MTP and derivatives: Align sample state with trtllm sampler sample state
#5675
opened Jul 2, 2025 by
netanel-haber
Loading…
[TRTLLM-5966][feat] Initial steps towards Helix parallelism support
#5668
opened Jul 2, 2025 by
MatthiasKohl
Loading…
fix [nvbug5351244]: address remote mpi session submit
#5664
opened Jul 2, 2025 by
Superjomn
Loading…
Avoiding the kernel launch if no finished context requests
#5659
opened Jul 1, 2025 by
rakib-hasan
Loading…
test: Validate and add accuracy& perf tests for Ministral-8B-Instruct[-FP8](pytorch only)
#5654
opened Jul 1, 2025 by
venkywonka
•
Draft
5 of 7 tasks
[NVBUG:5355009] Modify check for fuse_fp4_quant on SM120
#5651
opened Jul 1, 2025 by
farazkh80
Loading…
[feat] Add TensorRT-Engine Qwen3 model support
Community Engagement
help/insights needed from community
Community want to contribute
PRs initiated from Community
#5650
opened Jul 1, 2025 by
gkswns0531
Loading…
[TRTLLM-4923][feat] Enable CUDA graphs for Nemotron-H
#5646
opened Jul 1, 2025 by
tomeras91
Loading…
[Draft] feat: Add phi-4-multimodal pytorch-backend support
#5644
opened Jul 1, 2025 by
Wanli-Jiang
•
Draft
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.