Skip to content

Pull requests: NVIDIA/TensorRT-Model-Optimizer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[2/n] Add Core Sparse Attention Infrastructure
#527 opened Nov 7, 2025 by kaix-nv Loading…
parallel eagle draft
#523 opened Nov 6, 2025 by yeyu-nvidia Draft
[Bug #193] fix fp8 blockwise real quantization
#522 opened Nov 6, 2025 by meenchen Loading…
Fix BMM style MoE export in fp8_pc_pt recipe
#515 opened Nov 5, 2025 by Edwardf0t1 Loading…
Alit/moe dev2
#508 opened Nov 4, 2025 by JRD971000 Draft
Add decilm modelling code
#505 opened Nov 4, 2025 by danielkorzekwa Loading…
PyTorch geometric quantization support
#494 opened Nov 3, 2025 by i-riyad Loading…
Compress tutorial (PoC)
#492 opened Nov 3, 2025 by danielkorzekwa Loading…
Update benchmarking for diffusers
#487 opened Oct 31, 2025 by ajrasane Loading…
Yeyu/set block
#480 opened Oct 28, 2025 by yeyu-nvidia Draft
feat: add onnxslim support
#478 opened Oct 28, 2025 by inisis Loading…
Feat: Eagle3 HF Online - support nemotron models
#463 opened Oct 25, 2025 by h-guo18 Loading…
ProTip! What’s not been updated in a month: updated:<2025-10-09.