Pull requests: NVIDIA/TensorRT-Model-Optimizer
Enable Yarn RoPE in minitron pruning for gpt-oss support
#530
opened Nov 8, 2025 by
kevalmorabia97
Draft
Make wheel build manual CI job and diffusers test fix
#529
opened Nov 8, 2025 by
kevalmorabia97
[BUG FIX 5616904] Add transformers version restoration after PTQ for VILA
#525
opened Nov 7, 2025 by
yueshen2016
Update custom file name patterns when copy files and remove problematic parameters in export
#520
opened Nov 6, 2025 by
Edwardf0t1
Fix DQ1 output type error in DQ1->DQ2 for FP4 weights in NVFP4 model
#513
opened Nov 5, 2025 by
vishalpandya1990
[5591945][5589019@13][ONNX] Fix 'nodes not sorted' failure
#507
opened Nov 4, 2025 by
gcunhase
[OMNIML-2917] handle lm_head and other un-quantized modules correctly
#504
opened Nov 4, 2025 by
shengliangxu
[Draft] [5526696] Add kv cache quantization support for onnx quantization
#486
opened Oct 31, 2025 by
zhanghaoc
Add functional test cases for published checkpoints on HF
#455
opened Oct 21, 2025 by
noeyy-mino
Preserve original rope scaling type in export due to transformers library AutoConfig issue
#452
opened Oct 17, 2025 by
Edwardf0t1
[1/2] Registry interface for custom quantization functional backend
#449
opened Oct 17, 2025 by
realAsma