Commit e120c50

Release CI fix
Signed-off-by: Keval Morabia <[email protected]>
1 parent e74a468 commit e120c50

2 files changed, +5 -9 lines changed

.gitlab/release.yml

Lines changed: 3 additions & 0 deletions
@@ -5,17 +5,20 @@ build-and-upload-wheels:
 stage: release
 timeout: 15m
 tags: [type/docker, os/linux] # Use a runner with these tags
+needs: []
 rules:
 - if: $JET_ONLY != null
 when: never
 - if: $CI_COMMIT_TAG =~ /^\d+\.\d+\.\d+$/
+when: manual
 variables:
 RELEASE: "true"
 TWINE_USERNAME: svc-dl-algo-ammo
 TWINE_PASSWORD: $ARTIFACTORY_TOKEN # Configured in GitLab > Settings > CI/CD
 REPO_URL: https://urm.nvidia.com/artifactory/api/pypi/sw-dl-algo-ammo-pypi-local
 - if: $CI_PIPELINE_SOURCE == "schedule"
 variables:
+when: manual
 RELEASE: "false"
 TWINE_USERNAME: gitlab-ci-token
 TWINE_PASSWORD: $CI_JOB_TOKEN
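
The release rule above fires only when `$CI_COMMIT_TAG` matches `/^\d+\.\d+\.\d+$/`. The sketch below approximates that check in Python to show which tag names would qualify. It is illustrative only: Python's `re` module is not the regex engine GitLab uses, and the helper name `is_release_tag` and the sample tags are hypothetical, not taken from the repository.

```python
import re

# Pattern copied from the rule `$CI_COMMIT_TAG =~ /^\d+\.\d+\.\d+$/`.
# re.ASCII restricts \d to 0-9 (an assumption about the intended behavior).
RELEASE_TAG = re.compile(r"^\d+\.\d+\.\d+$", re.ASCII)


def is_release_tag(tag: str) -> bool:
    """Return True when the tag is exactly three dot-separated digit groups."""
    return RELEASE_TAG.search(tag) is not None


if __name__ == "__main__":
    # Hypothetical tag names, chosen only to exercise the pattern.
    for tag in ["0.40.0", "0.39.0rc1", "v0.40.0", "0.40"]:
        print(f"{tag!r}: {is_release_tag(tag)}")
```

Under this pattern a tag with a `v` prefix, a pre-release suffix, or only two components would not select the release rule.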

CHANGELOG.rst

Lines changed: 2 additions & 9 deletions
@@ -1,13 +1,5 @@
 Model Optimizer Changelog (Linux)
 =================================
-0.41 (2025-12-xx)
-^^^^^^^^^^^^^^^^^
-
-**Deprecations**
-
-**New Features**
-- Add FP8/NVFP4 KV cache quantization support for Megatron Core models.
-
 
 0.40 (2025-12-xx)
 ^^^^^^^^^^^^^^^^^
@@ -20,8 +12,9 @@ Model Optimizer Changelog (Linux)
 
 - Add MoE (e.g. Qwen3-30B-A3B) pruning support for ``num_moe_experts``, ``moe_ffn_hidden_size`` and ``moe_shared_expert_intermediate_size`` parameters in Minitron pruning (``mcore_minitron``).
 - Add ``specdec_bench`` example to benchmark speculative decoding performance. See `examples/specdec_bench/README.md <https://github.com/NVIDIA/TensorRT-Model-Optimizer/tree/main/examples/specdec_bench#speculative-decoding-benchmark>`_ for more details.
+- Add FP8/NVFP4 KV cache quantization support for Megatron Core models.
 
-0.39 (2025-11-14)
+0.39 (2025-11-11)
 ^^^^^^^^^^^^^^^^^
 
 **Deprecations**
