Projection connector by nune-tadevosyan · Pull Request #15468 · NVIDIA-NeMo/NeMo

nune-tadevosyan · 2026-03-05T16:52:26Z

Important

The Update branch button must only be pressed on very rare occasions.
An outdated branch is never blocking the merge of a PR.
Please reach out to the automation team before pressing that button.

What does this PR do?

Adds MultiLayerProjectionConnector support for SALM models.

Changelog

Adds support for the MultiLayerProjectionConnector.
Adds support for initializing SALM model from pretrained checkpoints.

Usage

You can potentially add a usage example below

# Add a code snippet demonstrating how to use this

GitHub Actions CI

The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.

The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?
Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
- Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

New Feature
Bugfix
Documentation

If you haven't finished some of the above items, you can still open a "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
The Contributor guidelines list specific people who can review PRs in various areas.

Additional Information

Related to # (issue)

pzelasko · 2026-03-05T16:57:57Z

nemo/collections/speechlm2/models/salm.py

        setup_speech_encoder(self, pretrained_weights=self.cfg.pretrained_weights)

+
+        # Load pretrained weights if provided


Why is this needed in the PR? It looks to me like a separate feature that we may want to implement in a generic way so that every speechlm2 model can reuse it.
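As a rough illustration of the generic approach suggested here, the non-strict load could live in one shared helper. This is only a sketch under assumptions: `load_pretrained_weights` is a hypothetical name, not an existing NeMo function, and the toy `nn.Sequential` model stands in for a speechlm2 model. The only library behavior relied on is that `torch.nn.Module.load_state_dict(..., strict=False)` returns an object with `missing_keys` and `unexpected_keys`.

```python
import torch
from torch import nn


def load_pretrained_weights(model: nn.Module, state_dict: dict) -> tuple[list[str], list[str]]:
    """Hypothetical shared helper: load what matches and report what does not."""
    # strict=False skips mismatched keys instead of raising, and reports them.
    result = model.load_state_dict(state_dict, strict=False)
    return list(result.missing_keys), list(result.unexpected_keys)


# Toy stand-in for a model whose checkpoint only covers part of the weights.
model = nn.Sequential(nn.Linear(2, 2), nn.Linear(2, 2))
checkpoint = {
    "0.weight": torch.ones(2, 2),
    "0.bias": torch.zeros(2),
    "extra.weight": torch.ones(1),  # key the model does not have
}

missing, unexpected = load_pretrained_weights(model, checkpoint)
print("Missing keys:", missing)
print("Unexpected keys:", unexpected)
```

Any model could then call the helper after building its submodules, and the caller decides whether missing or unexpected keys deserve a warning.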

pzelasko · 2026-03-05T17:02:47Z

nemo/collections/speechlm2/modules/perception.py

+        if isinstance(self.modality_adapter, (QformerConnector, MultiLayerProjectionConnector)):
+            # Store as a plain Python attribute (not an nn.Module submodule) to avoid duplicate
+            # state_dict entries: encoder_multilayer.encoder IS self.encoder, so safetensors
+            # would de-duplicate them and produce spurious "missing key" warnings on reload.


This might be overkill. Can we instead do:

    # note: not self.encoder
    encoder = self.from_config_dict(cfg.encoder)
    if isinstance(self.modality_adapter, (QformerConnector, MultiLayerProjectionConnector)):
        self.encoder_multilayer = ConformerMultiLayerFeatureExtractor(encoder, ...)
    else:
        self.encoder = encoder
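The duplication discussed in this thread can be reproduced with plain PyTorch. The sketch below is not the NeMo code: `Wrapper`, `Shared`, `PlainAttribute`, and `WrapperOnly` are hypothetical stand-ins, with `Wrapper` playing the role of the multi-layer feature extractor and `nn.Linear` playing the role of the encoder. It compares the `state_dict` keys produced by registering the encoder twice, by storing the wrapper as a plain Python attribute, and by registering the encoder only through the wrapper.

```python
from torch import nn


class Wrapper(nn.Module):
    """Hypothetical stand-in for a feature extractor that wraps an encoder."""

    def __init__(self, encoder: nn.Module):
        super().__init__()
        self.encoder = encoder


class Shared(nn.Module):
    """Registers the same encoder twice: directly and through the wrapper."""

    def __init__(self):
        super().__init__()
        self.encoder = nn.Linear(4, 4)
        self.encoder_multilayer = Wrapper(self.encoder)


class PlainAttribute(nn.Module):
    """Keeps the wrapper out of the submodule registry."""

    def __init__(self):
        super().__init__()
        self.encoder = nn.Linear(4, 4)
        # object.__setattr__ bypasses nn.Module.__setattr__, so the wrapper
        # is stored in the instance __dict__ and not registered as a submodule.
        object.__setattr__(self, "encoder_multilayer", Wrapper(self.encoder))


class WrapperOnly(nn.Module):
    """Registers the encoder only inside the wrapper."""

    def __init__(self):
        super().__init__()
        encoder = nn.Linear(4, 4)  # note: not self.encoder
        self.encoder_multilayer = Wrapper(encoder)


def state_keys(model: nn.Module) -> list[str]:
    return sorted(model.state_dict().keys())


for cls in (Shared, PlainAttribute, WrapperOnly):
    print(cls.__name__, state_keys(cls()))
```

`Shared` lists the encoder weights under both `encoder.` and `encoder_multilayer.encoder.`, while the other two variants list them once. The trade-off of the plain-attribute variant is that calls such as `.eval()` reach the shared encoder but not the wrapper itself, since the wrapper is no longer a registered child.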

pzelasko · 2026-03-06T14:44:39Z

nemo/collections/speechlm2/models/salm.py

-            missing_keys, unexpected_keys = self.load_state_dict(tensors, strict=False)
-            logging.warning(f"Missing keys: {missing_keys}")
-            logging.warning(f"Unexpected keys: {unexpected_keys}")
+        maybe_install_lora(self)


Can we move maybe_install_lora back to line 67?

nune-tadevosyan requested a review from pzelasko March 5, 2026 16:53

pzelasko requested changes Mar 5, 2026

pzelasko reviewed Mar 6, 2026

nune-tadevosyan added 4 commits March 8, 2026 08:35

projection fix

1fa51bc

Signed-off-by: Nune <ntadevosyan@nvidia.com>

fix for init from existing model

fbc297d

Signed-off-by: ntadevosyan <ntadevosyan@nvidia.com>

fixes

a79a568

Signed-off-by: ntadevosyan <ntadevosyan@nvidia.com>

update

acce061

Signed-off-by: ntadevosyan <ntadevosyan@nvidia.com>

nune-tadevosyan force-pushed the projection_connector branch from d48c89a to acce061 March 8, 2026 15:35

Apply isort and black reformatting

1abfb28

Signed-off-by: nune-tadevosyan <nune-tadevosyan@users.noreply.github.com>

pzelasko approved these changes Mar 9, 2026

pzelasko added the Run CICD label Mar 9, 2026

pzelasko enabled auto-merge (squash) March 9, 2026 17:30

pzelasko temporarily deployed to test March 9, 2026 17:32 — with GitHub Actions Inactive

nune-tadevosyan wants to merge 5 commits into NVIDIA-NeMo:main from nune-tadevosyan:projection_connector
