[jax-inference-offloading] consolidate definitions for default tensor dtype by yhtang · Pull Request #1816 · NVIDIA/JAX-Toolbox

yhtang · 2025-12-05T00:28:55Z

Consolidates the definition for the default tensor dtype in refitting specs, via the dtype="bfloat16" keyword argument of make_mapping(...) in models/__init__.py. Since all current refitting specs are defined via make_mapping, this gives us a single source of truth for the default tensor dtype. This change should not introduce any visiblel functional changes for existing models.

Copilot

Pull request overview

This PR refactors the default dtype handling by moving the default value from Python code to the protobuf definition. Instead of using fallback logic (param.vllm_param.dtype or 'bfloat16') in the Python code, the default is now specified directly in the proto file, simplifying the code and making the default more explicit.

Key changes:

Added default value 'bfloat16' to the dtype field in the VllmParam message definition
Removed all fallback or 'bfloat16' logic from the Python code in four locations

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

File	Description
jax-inference-offloading/jax_inference_offloading/api/param_mapping.proto	Added default value for the dtype field in VllmParam message
jax-inference-offloading/jax_inference_offloading/vllm/extension.py	Removed fallback logic for dtype in update_weights and update_weights_grouped methods

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

jax-inference-offloading/jax_inference_offloading/api/param_mapping.proto

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

jreiffers · 2025-12-05T07:47:30Z

Why?

yhtang · 2025-12-05T11:53:40Z

Why?

Would it be better to have a single source of truth for model's default dtype?

jreiffers · 2025-12-09T07:50:49Z

Default values in protos are a bit of an anti pattern (they even removed the feature completely in proto3). Once you put them in, you can never remove or change them again. I think they're acceptable when there's an obviously meaningful default, but I don't think that's the case here. I'd keep it in the application logic.

yhtang · 2025-12-10T07:10:20Z

Default values in protos are a bit of an anti pattern (they even removed the feature completely in proto3). Once you put them in, you can never remove or change them again. I think they're acceptable when there's an obviously meaningful default, but I don't think that's the case here. I'd keep it in the application logic.

That makes sense. I have removed the default value for dtype from the proto file. Since all refitting specs are currently defined using make_mapping, would it be sufficient to rely on the default value there instead? Assuming it is OK to do so, I've updated the PR title and descriptiona accordingly.

jreiffers · 2025-12-11T09:40:32Z

jax-inference-offloading/jax_inference_offloading/models/__init__.py


 def make_mapping(
-  jax_name, vllm_name, vllm_shape, *, transform=None, jax_prefix="model", vllm_prefix="model"
+  jax_name, vllm_name, vllm_shape, *, transform=None, jax_prefix="model", vllm_prefix="model", dtype="bfloat16"


s/dtype/vllm_dtype/?

At the moment we don’t support any dtype conversion between the JAX and vLLM sides, so only vllm_param carries a dtype field, and the dtypes are expected to match between JAX and vLLM. Once we add conversion support, it may even make sense to stop specifying dtype in make_mapping altogether and instead rely on the handshake to discover the dtype at runtime.

[jax-inference-offloading] Move default dtype to proto

3ad4a53

yhtang requested review from Copilot and jreiffers December 5, 2025 00:28

Copilot started reviewing on behalf of yhtang December 5, 2025 00:29 View session

Copilot finished reviewing on behalf of yhtang December 5, 2025 00:31

Copilot AI reviewed Dec 5, 2025

View reviewed changes

jax-inference-offloading/jax_inference_offloading/api/param_mapping.proto Outdated Show resolved Hide resolved

yhtang and others added 2 commits December 5, 2025 00:44

default dtype

83d79db

Fix proto

0093423

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Merge branch 'main' into yhtang/jio-default-dtype

5b8ea2d

remove default dtype from proto

301cd23

yhtang changed the title ~~[jax-inference-offloading] Move default dtype to proto~~ [jax-inference-offloading] consolidate definitions for default tensor dtype Dec 10, 2025

jreiffers reviewed Dec 11, 2025

View reviewed changes

jreiffers approved these changes Dec 11, 2025

View reviewed changes

Merge branch 'main' into yhtang/jio-default-dtype

740c4ff

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[jax-inference-offloading] consolidate definitions for default tensor dtype#1816

[jax-inference-offloading] consolidate definitions for default tensor dtype#1816
yhtang wants to merge 6 commits intomainfrom
yhtang/jio-default-dtype

yhtang commented Dec 5, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

jreiffers commented Dec 5, 2025

Uh oh!

yhtang commented Dec 5, 2025

Uh oh!

jreiffers commented Dec 9, 2025

Uh oh!

yhtang commented Dec 10, 2025

Uh oh!

jreiffers Dec 11, 2025

Uh oh!

yhtang Dec 11, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

yhtang commented Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

jreiffers commented Dec 5, 2025

Uh oh!

yhtang commented Dec 5, 2025

Uh oh!

jreiffers commented Dec 9, 2025

Uh oh!

yhtang commented Dec 10, 2025

Uh oh!

jreiffers Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

yhtang Dec 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

yhtang commented Dec 5, 2025 •

edited

Loading

yhtang Dec 11, 2025 •

edited

Loading