
Conversation

@sogartar
Contributor


We are already saving these in `parallelism_config`.
res["activation_dtype"] = dtype_to_serialized_name(self.activation_dtype)
res["attention_dtype"] = dtype_to_serialized_name(self.attention_dtype)
res["fake_quant"] = self.fake_quant
res["tensor_parallelism_size"] = self.tensor_parallelism_size
Contributor


`tensor_parallelism_size` is not being saved by `parallelism_config`.

Contributor Author


But it is dynamically computed from there.

`LlamaModelConfig`:

    @property
    def tensor_parallelism_size(self) -> int:
        return self.parallelism_config.tensor_parallelism_size

`ParallelismConfig`:

    @property
    def tensor_parallelism_size(self) -> int:
        """
        How many devices are involved for tensor parallel sharding.
        If greater than 1, the model will expect sharded model parameters and function
        arguments.
        """
        return len(self.devices_for_pipeline(0))
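
For reference, a minimal runnable sketch of this derived-property pattern (the dataclass layout and the `pipeline_to_devices` field are illustrative assumptions, not the actual definitions in the repository):

    from dataclasses import dataclass, field

    @dataclass
    class ParallelismConfig:
        # One entry per pipeline stage; each entry lists the devices used for
        # tensor-parallel sharding of that stage. (Hypothetical field name.)
        pipeline_to_devices: list[list[int]] = field(default_factory=lambda: [[0]])

        def devices_for_pipeline(self, pipeline: int) -> list[int]:
            return self.pipeline_to_devices[pipeline]

        @property
        def tensor_parallelism_size(self) -> int:
            return len(self.devices_for_pipeline(0))

    @dataclass
    class LlamaModelConfig:
        parallelism_config: ParallelismConfig = field(default_factory=ParallelismConfig)

        @property
        def tensor_parallelism_size(self) -> int:
            # Derived on every access, never stored on the instance.
            return self.parallelism_config.tensor_parallelism_size

    config = LlamaModelConfig(ParallelismConfig(pipeline_to_devices=[[0, 1, 2, 3]]))
    assert config.tensor_parallelism_size == 4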

Actually, until now this init

    return LlamaModelConfig(**kwargs)

was causing an overwrite of the method-like properties:

    @property
    def pipeline_parallelism_size(self) -> int:
        return self.parallelism_config.pipeline_size

    @property
    def tensor_parallelism_size(self) -> int:
        return self.parallelism_config.tensor_parallelism_size

This may cause a discrepancy with `LlamaModelConfig.parallelism_config`.

One more casualty of the power of Python's dynamism.
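
(A hedged sketch of that discrepancy concern, reusing the hypothetical `ParallelismConfig` above: if the saved properties carry both an explicit `tensor_parallelism_size` and a `parallelism_config`, the two can drift apart and the loader has to pick one. The keys and values here are made up.)

    saved = {
        "tensor_parallelism_size": 2,  # stored explicitly, to_properties-style
        "parallelism_config": {"pipeline_to_devices": [[0, 1, 2, 3]]},  # implies 4
    }

    # Rebuilding only from parallelism_config silently ignores the explicit value.
    restored = ParallelismConfig(**saved["parallelism_config"])
    assert restored.tensor_parallelism_size == 4
    assert restored.tensor_parallelism_size != saved["tensor_parallelism_size"]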

Contributor Author


Never mind, it's not that.

@sogartar marked this pull request as draft on October 2, 2025, 12:57.