[core] feat: support group offloading at the pipeline level #12283
Conversation
record_stream: bool = False,
low_cpu_mem_usage=False,
offload_to_disk_path: Optional[str] = None,
exclude_modules: Optional[Union[str, List[str]]] = None,
I think it's okay to expose this as an argument, as opposed to how we do `model_cpu_offload_seq`, for example:

model_cpu_offload_seq = "text_encoder->text_encoder_2->image_encoder->transformer->vae"

This is because model CPU offloading relies on a sequence for device management. I don't think we have that constraint in the case of group offloading.
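The distinction can be sketched in plain Python. This is a toy simulation, not the diffusers implementation; the function names, `Component` class, and device strings are all made up for illustration:

```python
# Toy contrast between sequential model CPU offloading (needs a declared
# order) and group offloading (no global ordering constraint).
# Hypothetical names; not the actual diffusers hooks.

class Component:
    def __init__(self, name):
        self.name = name
        self.device = "cpu"

def run_with_sequential_offload(components, order):
    """Model CPU offloading: components run in the order declared by a
    string like "text_encoder->transformer->vae"; only one component
    sits on the accelerator at a time."""
    trace = []
    for name in order.split("->"):
        comp = components[name]
        comp.device = "cuda"          # onload just before use
        trace.append((comp.name, comp.device))
        comp.device = "cpu"           # offload before the next component
    return trace

def run_with_group_offload(components):
    """Group offloading: each component's groups are moved on/off the
    device by per-module hooks, so no pipeline-wide sequence needs to
    be declared up front."""
    trace = []
    for comp in components.values():  # iteration order does not matter
        comp.device = "cuda"
        trace.append((comp.name, comp.device))
        comp.device = "cpu"
    return trace

components = {n: Component(n) for n in ["text_encoder", "transformer", "vae"]}
seq_trace = run_with_sequential_offload(components, "text_encoder->transformer->vae")
grp_trace = run_with_group_offload(components)
```

The point of the sketch: the sequential variant breaks if the order string is wrong, while the group variant has no such input to get wrong, which is why exposing per-call arguments (rather than a sequence string) seems fine here.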
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Really neat how simplified this has become! 🎉
Nice! We could reduce LoC by creating a kwargs dict and passing it into both branches, but definitely not a blocker.
Tests look good, but maybe the two could be combined into one to reduce the total run count.
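The kwargs-dict suggestion could look roughly like this. It is a sketch only: `enable_offload` is a stub standing in for the real entrypoint, and the branch condition and path are made up; the shared parameter names mirror the signature shown in the diff above:

```python
def enable_offload(module, **kwargs):
    # Stub standing in for the real offloading entrypoint (hypothetical).
    return kwargs

def apply_offloading(module, use_disk_offload, disk_path=None):
    # Collect the arguments shared by both branches once, instead of
    # repeating them in each call site.
    common_kwargs = {
        "record_stream": False,
        "low_cpu_mem_usage": False,
        "exclude_modules": None,
    }
    if use_disk_offload:
        return enable_offload(module, offload_to_disk_path=disk_path, **common_kwargs)
    return enable_offload(module, **common_kwargs)

with_disk = apply_offloading("m", True, disk_path="/tmp/offload")
without_disk = apply_offloading("m", False)
```

Both branches then stay trivially in sync when a shared argument is added or renamed.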
init Co-authored-by: Sayak Paul <[email protected]>
What does this PR do?
to
Of course, if users still want to apply different offloading techniques to different model-level components, they can easily choose to do so. But IMO, `enable_group_offload()` is an easier entrypoint. We can allow users to pass mappings like we do for `quant_mapping` in `PipelineQuantizationConfig` in the future. Will request reviews after CI.
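A per-component mapping in the spirit of `quant_mapping` might look like the sketch below. Everything here is hypothetical: the `offload_mapping` parameter, the helper function, and the config keys are illustrative only and are not part of this PR:

```python
# Sketch: dispatch one pipeline-level call across named components,
# with an optional per-component override mapping (hypothetical API).

DEFAULT_CONFIG = {"offload_type": "block_level", "num_blocks_per_group": 1}

def enable_group_offload_for_pipeline(component_names, offload_mapping=None):
    """Return the offload config chosen for each component; components
    absent from the mapping fall back to a single default. Real code
    would install offloading hooks here instead of returning dicts."""
    overrides = offload_mapping or {}
    return {
        name: overrides.get(name, DEFAULT_CONFIG)
        for name in component_names
    }

applied = enable_group_offload_for_pipeline(
    ["text_encoder", "transformer", "vae"],
    offload_mapping={"transformer": {"offload_type": "leaf_level"}},
)
```

This keeps the common case a single call with no mapping, while still allowing per-component control for users who want it.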
TODOs