
Conversation

@bghira (Contributor) commented Nov 6, 2024

What does this PR do?

Adds skip_layer parameter to the transformer model class for stable diffusion 3.

I can un-bundle the batched CFG change and also include the pipeline changes in this pull request if you require.

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@sayakpaul @yiyixuxu

@bghira bghira force-pushed the feature-sd3/skip-layer-guidance branch from cbeef91 to 21075fa Compare November 6, 2024 20:12
@yiyixuxu (Collaborator) commented Nov 7, 2024

hi @bghira
Thanks for the PR. Would you be able to provide a little bit of context on the parameter you are adding? For example, which layers would you skip, and what would the result look like?

cc @asomoza here too

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@bghira (Contributor, Author) commented Nov 7, 2024

this is part of the work to implement skip-layer guidance (SLG) for CFG in SD 3.5 Medium. the recommendation from SAI is to skip layers 7, 8, and 9 when doing negative guidance, so this will have to be altered for that to work.
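For reference, the SLG-adjusted CFG update from SAI's reference implementation can be sketched with plain numbers. The function name and the scalar values here are illustrative stand-ins for real latent tensors, not any diffusers API:

```python
def cfg_with_slg(pos_out, neg_out, skip_out, cfg_scale, slg_scale):
    """Combine classifier-free guidance with a skip-layer guidance term.

    pos_out / neg_out are the conditional / unconditional predictions;
    skip_out is the conditional prediction recomputed with the chosen
    layers (e.g. 7, 8, 9) skipped. The SLG term pushes the result away
    from the degraded skipped-layer prediction.
    """
    scaled = neg_out + (pos_out - neg_out) * cfg_scale
    return scaled + (pos_out - skip_out) * slg_scale

# Scalar stand-ins for real latent tensors.
print(cfg_with_slg(pos_out=1.0, neg_out=0.0, skip_out=0.5,
                   cfg_scale=5.0, slg_scale=2.8))  # 6.4
```

The extra term costs one additional conditional forward pass per step while the window is active.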

@yiyixuxu (Collaborator) commented Nov 7, 2024

cc @asomoza here
can you also take a look to see if this is the best way to support the "negative guidance skipping" feature?

@bghira (Contributor, Author) commented Nov 7, 2024

upstream implementation, for reference:

        # Run cond and uncond in a batch together
        batched = self.model.apply_model(
            torch.cat([x, x]),
            torch.cat([timestep, timestep]),
            c_crossattn=torch.cat([cond["c_crossattn"], uncond["c_crossattn"]]),
            y=torch.cat([cond["y"], uncond["y"]]),
        )
        # Then split and apply CFG Scaling
        pos_out, neg_out = batched.chunk(2)
        scaled = neg_out + (pos_out - neg_out) * cond_scale
        # Then run with skip layer
        if (
            self.slg > 0
            and self.step > (self.skip_start * self.steps)
            and self.step < (self.skip_end * self.steps)
        ):
            skip_layer_out = self.model.apply_model(
                x,
                timestep,
                c_crossattn=cond["c_crossattn"],
                y=cond["y"],
                skip_layers=self.skip_layers,
            )
            # Then scale according to skip-layer guidance
            scaled = scaled + (pos_out - skip_layer_out) * self.slg
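The step-window gating in the snippet above can be isolated as a runnable toy. The attribute names follow the snippet and the fractional window defaults are illustrative, not a public diffusers API:

```python
def slg_active(step, total_steps, slg_scale=2.8, skip_start=0.01, skip_end=0.2):
    # Mirrors the upstream condition: SLG only fires when the scale is
    # positive and the current step falls inside a fractional window of
    # the sampling schedule.
    return (
        slg_scale > 0
        and step > skip_start * total_steps
        and step < skip_end * total_steps
    )

# With 28 steps and a (0.01, 0.2) window, SLG fires on steps 1 through 5.
print([s for s in range(28) if slg_active(s, 28)])  # [1, 2, 3, 4, 5]
```

Restricting SLG to early steps keeps the extra conditional forward pass cheap, since composition is mostly decided early in the schedule.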

@vladmandic (Contributor):

link to #9819

@yiyixuxu (Collaborator) commented Nov 7, 2024

ohh got it
I will leave this open but we actually expect a PR for this feature soon:)

@bghira (Contributor, Author) commented Nov 7, 2024

ok well i've already completed the feature last night and it's available in simpletuner

vanilla diffusers, which looks awful:
[image omitted]

skipping layers 6, 7, 8, 9 with SLG scale of 5.6:
[image omitted]

skipping 7, 8, 9 with SLG scale of 2.8:
[image omitted]

it works better at 1024x1024*

@bghira (Contributor, Author) commented Nov 7, 2024

@vladmandic see the above pipeline commit if you're interested

@yiyixuxu (Collaborator) commented Nov 7, 2024

I've asked @Dango233 for a review, let's work on this PR and get it merged soon:)

@bghira (Contributor, Author) commented Nov 8, 2024

i think a possible improvement would be to dynamically determine whether to skip a layer and what scale to apply, but it's a bit hard to do.

@bghira bghira closed this Nov 16, 2024
@bghira (Contributor, Author) commented Nov 16, 2024

closing due to lack of updates.

@bghira (Contributor, Author) commented Nov 16, 2024

for anyone wanting this feature: it seems Candle has an interest in keeping up with community pull requests, while the diffusers project has been falling behind quite a bit lately in addressing development.

@bghira bghira deleted the feature-sd3/skip-layer-guidance branch November 16, 2024 14:07
@vladmandic (Contributor):

@bghira i agree diffusers have been falling behind lately, but why close this pr?

@yiyixuxu @sayakpaul what can we do here?

@bghira (Contributor, Author) commented Nov 16, 2024

they said some other pull request was coming, so i assume the preference is for that pull request.

at this point i've simply advocated for an internal fork of Diffusers that fulfils our needs, merely cherry-picking fixes from this project where it makes sense.

@yiyixuxu (Collaborator):

@bghira,

Initially, the author of SD3.5 was going to send a pull request for this feature around the same time; that's why I said this: #9880 (comment). I checked with him after we received your PR, and we agreed we should move forward with this PR and have him do a review instead; I said so here: #9880 (comment).

I was traveling for the past week and could not follow up. I understand your frustration. If you are willing to re-open the PR, we can continue to work on this and get it merged soon. Otherwise, we will add it in a new PR and add you as a co-author (cc @asomoza here )

@vladmandic (Contributor):

Regardless of whether we reopen this PR or implement it independently, we need to figure out how to keep diffusers current, at least when it comes to top models. SLG has been the default behavior in SD3.5 Medium for 3+ weeks now and we're still discussing it here. I understand that cannot be done for all models, but for something currently in the top 3 we should aim for parity much faster.

@bghira (Contributor, Author) commented Nov 17, 2024

as the branch has been deleted, the pull request can no longer be reopened

edit: found a hidden button

@bghira bghira restored the feature-sd3/skip-layer-guidance branch November 17, 2024 13:19
@bghira bghira reopened this Nov 17, 2024
@bghira (Contributor, Author) commented Nov 17, 2024

@vladmandic the strangest thing is that i thought this would be much harder because no one had gotten around to it yet, which is why it took me so long to even attempt it. when i saw how simple it was, i had it working in under an hour, and that left me greatly confused about why SD 3.5 Medium in Diffusers is so far behind, since even i was capable of doing it.

@yiyixuxu yiyixuxu requested a review from asomoza November 17, 2024 15:56
for index_block, block in enumerate(self.transformer_blocks):
    # Skip specified layers
    if skip_layers is not None and index_block in skip_layers:
        if block_controlnet_hidden_states is not None and block.context_pre_only is False:
Collaborator review comment:
can we skip the block of code that needs to be skipped instead of adding duplicated code here?
otherwise, if we have to change this part of the code that handles the controlnet residual in the future, we'd have to remember to change both places, which is not great
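One way to read the suggestion, sketched with toy blocks (all names here are hypothetical): a single early `continue` keeps the controlnet-residual handling in one place instead of duplicating it inside the skip branch.

```python
class ToyBlock:
    """Stand-in for a transformer block: just increments its input."""
    def __call__(self, hidden):
        return hidden + 1

def run_blocks(hidden, blocks, skip_layers=None, controlnet_residuals=None):
    for index_block, block in enumerate(blocks):
        if skip_layers is not None and index_block in skip_layers:
            continue  # skip the whole iteration; no duplicated residual code
        hidden = block(hidden)
        if controlnet_residuals is not None:
            hidden = hidden + controlnet_residuals[index_block]
    return hidden

# 12 blocks with 3 skipped -> 9 increments.
print(run_blocks(0, [ToyBlock() for _ in range(12)], skip_layers=[7, 8, 9]))  # 9
```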

)

self._guidance_scale = guidance_scale
self._skip_layer_guidance_scale = skip_layer_guidance_scale
@yiyixuxu (Collaborator) commented Nov 18, 2024:

need to add a decorator for this too, like this
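The requested decorator presumably mirrors how `guidance_scale` is already exposed: store the value on call and surface it through a read-only `@property`. A minimal sketch, with a hypothetical class name and defaults:

```python
class PipelineSketch:
    """Toy stand-in for the SD3 pipeline's scale bookkeeping."""

    def __call__(self, guidance_scale=7.0, skip_layer_guidance_scale=2.8):
        # The pipeline call stores the scales on private attributes...
        self._guidance_scale = guidance_scale
        self._skip_layer_guidance_scale = skip_layer_guidance_scale

    @property
    def guidance_scale(self):
        return self._guidance_scale

    @property
    def skip_layer_guidance_scale(self):
        # ...and a read-only property exposes the value set on the last call.
        return self._skip_layer_guidance_scale

pipe = PipelineSketch()
pipe(skip_layer_guidance_scale=3.0)
print(pipe.skip_layer_guidance_scale)  # 3.0
```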

@asomoza (Member) commented Nov 18, 2024

@bghira from what I take from the other discussion, you would prefer if we just took over this PR and finished it, right?

@yiyixuxu (Collaborator):

@asomoza I made the change I requested - feel free to merge after you test it and think it's ok.
we should add these parameters to img2img/inpaint/controlnet/PAG too (can do this in a separate PR and ask the community to test, etc.)

@asomoza (Member) commented Nov 19, 2024

So I did some tests with this. Sadly, I couldn't find any combination of layers and scale that makes the generation better; it seems to improve the text, but overall, in my experience, it made the results worse, or at least I personally didn't like them. I tested with SD 3.5 Medium since the feature was released with that model.

clip prompt:

high quality photo of a tea party set in an enchanted forest, long rustic table, teapots, cups, fox with a monocle, rabbit in a waistcoat, squirrel wearing a tiny top hat, fireflies provide a soft, glowing light, hanging from the trees are lanterns made from acorn shells.

t5 prompt:

high quality photo of a tea party set in an enchanted forest. The scene includes a long, rustic table adorned with an assortment of colorful teapots and cups. Seated around the table are various forest creatures: a fox with a monocle, a rabbit in a waistcoat, and a squirrel wearing a tiny top hat. Above, fireflies provide a soft, glowing light, and hanging from the trees are lanterns made from acorn shells. The background features towering trees with twisting branches, some of which have treehouses nestled in them.

original:
[image grid omitted: nine comparison generations]

Still a nice addition to have, and my example probably isn't the best to showcase this, but I like to test with more complex and realistic use cases rather than simple ones.

I'll give this another try once I build an SD 3.5 testing app that lets me iterate faster.

@asomoza (Member) commented Nov 19, 2024

thanks!

@bghira (Contributor, Author) commented Nov 19, 2024

remember that it's applying a skip to the negative prompt, so that part of things also impacts testing.

for the simpletuner use case, it's meant to improve validation results so that they match inference in ComfyUI, where skip-layer guidance is more often than not used to improve results. after introducing the feature, users often report that they don't want to disable the option anymore, because the results so much more closely match their inference results in ComfyUI.

@asomoza asomoza merged commit 99c0483 into huggingface:main Nov 19, 2024
13 of 15 checks passed
@bghira bghira deleted the feature-sd3/skip-layer-guidance branch November 19, 2024 20:24
@vladmandic (Contributor) commented Nov 19, 2024

thanks all!
btw, i've run a few tests and i think there is more to be squeezed quality-wise than just using the SAI-recommended values.
[image omitted]

sayakpaul pushed a commit that referenced this pull request Dec 23, 2024
* add skip_layers argument to SD3 transformer model class

* add unit test for skip_layers in stable diffusion 3

* sd3: pipeline should support skip layer guidance

* up

---------

Co-authored-by: bghira <[email protected]>
Co-authored-by: yiyixuxu <[email protected]>