Support InstantStyle #7586

haofanwang · 2024-04-05T16:58:08Z

This PR supports for our recent work InstantStyle in native diffusers API. The idea has also been discussed at #7534 .

The modifications are mainly about IP-Adapter loader, allowing users to specify target blocks used for image feature injection. After merged, InstantStyle can be achieved via following way

pipe.load_ip_adapter(pretrained_model_name_or_path_or_dict="./", 
                     subfolder="sdxl_models", 
                     weight_name="ip-adapter_sdxl.bin",
                     image_encoder_folder=image_encoder_path,
                     target_blocks=["block"]
                    )

yiyixuxu

looking great!
left one comment. Thanks!

yiyixuxu · 2024-04-05T17:19:02Z

src/diffusers/loaders/unet.py

+                    selected = False
+                    for block_name in target_blocks:
+                        if block_name in name:
+                            selected = True
+                            break


does this work?

Suggested change

selected = False

for block_name in target_blocks:

if block_name in name:

selected = True

break

selected = any( block_name in name for block_name in target_blocks)

Yes, it is much clearer. Updated.

HuggingFaceDocBuilderDev · 2024-04-05T17:21:15Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

yiyixuxu · 2024-04-05T17:21:23Z

@asomoza can you do a final review and test this out?

asomoza · 2024-04-05T18:44:35Z

Nice, it works as intended and I really like the skip functionality, thank you for your work.

But I want to give my opinion here and open a discussion on about how is this implemented. Diffusers is used as a backend for UIs, for advanced users and also not for that advanced that knows all the diffusers naming for the blocks and layers, so I think we need to take this into consideration:

Doing this when loading the weights is restricting users and UIs that once the blocks are selected we can't change them, for that we have to reload the IP adapter. This is going to be specially noticeable when we use a lot of ip adapters, since even if we reload one, we need to reload all of them with the current implementation. I think we need to be able to change an IP adapter from a "full" configuration, a "style" configuration, a "composition" configuration or whatever we like on the fly without reloading.
Users will have to know this kind of names "up_blocks.0.attentions.1" to be able to use it and we already have the naming that @UmerHA implemented in Implements Blockwise lora #7352 which I like a lot more.

I really like to be able to use the same dict as the LoRAs for example.:

    {
        "down": {"block_1": [0.0, 0.0], "block_2": [0.0, 0.0]},
        "mid": 0.0,
        "up": {"block_0": 1.0, "block_1": [0.0, 0.0, 0.0]},
    },

on the other side, I really like the simplicity of this PR, if we go with the other implementation it will probably make the code a lot more complex.

haofanwang · 2024-04-05T19:13:18Z

I believe we can set different weights for each block just like LoRA dict, in this case, we don't need to reload modules. Will update soon.

ivanprado · 2024-04-08T09:39:36Z

Cool @haofanwang! What would be the mechanism to subtract the CLIP text embedding from the image embedding with diffusers? This is done here in the original implementation.

yiyixuxu · 2024-04-08T18:53:40Z

we will merge this one soon #7499
cool to rebase here? or is it easier the other way around?

haofanwang · 2024-04-09T02:46:52Z

Will rebase on #7499.

okaris · 2024-04-12T08:40:21Z

Is it possible to load multiple ip adapters at the same time?
ip-adapter + instantid
instantid + instantstyle

I would appreciate an example that shows how we could load/unload different adapters seperately or together.

Thanks!

haofanwang · 2024-04-14T08:59:07Z

@yiyixuxu @asomoza Our teammate has made a new PR. This PR will be closed soon.

haofanwang · 2024-04-14T09:02:55Z

Is it possible to load multiple ip adapters at the same time? ip-adapter + instantid instantid + instantstyle

I would appreciate an example that shows how we could load/unload different adapters seperately or together.

Thanks!

This is another problem. We can support it later.

haofanwang · 2024-04-14T09:03:43Z

Cool @haofanwang! What would be the mechanism to subtract the CLIP text embedding from the image embedding with diffusers? This is done here in the original implementation.

This can be natively achieved already. We will show how to do it once this PR merged.

ResearcherXman added 2 commits April 6, 2024 00:31

support instantstyle

63a63ec

quality check

820a2ab

yiyixuxu approved these changes Apr 5, 2024

View reviewed changes

format

4c0f414

format

14098f7

AMEERAZAM08 mentioned this pull request Apr 5, 2024

Gradio Demo on Huggingface instantX-research/InstantStyle#4

Merged

yiyixuxu mentioned this pull request Apr 11, 2024

Add native support to InstantStyle by allowing users to choose target_blocks for IPAdapters #7642

Closed

JY-Joy mentioned this pull request Apr 14, 2024

Support InstantStyle #7668

Merged

haofanwang closed this Apr 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support InstantStyle #7586

Support InstantStyle #7586

haofanwang commented Apr 5, 2024

yiyixuxu left a comment

yiyixuxu Apr 5, 2024

ResearcherXman Apr 5, 2024

HuggingFaceDocBuilderDev commented Apr 5, 2024

yiyixuxu commented Apr 5, 2024

asomoza commented Apr 5, 2024 •

edited

Loading

haofanwang commented Apr 5, 2024

ivanprado commented Apr 8, 2024

yiyixuxu commented Apr 8, 2024

haofanwang commented Apr 9, 2024

okaris commented Apr 12, 2024

haofanwang commented Apr 14, 2024

haofanwang commented Apr 14, 2024

haofanwang commented Apr 14, 2024

Support InstantStyle #7586

Support InstantStyle #7586

Conversation

haofanwang commented Apr 5, 2024

yiyixuxu left a comment

Choose a reason for hiding this comment

yiyixuxu Apr 5, 2024

Choose a reason for hiding this comment

ResearcherXman Apr 5, 2024

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Apr 5, 2024

yiyixuxu commented Apr 5, 2024

asomoza commented Apr 5, 2024 • edited Loading

haofanwang commented Apr 5, 2024

ivanprado commented Apr 8, 2024

yiyixuxu commented Apr 8, 2024

haofanwang commented Apr 9, 2024

okaris commented Apr 12, 2024

haofanwang commented Apr 14, 2024

haofanwang commented Apr 14, 2024

haofanwang commented Apr 14, 2024

asomoza commented Apr 5, 2024 •

edited

Loading