Fix [core/GLIGEN]: TypeError when iterating over 0-d tensor with In-painting mode when EulerAncestralDiscreteScheduler is used #5305

rchuzh99 · 2023-10-06T00:47:03Z

What does this PR do?

Fixes #5216
Re-opens PR #5214

This PR fixes the TypeError caused by trying to directly iterate over a 0-dimension tensor in the denoising stage of GLIGEN In-painting operation.

The error occurs when using diffusion noise schedulers that iterate over timesteps
(e.g. EulerAnchestralDiscreteScheduler, KDPM2AncestralDiscreteScheduler), during in-painting operation with the StableDiffusionGLIGENPipeline and StableDiffusionGLIGENTextImagePipeline .

For further clarification, this operation of the add_noise function 🔽

diffusers/src/diffusers/schedulers/scheduling_euler_ancestral_discrete.py

Lines 387 to 388 in ae2fc01

	step_indices = [(schedule_timesteps == t).nonzero().item() for t in timesteps]

in the affected noise schedulers expects the timesteps to be a non-0 dim torch Tensor. However, in the affected pipelines, timesteps is 0-dimension.

This PR references the approach found in the StableDiffusionInpaintingPipeline

diffusers/src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_inpaint.py

Lines 1029 to 1034 in ae2fc01

    
           if i < len(timesteps) - 1: 
        
               noise_timestep = timesteps[i + 1] 
        
               init_latents_proper = self.scheduler.add_noise( 
        
                   init_latents_proper, noise, torch.tensor([noise_timestep]) 
        
               )

which is to wrap the timestep(t) 0-d tensor in a list to convert to 1-d tensor as follow 🔽

  if gligen_inpaint_image is not None:
      gligen_inpaint_latent_with_noise = (
          self.scheduler.add_noise(
              gligen_inpaint_latent, torch.randn_like(gligen_inpaint_latent), torch.tensor([t])
          )
          .expand(latents.shape[0], -1, -1, -1)
          .clone()
      )

https://github.com/rchuzh99/diffusers/blob/fb82fc4bdcead457e24a780cfb193070227f3e31/src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_gligen.py#L799-L806 and https://github.com/rchuzh99/diffusers/blob/fb82fc4bdcead457e24a780cfb193070227f3e31/src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_gligen_text_image.py#L960-L967

Affected pipelines

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a Github issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

cc: @sayakpaul , @nikhil-masterful, @tuanh123789

References

StableDiffusionGLIGENPipeline: Add GLIGEN implementation #4441
StableDiffusionGLIGENTextImagePipeline: Add GLIGEN Text Image implementation #4777
StableDiffusionInpaintingPipeline: https://github.com/huggingface/diffusers/blob/main/src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_inpaint.py

…ist to convert to 1-d tensor. This avoids the TypeError caused by trying to directly iterate over a 0-dimensional tensor in the denoising stage

…creteScheduler

…mesteps

sayakpaul · 2023-10-09T07:28:28Z

examples/custom_diffusion/train_custom_diffusion.py

@@ -207,7 +207,7 @@ def __init__(
                    with open(concept["class_prompt"], "r") as f:
                        class_prompt = f.read().splitlines()

-                class_img_path = [(x, y) for (x, y) in zip(class_images_path, class_prompt)]
+                class_img_path = list(zip(class_images_path, class_prompt))


sayakpaul

Thanks so much!

Will merge once the CI is green

rchuzh99 · 2023-10-09T07:32:40Z

Thanks so much!

Will merge once the CI is green

Thanks for the review @sayakpaul 👍🏻

WuyangLuo · 2023-12-12T19:50:12Z

Very useful ! Thx

…ainting mode when EulerAncestralDiscreteScheduler is used (huggingface#5305) * fix(gligen_inpaint_pipeline): 🐛 Wrap the timestep() 0-d tensor in a list to convert to 1-d tensor. This avoids the TypeError caused by trying to directly iterate over a 0-dimensional tensor in the denoising stage * test(gligen/gligen_text_image): unit test using the EulerAncestralDiscreteScheduler --------- Co-authored-by: zhen-hao.chu <[email protected]> Co-authored-by: Sayak Paul <[email protected]>

zhen-hao.chu and others added 3 commits October 5, 2023 06:49

fix(gligen_inpaint_pipeline): 🐛 Wrap the timestep() 0-d tensor in a l…

1735422

…ist to convert to 1-d tensor. This avoids the TypeError caused by trying to directly iterate over a 0-dimensional tensor in the denoising stage

test(gligen/gligen_text_image): unit test using the EulerAncestralDis…

d797d51

…creteScheduler

Merge branch 'huggingface:main' into rchuzh99/fix-gligen-add-noise-ti…

9f41380

…mesteps

rchuzh99 mentioned this pull request Oct 6, 2023

Fix [core/GLIGEN]: TypeError when iterating over 0-d tensor with In-painting mode when EulerAncestralDiscreteScheduler is used #5214

Closed

6 tasks

DN6 requested a review from sayakpaul October 6, 2023 08:18

sayakpaul reviewed Oct 9, 2023

View reviewed changes

sayakpaul approved these changes Oct 9, 2023

View reviewed changes

Merge branch 'main' into rchuzh99/fix-gligen-add-noise-timesteps

6fdb932

sayakpaul merged commit 6bd55b5 into huggingface:main Oct 9, 2023
11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix [core/GLIGEN]: TypeError when iterating over 0-d tensor with In-painting mode when EulerAncestralDiscreteScheduler is used #5305

Fix [core/GLIGEN]: TypeError when iterating over 0-d tensor with In-painting mode when EulerAncestralDiscreteScheduler is used #5305

rchuzh99 commented Oct 6, 2023 •

edited

Loading

sayakpaul Oct 9, 2023

sayakpaul left a comment

rchuzh99 commented Oct 9, 2023

WuyangLuo commented Dec 12, 2023

	if i < len(timesteps) - 1:
	noise_timestep = timesteps[i + 1]
	init_latents_proper = self.scheduler.add_noise(
	init_latents_proper, noise, torch.tensor([noise_timestep])
	)

Fix [core/GLIGEN]: TypeError when iterating over 0-d tensor with In-painting mode when EulerAncestralDiscreteScheduler is used #5305

Fix [core/GLIGEN]: TypeError when iterating over 0-d tensor with In-painting mode when EulerAncestralDiscreteScheduler is used #5305

Conversation

rchuzh99 commented Oct 6, 2023 • edited Loading

What does this PR do?

Affected pipelines

Before submitting

Who can review?

References

sayakpaul Oct 9, 2023

Choose a reason for hiding this comment

sayakpaul left a comment

Choose a reason for hiding this comment

rchuzh99 commented Oct 9, 2023

WuyangLuo commented Dec 12, 2023

rchuzh99 commented Oct 6, 2023 •

edited

Loading