Allow explicitly selecting xFormers at install time #7065
Conversation
I feel like whether we use xformers should be controlled via the config rather than by whether xformers is installed. Is there a reason for doing it this way?
I think we can handle choosing the correct attention type at runtime:
diff --git a/invokeai/backend/stable_diffusion/diffusers_pipeline.py b/invokeai/backend/stable_diffusion/diffusers_pipeline.py
index 646e1a92d..06a66f64b 100644
--- a/invokeai/backend/stable_diffusion/diffusers_pipeline.py
+++ b/invokeai/backend/stable_diffusion/diffusers_pipeline.py
@@ -195,7 +195,14 @@ class StableDiffusionGeneratorPipeline(StableDiffusionPipeline):
# the remainder if this code is called when attention_type=='auto'
if self.unet.device.type == "cuda":
- if is_xformers_available():
+ # On 30xx and 40xx series GPUs, `torch-sdp` is faster than `xformers`. This corresponds to a CUDA major
+ # version of 8 or higher. So, for major version 7 or below, we prefer `xformers`.
+ # See:
+ # - https://developer.nvidia.com/cuda-gpus
+ # - https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#compute-capabilities
+ prefer_xformers = torch.cuda.get_device_properties("cuda").major <= 7
+
+ if is_xformers_available() and prefer_xformers:
self.enable_xformers_memory_efficient_attention()
return
         elif hasattr(torch.nn.functional, "scaled_dot_product_attention"):
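The capability cutoff from the diff above can be exercised on its own. This is a sketch, not code from the PR: the helper name `prefer_xformers_for` and the pure-function form are mine; in a real CUDA environment the major version would come from `torch.cuda.get_device_properties("cuda").major` as the diff shows.

```python
def prefer_xformers_for(compute_capability_major: int) -> bool:
    """Prefer xformers only on GPUs with CUDA compute capability 7.x or
    below (20xx series and older). On 8.x+ (30xx/40xx series), torch-sdp
    is faster, so xformers should be skipped there.
    """
    return compute_capability_major <= 7

# In a CUDA environment the input would be obtained at runtime, e.g.:
#   torch.cuda.get_device_properties("cuda").major

print(prefer_xformers_for(7))  # 20xx-class GPU: True
print(prefer_xformers_for(8))  # 30xx-class GPU: False
```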
This was a decision made in a Slack discussion, but I'm happy to do it whichever way is preferable. I think not having xformers installed at all is the safest choice, because we can't control which 3rd-party code might use it if detected. I like the runtime decision approach suggested by @psychedelicious. I can easily add that to the PR, but I also think we shouldn't be installing it unless necessary.
I added the suggested runtime change in the latest commit.
I guess if, for some reason, a person with a new card wanted to use xformers, they'd have to download it manually 🤔 That said, I am a fan of not installing something that won't be used the vast majority of the time.
There's no good reason for someone with a modern GPU to use xformers though, is there? As much as I'd like to accommodate that individual who swaps their newer (30xx+) GPU for an older (20xx-) GPU and wants to continue seamlessly using Invoke with the same performance profile... I'm not convinced that's an edge case we really need to account for ;)
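The "if detected" concern above boils down to an import-availability probe: third-party code typically checks whether the package can be imported at all. A minimal stdlib-only sketch of that kind of check (the helper name is hypothetical; diffusers' `is_xformers_available()` performs a similar probe):

```python
import importlib.util


def package_available(name: str) -> bool:
    # A package counts as "available" if an import spec can be found for it;
    # code probing for xformers this way will detect, and may use, any
    # installed copy, regardless of what the application config says.
    return importlib.util.find_spec(name) is not None


print(package_available("json"))      # stdlib module, always present
print(package_available("xformers"))  # True only if xformers is installed
```

This is why not installing xformers at all is the only way to guarantee no code path picks it up.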
Summary
A recent Torch upgrade introduced a performance regression on modern Nvidia GPUs when using xformers.
This PR adds an install option to select xFormers for the older GPUs that need it, and to avoid it on newer cards that can use torch-sdp or other attention types. This PR also points torch to the CUDA v12.4 extra-index-url for CPU-only Docker builds (not related to the xformers issue).
QA Instructions
The updated installer, built from this PR, is attached here. Please test it on Windows and macOS for completeness. On the Mac it should simply proceed with the installation without displaying the GPU selection menu, and should result in a working install.
InvokeAI-installer-v5.1.0.zip
Run install.sh and select the xFormers option, then verify that xformers is installed (pip freeze | grep xformers).
Run install.sh with the default option and verify that xformers is NOT installed.
Merge Plan
Merge anytime
Checklist