
add support for attention slicing and other CUDA optimizations #155

Closed
ssube opened this issue Feb 16, 2023 · 2 comments · Fixed by #177
Labels: status/fixed (issues that have been fixed and released), type/feature (new features)

ssube commented Feb 16, 2023

microsoft/onnxruntime#11118

ssube added the status/new and type/feature labels on Feb 16, 2023
ssube added this to the v0.8 milestone on Feb 16, 2023
ssube added the status/progress label and removed the status/new label on Feb 18, 2023
ssube commented Feb 18, 2023

I've written the code to enable the optimizations, but only when they have been set in the environment. There should eventually be code to calculate the correct optimizations for the current platform, but it looks like some of them do not apply to the ONNX pipelines:

[2023-02-18 18:01:02,270] DEBUG: onnx_web.diffusion.load: enabling attention slicing on SD pipeline                                                                                 
[2023-02-18 18:01:02,270] DEBUG: onnx_web.diffusion.load: enabling VAE slicing on SD pipeline
[2023-02-18 18:01:02,270] WARNING: onnx_web.diffusion.load: error while enabling VAE slicing: 'OnnxStableDiffusionPipeline' object has no attribute 'enable_vae_slicing'            
[2023-02-18 18:01:02,270] DEBUG: onnx_web.diffusion.load: enabling model CPU offload on SD pipeline
[2023-02-18 18:01:02,270] WARNING: onnx_web.diffusion.load: error while enabling model CPU offload: 'OnnxStableDiffusionPipeline' object has no attribute 'enable_model_cpu_offload'
[2023-02-18 18:01:02,270] DEBUG: onnx_web.server.model_cache: cache limit set to 0, not caching model: diffusion
[2023-02-18 18:01:02,270] DEBUG: onnx_web.server.model_cache: cache limit set to 0, not caching model: scheduler 
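For reference, a minimal sketch of how those optimizations can be guarded so that unsupported ones produce a warning instead of an error (the function name and the optimization flag names here are illustrative assumptions, not the exact onnx-web code):

```python
from logging import getLogger

logger = getLogger(__name__)


def enable_pipeline_optimizations(pipeline, optimizations):
    # try each requested optimization; the ONNX pipelines do not implement all
    # of the diffusers methods, so failures are logged and skipped
    if "diffusers-attention-slicing" in optimizations:
        logger.debug("enabling attention slicing on SD pipeline")
        try:
            pipeline.enable_attention_slicing()
        except AttributeError as e:
            logger.warning("error while enabling attention slicing: %s", e)

    if "diffusers-vae-slicing" in optimizations:
        logger.debug("enabling VAE slicing on SD pipeline")
        try:
            pipeline.enable_vae_slicing()
        except AttributeError as e:
            logger.warning("error while enabling VAE slicing: %s", e)

    if "diffusers-cpu-offload" in optimizations:
        logger.debug("enabling model CPU offload on SD pipeline")
        try:
            pipeline.enable_model_cpu_offload()
        except AttributeError as e:
            logger.warning("error while enabling model CPU offload: %s", e)
```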

ssube self-assigned this on Feb 18, 2023
ssube commented Feb 18, 2023

The CUDA and ONNX optimizations are all available behind the ONNX_WEB_OPTIMIZATIONS variable, but they need to be manually enabled until I can figure out which ones are available and appropriate for each platform.
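Roughly, the gating looks like this (assuming ONNX_WEB_OPTIMIZATIONS is a comma-separated list of optimization names; the exact format and names are assumptions for illustration):

```python
from os import environ


def get_optimizations():
    # assumed format: ONNX_WEB_OPTIMIZATIONS="diffusers-attention-slicing,diffusers-cpu-offload"
    raw = environ.get("ONNX_WEB_OPTIMIZATIONS", "")
    return set(name.strip() for name in raw.split(",") if name.strip())


# hypothetical usage with the helper sketched in the previous comment:
# enable_pipeline_optimizations(pipeline, get_optimizations())
```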

ssube added the status/fixed label and removed the status/progress label on Feb 18, 2023
ssube mentioned this issue on Mar 5, 2023