You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I've written the code to enable the optimizations, but only when they have been set in the env. There should be some code to calculate the correct optimizations for the current platform, but it looks like some of them do not apply to ONNX:
[2023-02-18 18:01:02,270] DEBUG: onnx_web.diffusion.load: enabling attention slicing on SD pipeline
[2023-02-18 18:01:02,270] DEBUG: onnx_web.diffusion.load: enabling VAE slicing on SD pipeline
[2023-02-18 18:01:02,270] WARNING: onnx_web.diffusion.load: error while enabling VAE slicing: 'OnnxStableDiffusionPipeline' object has no attribute 'enable_vae_slicing'
[2023-02-18 18:01:02,270] DEBUG: onnx_web.diffusion.load: enabling model CPU offload on SD pipeline
[2023-02-18 18:01:02,270] WARNING: onnx_web.diffusion.load: error while enabling model CPU offload: 'OnnxStableDiffusionPipeline' object has no attribute 'enable_model_cpu_offload'
[2023-02-18 18:01:02,270] DEBUG: onnx_web.server.model_cache: cache limit set to 0, not caching model: diffusion
[2023-02-18 18:01:02,270] DEBUG: onnx_web.server.model_cache: cache limit set to 0, not caching model: scheduler
The CUDA and ONNX optimizations are all available behind the ONNX_WEB_OPTIMIZATIONS variable, but need to be manually enabled until I can figure out which ones are available/appropriate for each platform.
microsoft/onnxruntime#11118
The text was updated successfully, but these errors were encountered: