Reporting Performance Differences with Updates #1181
Replies: 5 comments 3 replies
-
hi I have a problem with controlnet regarding the pony models, I don't know where to write so I apologize in advance. |
Beta Was this translation helpful? Give feedback.
-
I'd like to report a performance difference with Flux - it's gotten insanely good and I don't understand why ^^ 1 image 832x1216 Euler simple 30 steps with: But.. how the hell am I suddenly running a full fp16 clip model on my 10GB gpu? o.o (just because I still don't understand this magic: Maybe relevant from the webui user bat: |
Beta Was this translation helpful? Give feedback.
-
|
Beta Was this translation helpful? Give feedback.
-
If GPU weight is not recommended to set to max then what is the recommended safe value which guarantees the 10x slowdown never happens? Maybe something like max value minus X? |
Beta Was this translation helpful? Give feedback.
-
Would love to just hear general reports of what speeds people are getting with their hardware. I think I may be underperforming, for example: TL;DR: 1.7s/it
|
Beta Was this translation helpful? Give feedback.
-
Some people reported performance degradation with updates.
However, most are just caused by people forgetting that they are using different models (these days models change a lot and many people have downloaded lots of versions ... ); note that different model architecture (NF/GGUF/FP) are expected to have different performance; or because some old GPU are not very stable <- for example, this guy's Forge became slow after one line of text is added to readme🤯
If you are sure that some updates caused slow down, and knows how to git checkout, you can put full console logs before after some commit with different speed in this post.
We also noticed (with several strong evidences) that some people are spreading misinformation about Forge being slow with an intention to promote their custom workflow. <- if you are doing it right now, please stop it.
Finally, remember that giving full console logs before after some commit with different speed (if true) really helps us. Also, it would be better if you have screenshots like this:
(these are statics for generation)
PS: some people seem to have better performance with
--cuda-malloc
in their CMD args. Although that never happened to my 5 different test devices, you may try it and see what will happen.See also tuning flux speed
Do Not Set GPU WEIGHT to Max Value!
Some people think that setting GPU weight to max will fit everything into GPU and it is faster. No, it is not. If you set GPU weight to max value, you model is in GPU, but you do not have GPU free memory to do computation, and the speed may be 10x slower.
Typical Example of User Mistake
Below is a typical example of users randomly setting options to worse values and then complain about performance:
His screenshot is
There are 3 problems in this screenshot
This means this user will use 100% GPU memory to load weights with 0% memory to compute. So, it is expected to be about 10x slower.
This means this user will use two workers to move layers and compute together. This can make things faster if it works, but consider that 0% GPU memory can be used in computation, this will not work because there is no free VRAM to compute.
This mean the user will use shared GPU memory, and the Forge in the screenshot is doing that. This can make things faster if it works, but consider that 0% GPU memory can be used in computation, this will not work in this case.
In this screenshot, everything works perfectly. It is slow because the user sets it to run in a slow mode.
After read the original instructions again, the user get normal speed back.
So, make sure to read the instructions!
Beta Was this translation helpful? Give feedback.
All reactions