-
Notifications
You must be signed in to change notification settings - Fork 27k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Forcing Torch Version to 1.13.1 for RX 5000 series GPUs #11048
Forcing Torch Version to 1.13.1 for RX 5000 series GPUs #11048
Conversation
Why do you want to force the TORCH_COMMAND for Navi 2 cards if your issue is with Navi 1? |
Use |
Ok, i was thinking the HSA_OVERRIDE_GFX_VERSION was causing the issue, but there's must be something else going on. |
Any more feedback from AMD users? |
I'm the author of #10855. |
That's a good question. My first guess is that HSA_OVERRIDE_GFX_VERSION was causing problems, but that's also true for RX 6000, wich is working just fine. Sooo.... Who knows. We can't even be really sure it's just RX 5000, maybe there are other series wich have problems but no one has reported it yet |
RX 5700 user here. Arch Linux with ROCm 5.4.3. CPU generation worked fine but when using the GPU, generation would make 0 progress. CLI output gave nothing and the webui reported rightfully that nothing was happening. Radeontop would report Shader Interpolator pegged at 100%, Memory Clock would fluctuate between 0 and 100%, and Shader Clock would be pegged at 100%. VRAM at ~4500M with Stable Diffusion 1.5 iirc. Attempting to switch Stable Diffusion checkpoint through the webui would tick up indefinitely with an expected time of 1.5s on bottom. Changing to Python 3.10.12 and applying this patch makes everything functional. Small note: I don't have bc installed, so I had to cut out the check for whether Python 3.10 or else install it. There's probably a more portable solution. |
But...is there really no way to work around the issue other than fixing torch to 1.13.1? |
Description
As described on the discussion under #10465, this is an attempt to make work the WebUI again on older AMD cards like the RX5000 series.
The code forces TORCH_COMMAND for Navi and Renoir GPUs.
Also, the older pytorch version requires python version <= 3.10. I added some code to check it, so the user gets a cleaner error message when trying to run it on python 3.11 or later
This is still intended as a workaround, to be removed as soon as someone finds a way to run the webui properly on these cards.
Should fix #10873
Checklist: