-
Notifications
You must be signed in to change notification settings - Fork 500
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RTX 30x0 Support #10
Comments
Are you able to use RTX 30x0 when not using the docker image? |
Tf 1.x only support up to Cuda 10.x |
@reflare Can you try changing this code: https://github.com/NVlabs/stylegan2-ada/blob/main/dnnlib/tflib/custom_ops.py#L57:
to:
I don't have an Ampere card in my machine to test this but it's worth seeing what happens with it. Problems might still remain, as TensorFlow (from the old container) would still be compiled against an old architecture and might be slower than what it could be. |
With the current unstable drivers, I am hesitant to roll them out onto my main machine.
Nvidia/tensorflow (as opposed to tensorflow/tensorflow) supports CUDA 11.1 for TF 1.15. |
It got surprisingly far on CUDA 11.1 but unfortunately errored out with
I am monitoring memory use on both GPUs and limiting the run to a single GPU just to make sure it's not an issue with parallelism. Memory is not near being exhausted so it appears that the CuSolver cannot be created due to the mismatch in versions. I have run a second test on CUDA 11.0 and it errors out with the same message. However on CUDA 11.0 it also shows the following:
|
Hey @reflare I had that error too, it has to do with the latest bugfix for the NaNs in tensorboard commited by @tkarras ps: I would be very interested in hearing how you got this working for Cuda 11.* since I'm also planning on ordering a few RTX 3090 cards myself.. |
@tr1pzz Thank you for the input. That said, I didn't get around to testing it because: Nvidia just released a new NGC image with native support for Tensorflow 1.15 and CUDA 11.1. |
This works out of the box on newer NVIDIA GPUs such as RTX 3090. Note that an NVIDIA driver release r455.23 or later is required to run this image. See README for instructions on how to build for older drivers.
@reflare Thanks for the bug report and your comments on trying out I pushed a change that defaults to this base image. This requires a pretty new driver (r455.23) to run so I also added a README comment on how to revert back to the old base image. |
Hi, I have finished to run |
I got stylegan2-ada working without docker by bumping tensorflow to version 2 in compatibility mode. It’s quite trivial. nvlabs should do this or we pray they release a tensorflow1.16 with cuda 11 support / they probably won’t. You’ll need cuda toolkit 11.2 / driver 460 to get it working. You can use timeshift to save a snapshot of your working config. for training mode check the digressions branch / it has one tiny fix. |
…NVlabs#10)" This reverts commit 8c3fd8b.
So 2022 ... I got 3090 and can't do stylegan 2 ? (sad face here...) Because cuda/pytorch 10 will not play well with my hardware... |
This repository (stylegan2-ada) is in fact the TensorFlow version of StyleGAN2 ADA which is completely unsupported and unmaintained. You should be able to get the pytorch version (stylegan2-ada-pytorch or even more recently, stylegan3) working on 3090. I don't see why something like pytorch 1.11 with CUDA 11.3 wouldn't work on 3090. |
I have just given up on using this tensorflow 1.15 based model on RTX 3090. I had great success on an RTX 2080 Ti but have not been able to successfully train with RTX 3090. While still unworkable, I came closest to success upgrading the base image to
There are several github issues on stylegan2-ada-pytorch that indicate that the model is not compatible with pytorch 1.11. The code tree does have some explicit commits to support pytorch 1.9. It is unclear whether pytorch 1.10 will play well. The official base image for the repo is still using pytorch 1.8. I am about to switch over and see how successful stylegan2-ada-pytorch is with RTX 3090. |
Your current docker image relies on an older version of CUDA.
The current 3080 and 3090 series GPUs are only supported under CUDA 11.1.
It would be wonderful if you could update the image or offer a workaround.
Error message when running with older CUDA:
nvcc fatal : Value 'sm_86' is not defined for option 'gpu-architecture'
The text was updated successfully, but these errors were encountered: