Failed to run ``gpustat --debug'': pynvml.NVMLError_LibraryNotFound: NVML Shared Library Not Found #90
Comments
What's the output of |
I haven't installed any NVIDIA-related drivers/tools/utilities on the machine, so the nvidia-smi command is currently not available. |
Unfortunately, this tool (like any other GPU management tool) depends on the Nvidia driver and toolkit. You should at least install the Nvidia driver to get things working.
|
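For anyone hitting the same error on Linux, a quick way to check whether the driver pieces are present at all is sketched below. It uses only the Python standard library; the helper name `nvidia_driver_status` is made up for illustration, and `ctypes.util.find_library("nvidia-ml")` is only an approximation of how pynvml locates the NVML shared library, so treat the result as a hint rather than a definitive diagnosis.

```python
import ctypes.util
import shutil


def nvidia_driver_status():
    """Report whether the usual signs of an installed Nvidia driver are present.

    Hypothetical helper, not part of gpustat or pynvml.
    """
    return {
        # nvidia-smi ships with the driver; a missing binary suggests no driver.
        "nvidia_smi_on_path": shutil.which("nvidia-smi") is not None,
        # Assumption: the NVML library is discoverable under the name "nvidia-ml"
        # (libnvidia-ml.so on Linux). This lookup is not meaningful on Windows.
        "nvml_library_found": ctypes.util.find_library("nvidia-ml") is not None,
    }


if __name__ == "__main__":
    for check, ok in nvidia_driver_status().items():
        print(f"{check}: {'yes' if ok else 'no'}")
```

If both checks come back negative, installing the driver is most likely the first step before gpustat can work.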
Thanks a lot for your explanation. I'll try it and report back if necessary. |
|
The problem has been solved. The reason was that I didn't have a correct installation of CUDA/nvidia-driver. Now it works smoothly. See the following for details:
|
@radhikasethi2011 Your problem looks like a Windows compatibility issue in PyNVML. I don't have a Windows GPU server at hand, so I'm afraid I cannot fix it myself. But can you take a look at this link (https://forum.faceswap.dev/viewtopic.php?t=14)? @wookayin What do you think about adding a Windows support section to the documentation? |
It is indeed a good datapoint where nvidia-smi works but PyNVML cannot load the shared library (first time seeing this) in Windows (@radhikasethi2011's case). On Ubuntu it was probably fine (@hongyi-zhao's case). Not sure why, but the link you posted says:
So we should provide an instruction saying that the drivers should be obtained from the Nvidia website. @radhikasethi2011, can you confirm whether this is the case for you and whether it solves your issue? I will add some notes to the README, along with more informative error messages (which will ship in the next release, though). |
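As a rough idea of what a more informative error message could look like, here is a minimal sketch. It is not gpustat's actual code: `nvml_status` is a hypothetical helper, and the hint text is only a suggestion based on this thread. It relies on `pynvml.nvmlInit`, `pynvml.nvmlSystemGetDriverVersion`, `pynvml.nvmlShutdown`, and the `pynvml.NVMLError` base class, and falls back gracefully when pynvml itself is not installed.

```python
try:
    import pynvml
except ImportError:  # pynvml is not installed in this environment
    pynvml = None


def nvml_status():
    """Return a human-readable description of whether NVML can be loaded.

    Hypothetical helper for illustration only.
    """
    if pynvml is None:
        return "pynvml is not installed."
    try:
        pynvml.nvmlInit()
    except pynvml.NVMLError as exc:
        # NVMLError_LibraryNotFound (the error in this issue) is a subclass
        # of NVMLError, so it is caught here as well.
        return (
            f"NVML could not be initialised ({exc!r}). "
            "Check that the Nvidia driver is installed, preferably from the Nvidia website."
        )
    try:
        version = pynvml.nvmlSystemGetDriverVersion()
        # Older pynvml releases return bytes, newer ones return str.
        if isinstance(version, bytes):
            version = version.decode()
        return f"NVML loaded, driver version {version}."
    except pynvml.NVMLError as exc:
        return f"NVML loaded, but the driver version query failed ({exc!r})."
    finally:
        pynvml.nvmlShutdown()


if __name__ == "__main__":
    print(nvml_status())
```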
In another issue #86, @eusoubrasileiro used a workaround of copying |
@Stonesjtu @wookayin updated my nvidia driver but nothing changed. Will uninstall and install again from the nvidia website and update here soon. |
Did you mean you updated your driver through windows installer? |
@wookayin no, through the nvidia website. Will try the workaround |
This was my solution; I hope it helps someone. pynvml looks for nvml.dll in "C:\Program Files\NVIDIA Corporation\NVSMI" and "C:\Windows\System32", but the new installer puts the file in "C:\Windows\System32\DriverStore\FileRepository\nv_dispi.inf_amd64_aXXXXXXXXXXXXXX". Just copy the DLL from "FileRepository" to the "Program Files" location. If there is no "NVSMI" folder inside "C:\Program Files\NVIDIA Corporation", create one and put the DLL inside. The nvml.dll in System32 is 596 KB, while the file inside "FileRepository" is 1051 KB; if there is already an nvml.dll inside "Program Files" but it is the 596 KB version, replace it with the 1051 KB one. Make sure to right-click and copy the file rather than drag and move it: moving takes the original file out of "FileRepository", and you will not have the privileges to copy it back or undo the move. |
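The manual copy described above could be scripted roughly as follows. This is a sketch under stated assumptions, not an official fix: the function names are made up, the two paths at the bottom are taken from the comment above, and picking the largest nvml.dll when several driver folders exist is a heuristic based on the file sizes mentioned there. Writing into "Program Files" normally requires an elevated (administrator) prompt.

```python
import glob
import os
import shutil


def find_driverstore_nvml(file_repository):
    """Return all nvml.dll files one level below file_repository, largest first."""
    pattern = os.path.join(file_repository, "*", "nvml.dll")
    return sorted(glob.glob(pattern), key=os.path.getsize, reverse=True)


def copy_nvml(file_repository, nvsmi_dir):
    """Copy (never move) the largest nvml.dll into nvsmi_dir.

    Returns the destination path, or None if no nvml.dll was found.
    """
    candidates = find_driverstore_nvml(file_repository)
    if not candidates:
        return None
    os.makedirs(nvsmi_dir, exist_ok=True)  # create the NVSMI folder if missing
    target = os.path.join(nvsmi_dir, "nvml.dll")
    shutil.copy2(candidates[0], target)  # the original stays in the DriverStore
    return target


if __name__ == "__main__" and os.name == "nt":
    # Paths as given in the comment above; adjust if your system differs.
    repo = r"C:\Windows\System32\DriverStore\FileRepository"
    dest = r"C:\Program Files\NVIDIA Corporation\NVSMI"
    print(copy_nvml(repo, dest) or "No nvml.dll found under FileRepository")
```

Because the search uses a wildcard for the driver folder, it does not depend on the exact folder name, which varies between driver versions.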
Thanks a ton, I was running into this issue earlier while working with some Pytorch/fastai models. Now it seems good. Thanks again. |
This worked perfectly, thank you ! |
This works for me with a slight change: The location of nvml.dll is now in C:\Windows\System32\DriverStore\FileRepository\nvrzui.inf_amd64_8df10ddaac270452 |
You can solve this issue as follows:
|
Let me close this issue now, now that we have v1.0 released. I believe the new version of pynvml should have no problem, but if anyone runs into a similar issue on Windows, please create a new issue. Thanks. |
Hi,
On Ubuntu 20.04 with Python 3.8.3, I failed to run `gpustat --debug`, as shown below: