Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

No longer able to build in Docker #1566

Closed
iarspider opened this issue Nov 11, 2019 · 7 comments
Closed

No longer able to build in Docker #1566

iarspider opened this issue Nov 11, 2019 · 7 comments

Comments

@iarspider
Copy link

Hi!

We were able to build torchvision 0.3.0 w/cuda in docker container, which doesn't have access to NVidia GPU. However when building 0.4.1 we get the following error:

AssertionError: 
Found no NVIDIA driver on your system. Please check that you
have an NVIDIA GPU and installed a driver from
http://www.nvidia.com/Download/index.aspx

Is there a way to build torchvision without "NVidia driver"?

@iarspider
Copy link
Author

This is how we build Torch:

export USE_NINJA=1
export USE_MKLDNN=0
export CUDA_HOME=/path/to/cuda
python setup.py install --prefix=/install/torch

This is how we build Torchvision:

export LD_LIBRARY_PATH=/install/torch/lib/pythonX.Y/site-packages/torch/lib
export FORCE_CUDA=1
export CUDA_HOME=/path/to/cuda
python setup.py install --prefix=/install/torchvision

@fmassa
Copy link
Member

fmassa commented Nov 13, 2019

Hi,

I don't have much experience with docker, and I'm not what has changed that could break things for you compared to 0.3.0.

Can you try building torchvision from master branch?

@iarspider
Copy link
Author

Hi,

same problem with "master" (commit 681c6c1) version of torchvision.

@ahirner
Copy link

ahirner commented Jan 5, 2020

Solved by supplying TORCH_CUDA_ARCH_LIST, otherwise cpp_extension.py tries to dynamically determine ARCH. See trace if TORCH_CUDA_ARCH_LIST is not supplied for 0.4.2 and pytorch 1.3.1.

You may want to check out our custom builds here.

@fmassa
Copy link
Member

fmassa commented Jan 26, 2020

@ahirner can you send a PR fixing the docker in torchvision?

Also, cc @seemethere for awareness

@iarspider
Copy link
Author

Worked for me, thanks @ahirner !

@ahirner
Copy link

ahirner commented Feb 29, 2020

I think the error only pertains custom builds not current wheels? We don't use them either so it would be good to repro the case with official builds and open a new issue if it is @fmassa

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants