instructions to install for older compute architectures #4

Open
grlee77 opened this issue Jan 15, 2021 · 0 comments · May be fixed by #5

grlee77 commented Jan 15, 2021

When I built this package for an NVIDIA GTX 1080 Ti, compilation completed, but the kernels failed to execute at run time.

I tracked this down to the following lines, which cause the kernels to be built only for compute capabilities 7.0 and 8.0 unless CUDA_COMPUTE_CAPABILITY is set explicitly:

if(NOT CUDA_COMPUTE_CAPABILITY)
  set(CUDA_COMPUTE_CAPABILITY 70 80)
endif()

To build for a specific compute capability, add a CUDA_COMPUTE_CAPABILITY definition to the cmake call in the build_slines.sh script. For example, my GPU needs compute capability 6.1, so I added -DCUDA_COMPUTE_CAPABILITY=61:

# configure
cmake -DCMAKE_INSTALL_PREFIX=${install_dir} \
      -DCMAKE_BUILD_TYPE=Release \
      -DCMAKE_C_COMPILER=gcc \
      -DCMAKE_CXX_COMPILER=g++ \
      -DPYTHON_EXECUTABLE=$(which python) \
      -DCUDA_COMPUTE_CAPABILITY=61 \
      ..
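
If you are unsure which value to pass, something like the following sketch may help. It assumes a driver recent enough that nvidia-smi supports the compute_cap query field (older drivers do not), and the helper name cc_to_flag is only illustrative, not part of this project:

```shell
#!/bin/sh
# Convert a dotted compute capability such as "6.1" into the dotless
# form ("61") used by -DCUDA_COMPUTE_CAPABILITY.
# cc_to_flag is a hypothetical helper name, not part of this project.
cc_to_flag() {
    printf '%s\n' "$1" | tr -d '. '
}

# Query the first GPU, if nvidia-smi is installed. The compute_cap field
# is an assumption: it is only available with newer drivers.
if command -v nvidia-smi >/dev/null 2>&1; then
    cap=$(nvidia-smi --query-gpu=compute_cap --format=csv,noheader 2>/dev/null | head -n 1)
    case "$cap" in
        [0-9]*.[0-9]*)
            # Looks like "major.minor": print the matching cmake option.
            echo "-DCUDA_COMPUTE_CAPABILITY=$(cc_to_flag "$cap")"
            ;;
        *)
            echo "could not determine the compute capability from nvidia-smi" >&2
            ;;
    esac
fi
```

The printed option can then be pasted into the cmake call in build_slines.sh.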

Recent versions of CMake also provide a CMAKE_CUDA_ARCHITECTURES variable, but setting it did not seem to work: the CUDA_COMPUTE_CAPABILITY defaults were used instead.
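
One way to check which architectures actually ended up in a build is to inspect the compiled library with cuobjdump from the CUDA toolkit. This is a rough sketch: it assumes cuobjdump is on the PATH and that its --list-elf output names each embedded cubin with an sm_XX suffix. The function names are illustrative, and the library path has to be supplied as an argument because it depends on the install location:

```shell
#!/bin/sh
# Read text on stdin and print the distinct sm_XX targets it mentions.
# extract_sm_targets is a hypothetical helper name.
extract_sm_targets() {
    grep -o 'sm_[0-9][0-9]*' | sort -u
}

# List the GPU architectures embedded in a compiled CUDA library.
# Assumes cuobjdump is installed and reports one cubin per target.
list_sm_targets() {
    cuobjdump --list-elf "$1" 2>/dev/null | extract_sm_targets
}

# Usage: sh check_arch.sh /path/to/built/library.so
# (the script name and path are placeholders)
if [ -n "${1:-}" ]; then
    list_sm_targets "$1"
fi
```

If only sm_70 and sm_80 are listed after a build that was meant to target 6.1, the defaults were used.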
