instructions to install for older compute architectures #4

grlee77 · 2021-01-15T11:26:28Z

When trying to build this package for an NVIDIA GTX 1080 Ti, compilation completed, but the kernels failed to execute at run time.

I tracked this down to the following lines, which cause the kernels to be built only for compute capabilities 7.0 and 8.0 unless CUDA_COMPUTE_CAPABILITY is set explicitly.

GPUStreamlines/CMakeLists.txt

Lines 32 to 34 in b794717

    
    if(NOT CUDA_COMPUTE_CAPABILITY)
      set(CUDA_COMPUTE_CAPABILITY 70 80)
    endif()

To build for a specific compute capability, add a CUDA_COMPUTE_CAPABILITY definition to the cmake call in the build_slines.sh script. For example, my GPU needs compute capability 6.1, so I added -DCUDA_COMPUTE_CAPABILITY=61:

# configure
cmake -DCMAKE_INSTALL_PREFIX=${install_dir} \
      -DCMAKE_BUILD_TYPE=Release \
      -DCMAKE_C_COMPILER=gcc \
      -DCMAKE_CXX_COMPILER=g++ \
      -DPYTHON_EXECUTABLE=$(which python) \
      -DCUDA_COMPUTE_CAPABILITY=61 \
      ..
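
If you do not know your GPU's compute capability, something like the following might help pick the value to pass. This is only a sketch: `cc_to_cmake` and `detect_cc` are hypothetical helpers that are not part of GPUStreamlines, and the `compute_cap` query field is assumed to be supported by the installed `nvidia-smi` (older drivers may not offer it, in which case the value has to be looked up by hand).

```shell
# Convert a dotted compute capability (e.g. "6.1") into the form
# used by -DCUDA_COMPUTE_CAPABILITY (e.g. "61").
# Hypothetical helper, not part of GPUStreamlines.
cc_to_cmake() {
    printf '%s\n' "$1" | tr -d '.'
}

# Ask the driver for the first GPU's compute capability.
# Assumption: this nvidia-smi supports the compute_cap query field.
detect_cc() {
    command -v nvidia-smi >/dev/null 2>&1 || return 1
    nvidia-smi --query-gpu=compute_cap --format=csv,noheader 2>/dev/null | head -n 1
}

# Print the cmake flag if detection worked, otherwise fall back to a hint.
if cc="$(detect_cc)" && [ -n "${cc}" ]; then
    echo "-DCUDA_COMPUTE_CAPABILITY=$(cc_to_cmake "${cc}")"
else
    echo "could not detect compute capability; set it by hand" >&2
fi
```

The printed flag can then be pasted into the cmake call in build_slines.sh. Only the first GPU is queried, so a machine with mixed GPUs would need the value chosen manually.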

Recent CMake versions also provide a CMAKE_CUDA_ARCHITECTURES variable, but setting it did not seem to work: the CUDA_COMPUTE_CAPABILITY defaults ended up being used instead.


grlee77 added a commit that referenced this issue Jan 15, 2021

README.md: install with user-specified compute capability

df169e4

closes #4

grlee77 linked a pull request Jan 15, 2021 that will close this issue

DOC: install with user-specified compute capability #5

Open
