**Is your feature request related to a problem? Please describe.**
The current nanoVDB implementation uses functions such as `cudaMallocAsync` and `cudaMemcpyAsync`, for example in `CudaDeviceBuffer` when allocating or uploading data to the GPU. These functions are not available on a vGPU that does not have unified memory enabled, which is common for GPU-enabled Azure VMs where the GPU is shared/sliced between multiple instances. Running nanoVDB code on such a VM fails with CUDA error 801 (`cudaErrorNotSupported`).
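As a quick way to confirm this failure mode (a diagnostic sketch, not nanoVDB code): `cudaMallocAsync` relies on stream-ordered memory pools, whose availability can be queried per device. On the vGPU setups described above, the expectation is that this attribute comes back as 0.

```cpp
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    int device = 0, poolsSupported = 0;
    cudaGetDevice(&device);
    // cudaMallocAsync requires stream-ordered memory pools (CUDA 11.2+).
    // On vGPUs without unified memory this attribute is typically 0, and
    // cudaMallocAsync fails with cudaErrorNotSupported (801).
    cudaDeviceGetAttribute(&poolsSupported, cudaDevAttrMemoryPoolsSupported, device);
    std::printf("stream-ordered allocation supported: %s\n",
                poolsSupported ? "yes" : "no");
    return 0;
}
```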
**Describe the solution you'd like**
Projects such as PyTorch typically implement the async code paths behind a switch that enables or disables them, plus a fallback path that uses the synchronous functions. If nanoVDB had something similar, that would be the perfect solution, save for whatever efficiency the synchronous fallback paths give up.
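For illustration, a minimal runtime sketch of that pattern, using a hypothetical helper name `deviceAlloc` (not an actual nanoVDB or PyTorch API):

```cpp
// Hypothetical helper (not part of nanoVDB's API): use the stream-ordered
// allocator when the device supports it, otherwise fall back to the
// synchronous cudaMalloc. Requires CUDA 11.2+ headers for cudaMallocAsync.
#include <cuda_runtime.h>

inline cudaError_t deviceAlloc(void** ptr, size_t size, cudaStream_t stream)
{
#if CUDART_VERSION >= 11020
    int device = 0, poolsSupported = 0;
    cudaGetDevice(&device);
    cudaDeviceGetAttribute(&poolsSupported, cudaDevAttrMemoryPoolsSupported, device);
    if (poolsSupported)
        return cudaMallocAsync(ptr, size, stream); // async fast path
#endif
    (void)stream;                 // fallback ignores the stream
    return cudaMalloc(ptr, size); // synchronous, device-wide sync point
}
```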
**Describe alternatives you've considered**
For my situation there is no real alternative: I am not in a position to change hypervisor settings to enable unified memory support, or to use a different deployment target for the code I want to use with nanoVDB. The only option would be switching to a VM with a passthrough GPU instead of a vGPU, but again this is not under my control.
This is only about `cudaMallocAsync` and `cudaFreeAsync`, not `cudaMemcpyAsync` etc.
Upon closer inspection of `nanovdb/util/cuda/CudaUtils.h`, I found that there is already a fallback path when building with CUDA versions prior to 11.2 (the version that introduced the async malloc functions).
Based on this, I created PR #1799, which introduces macros `CUDA_MALLOC` and `CUDA_FREE`, plus a define `NANOVDB_USE_SYNC_CUDA_MALLOC` that can be set by the host build system to force synchronous CUDA allocations.
This has been verified to work on the vGPU deployment target I'm using.
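For readers of this thread, roughly how such a compile-time switch could look. This is a sketch only: the actual definitions live in the PR, and the exact macro signatures below are an assumption; only the names `CUDA_MALLOC`, `CUDA_FREE`, and `NANOVDB_USE_SYNC_CUDA_MALLOC` come from the description above.

```cpp
#include <cuda_runtime.h>

// NANOVDB_USE_SYNC_CUDA_MALLOC is meant to be set by the host build system;
// the CUDART_VERSION check mirrors the existing pre-11.2 fallback.
#if defined(NANOVDB_USE_SYNC_CUDA_MALLOC) || (CUDART_VERSION < 11020)
    // Synchronous path: works on vGPUs without unified memory; the stream
    // argument is ignored.
    #define CUDA_MALLOC(ptr, size, stream) cudaMalloc(&(ptr), (size))
    #define CUDA_FREE(ptr, stream)         cudaFree(ptr)
#else
    // Stream-ordered path (CUDA 11.2+).
    #define CUDA_MALLOC(ptr, size, stream) cudaMallocAsync(&(ptr), (size), (stream))
    #define CUDA_FREE(ptr, stream)         cudaFreeAsync((ptr), (stream))
#endif
```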