Add managed memory #211

csbnw · 2023-09-19T14:07:24Z

Description

There are many ways to mix and match different types of CUDA memory. As preparation for this PR, an (performance) evaluation of different strategies was conducted, the findings are summarized below:

The use of cu::HostMemory and cu::DeviceMemory with explicit memory copies provides the best performance, but is also the most verbose.
Mapped memory (already supported in cudawrappers) allows addressing host memory on the GPU, which allows for simple code, but it performs rather poorly.
Managed memory (also known as unified memory), provides better performance and given the user the option to control data movement by prefetching to either host or device memory. When like option 1, performance is only slightly lower than.
With the changes in this MR, support for option 3 is added.

To be specific, the cu::DeviceMemory constructor can now also be used to allocate managed memory by passing CU_MEMORYTYPE_UNIFIED as CUmemorytype argument and optionally also some flags. This change is transparent to pre-existing code by having CU_MEMORYTYPE_DEVICE as the default CUmemorytype and default flags = 0.
Additionally, cu::Stream::memPrefetchAsync is added to expose the cuMemPrefetchAsync function.

The new functionality is tested in new sections of the test_vector_add test.

Related issues:

Add support for registered and managed memory #173

Instructions to review the pull request

Check that CHANGELOG.md has been updated if necessary

john-romein · 2023-09-19T15:45:16Z

A few minor things:

in the constructor, the assignment to manager can be done outside (after) the if-then block
there should be a checkCudaCall() around the call to cuMemPrefetchAsync()

csbnw · 2023-09-20T07:10:47Z

@john-romein,
I applied your suggestions 👍

include/cudawrappers/cu.hpp

tests/test_vector_add.cpp

include/cudawrappers/cu.hpp

matmanc

Looks really nice :)

csbnw added 5 commits September 19, 2023 14:41

Add option to allocate managed memory in DeviceMemory

a8c34d0

Update test_vector_add

d9cd6fb

Add vector_add test with managed memory

78d3a01

Add Stream::memPrefetchAsync and test

24ec11e

Update changelog

e3ce071

csbnw requested review from matmanc and john-romein September 19, 2023 14:07

csbnw self-assigned this Sep 19, 2023

csbnw added 2 commits September 20, 2023 09:09

Add missing checkCudaCall

b630efd

Move manager assignment outside of if-else block

32d06e6

matmanc reviewed Sep 20, 2023

View reviewed changes

include/cudawrappers/cu.hpp Show resolved Hide resolved

csbnw commented Sep 20, 2023

View reviewed changes

tests/test_vector_add.cpp Outdated Show resolved Hide resolved

csbnw commented Sep 20, 2023

View reviewed changes

include/cudawrappers/cu.hpp Outdated Show resolved Hide resolved

csbnw added 6 commits September 20, 2023 15:26

Add tests for invalid arguments

6d18c82

Swap arguments for Stream::memPrefetchAsync

e884449

Make CU_DEVICE_CPU the default option for memPrefetchAsync

43128db

Add operator T *() to DeviceMemory

ac512be

Add safeguard to operator

7435205

Add tests for cu::DeviceMemory operator T *()

d86bb12

matmanc approved these changes Sep 21, 2023

View reviewed changes

csbnw merged commit 2a12b83 into main Sep 21, 2023

csbnw deleted the add-managed-memory branch September 21, 2023 11:04

csbnw mentioned this pull request Sep 21, 2023

Add support for registered and managed memory #173

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add managed memory #211

Add managed memory #211

csbnw commented Sep 19, 2023

john-romein commented Sep 19, 2023

csbnw commented Sep 20, 2023

matmanc left a comment

Add managed memory #211

Add managed memory #211

Conversation

csbnw commented Sep 19, 2023

john-romein commented Sep 19, 2023

csbnw commented Sep 20, 2023

matmanc left a comment

Choose a reason for hiding this comment