Utilizing Unified Memory for Oceananigans simulations #3592
Replies: 1 comment 12 replies
-
That is really interesting!
What specifically does that line do? Is there documentation for
CUDA is specific to NVidia hardware. For Apple Silicon, we can use It would be interesting to test the hydrostatic model in a configuration that doesn't have FFTs and to compare performance between unified memory and explicit parallelization. @simone-silvestri might be interested in that. |
Beta Was this translation helpful? Give feedback.
-
Would it be helpful/appropriate to add some information about Unified Memory utilization in the Simulation Tips/GPU section? If so, I'd be happy to draft a short new bullet point for the "decrease memory use of your runs" section of the documentation.
I've recently run Oceananigans non-hydrostatic (3D isotropic) simulations utilizing Unified Memory on new NVIDIA hardware (GH200; Grace Hopper Superchips). Unified Memory allows bigger simulations on a single GPU (>1 billion grid points in this case), compared to using the GPUs Device Memory (which is the on-GPU RAM we typically use), but it comes at the cost of a slower simulation due to interconnect bandwidth. Some preliminary simulations on the GH200 (see below) are 2-3x larger but are up to 3x slower than expected by extrapolating Device Memory scalings.
As background:
{width=50}
Beta Was this translation helpful? Give feedback.
All reactions