Compatibility with AMD GPUs #111

Open
svandenhaute opened this issue Jul 11, 2023 · 3 comments
Labels
enhancement New feature or request

Comments

@svandenhaute

Does this plugin work with AMD GPUs? PyTorch+ROCm is not installable via conda...

@peastman
Member

It includes an OpenCL implementation which works with AMD GPUs. However, I believe the PyTorch model will get executed on the CPU and only the OpenMM calculations will run on the GPU.

@svandenhaute
Author

svandenhaute commented Jul 11, 2023

Right, but given that the PyTorch evaluation constitutes most of the calculation time (at least for relatively small systems and complex torch models), this would turn out to be rather slow, no?

EDIT: PyTorch+ROCm is easily installable via pip. Would there be an easy way to patch things together?

@peastman
Member

PyTorch+ROCm is easily installable via pip. Would there be an easy way to patch things together?

There are two parts to that question. First, could PyTorch installed with pip and OpenMM installed with conda be made to work together? Possibly. It depends on whether they were compiled in ways that make them binary compatible. If not, you could always compile OpenMM from source.

The second part is what it would take to make TorchForce work with PyTorch using HIP. Take a look at the CUDA implementation in https://github.com/openmm/openmm-torch/blob/master/platforms/cuda/src/CudaTorchKernels.cpp for a sense of what it involves. It needs to move the model to the correct device, and also ensure that any tensors it creates are on the correct device.

const torch::Device device(torch::kCUDA, cu.getDeviceIndex()); // This implicitly initializes PyTorch
module.to(device); // Move the TorchScript model onto the same GPU OpenMM is using
torch::TensorOptions options = torch::TensorOptions().device(device).dtype(cu.getUseDoublePrecision() ? torch::kFloat64 : torch::kFloat32); // Tensors created later inherit this device and precision

Then there's the fact that all the data for the tensors is stored on the GPU, so all accesses to it have to happen in the correct way. You could avoid that complexity by just bringing all data back to the CPU when communicating between PyTorch and OpenMM, though that would increase overhead.
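
For illustration, a rough sketch of that CPU round-trip, reusing the `device` from the snippet above (the variable names and the assumption that `positions` is a flat host-side array OpenMM has already downloaded are placeholders, not actual TorchForce code):

// Sketch only: exchange data with OpenMM through CPU buffers, wherever the model runs.
torch::Tensor posCpu = torch::from_blob(positions.data(), {numParticles, 3}, torch::kFloat64).clone();
torch::Tensor posDev = posCpu.to(device).requires_grad_(true); // copy positions onto the model's device
torch::Tensor energy = module.forward({posDev}).toTensor();    // evaluate the model
energy.backward();                                             // forces = -dE/dx
torch::Tensor forcesCpu = (-posDev.grad()).to(torch::kCPU).contiguous();
// forcesCpu.data_ptr<double>() can now be handed back to OpenMM on the host side.
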

Finally there's some bookkeeping needed to keep everything working properly, such as calls to cuDevicePrimaryCtxRetain() and cuDevicePrimaryCtxRelease(). HIP is modelled after CUDA, so converting everything probably wouldn't be hard.
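
Very roughly, the HIP version of that bookkeeping might look like the following (a sketch only: the hipDevicePrimaryCtx* calls exist in the HIP runtime API and mirror the cuDevicePrimaryCtx* ones, but the surrounding names are hypothetical, and you'd want to confirm that PyTorch's ROCm build still reports its devices as torch::kCUDA):

// Hypothetical HIP analogue of the CUDA bookkeeping; requires <hip/hip_runtime.h>.
hipCtx_t primaryContext;
hipDevicePrimaryCtxRetain(&primaryContext, deviceIndex); // keep the primary context alive, like cuDevicePrimaryCtxRetain()

// PyTorch's ROCm build presents its devices as CUDA, so the device type stays torch::kCUDA.
const torch::Device device(torch::kCUDA, deviceIndex);
module.to(device);

// ...and when the kernel is destroyed:
hipDevicePrimaryCtxRelease(deviceIndex); // matches cuDevicePrimaryCtxRelease()
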

Of course, we couldn't distribute it through conda-forge, since conda-forge doesn't support HIP. :(
