CUDA-aware MPI may set up its CUDA state during MPI_Init, in which case cudaSetDevice must be called before MPI_Init. Furthermore, cudaSetDevice may also need to be called before MPI_Send.
Given this, we can probably only use CUDA-aware MPI with one GPU per rank.
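
A minimal sketch of the ordering described above, assuming one GPU per rank. Since the MPI rank is not available before MPI_Init, this reads the node-local rank from launcher environment variables (`OMPI_COMM_WORLD_LOCAL_RANK` for Open MPI, `MV2_COMM_WORLD_LOCAL_RANK` for MVAPICH2, `SLURM_LOCALID` for Slurm); which of these is set depends on the launcher. The helper names and the `WITH_MPI_CUDA` build flag are hypothetical, not part of any library.

```cpp
#include <cstdlib>

// Return the node-local rank exported by the launcher, or -1 if none is set.
// The variable names are launcher-specific (Open MPI, MVAPICH2, Slurm).
int local_rank_from_env() {
    const char* names[] = {"OMPI_COMM_WORLD_LOCAL_RANK",
                           "MV2_COMM_WORLD_LOCAL_RANK",
                           "SLURM_LOCALID"};
    for (const char* name : names) {
        if (const char* value = std::getenv(name)) {
            return std::atoi(value);
        }
    }
    return -1;
}

// Hypothetical helper: map a local rank to a single device, round-robin.
// Falls back to device 0 if the local rank or device count is unknown.
int device_for_local_rank(int local_rank, int device_count) {
    if (local_rank < 0 || device_count <= 0) return 0;
    return local_rank % device_count;
}

#ifdef WITH_MPI_CUDA  // assumed build flag: define when compiling against MPI and the CUDA runtime
#include <mpi.h>
#include <cuda_runtime.h>

int main(int argc, char** argv) {
    int device_count = 0;
    cudaGetDeviceCount(&device_count);

    // Select the device first, in case MPI_Init sets up CUDA-aware state.
    cudaSetDevice(device_for_local_rank(local_rank_from_env(), device_count));

    MPI_Init(&argc, &argv);
    // ... MPI_Send / MPI_Recv on buffers allocated on the selected device ...
    MPI_Finalize();
    return 0;
}
#endif
```

Error checking on the CUDA and MPI calls is omitted for brevity. If a rank ever had to switch devices, it would presumably need to call cudaSetDevice again before each MPI_Send, which is part of why one GPU per rank looks like the safe configuration.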