Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

hitting the integer limit for MPI-I/O on fat and dense GPU machines #9

Open
kostrzewa opened this issue May 26, 2023 · 0 comments
Open

Comments

@kostrzewa
Copy link
Member

kostrzewa commented May 26, 2023

On AMD MI250X (64 GB of GPU memory per GCD), the MG for an 80c160 lattice comfortably fits on two nodes with four GPUs each (eight GCDs each). This means that the 44 GB gauge configuration has to be read by 16 MPI tasks. This means that every MPI task reads around 2.7 GB (2949120086 bytes), which exceeds the silly 2^31 = 2147483648 byte limit of MPI I/O.

This means that we need to split reads and writes exceeding this limit into multiple I/O operations.

@kostrzewa kostrzewa mentioned this issue May 27, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant