Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Better error reporting of model crashes #513

Open
aidanheerdegen opened this issue Sep 16, 2024 · 0 comments
Open

Better error reporting of model crashes #513

aidanheerdegen opened this issue Sep 16, 2024 · 0 comments
Labels

Comments

@aidanheerdegen
Copy link
Collaborator

@blimlim response to a question on the ACCESS-Hive forum

The error “payu: Model exited with error code 1; aborting” can be a difficult one to find the cause of. My understanding is that whenever any of ESM’s submodels (e.g. atmosphere, ocean, ice) crash during a simulation, that crash will be picked up by Payu, which then reports the "payu: Model exited with error code 1; aborting” error to the user. As a result there could be a large range of different causes.

got me thinking. Can payu do better with error reporting of model crashes? Can payu identify which model crashed? I guess probably only through inspection of the logs, but are there any options in mpirun that could be used to report more fine-grained error codes?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant