
If primary is down abnormally, worker should exit with error #51

Open
larroy opened this issue Feb 2, 2021 · 4 comments

Comments

@larroy
Contributor

larroy commented Feb 2, 2021

The worker doesn't exit with an error when the primary goes down abnormally, because the StatusMessage is not checked. Would it be possible to make workers exit with an error in that case?

See flow here:

https://github.com/aws/sagemaker-spark-container/blob/master/src/smspark/job.py#L185
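To make the request concrete, here is a minimal sketch of the behaviour I have in mind. It is not the code in `job.py`: the function name `wait_for_primary_exit`, the `fetch_status` callback and the `"RUNNING"` / `"FINISHED"` status strings are all hypothetical, and the real StatusMessage fields may look different. The idea is only that the worker returns a non-zero exit code unless the primary reported a clean finish.

```python
import time
from typing import Callable, Optional

# Hypothetical status values, for illustration only.
RUNNING = "RUNNING"
FINISHED = "FINISHED"


def wait_for_primary_exit(
    fetch_status: Callable[[], Optional[str]],
    poll_interval: float = 0.0,
    max_polls: int = 100,
) -> int:
    """Poll the primary and return the exit code the worker should use.

    fetch_status returns the primary's current status string, or None
    when the primary can no longer be reached.
    """
    for _ in range(max_polls):
        status = fetch_status()
        if status == FINISHED:
            # Primary announced a clean shutdown: worker may exit 0.
            return 0
        if status is None:
            # Primary vanished without reporting a clean finish.
            return 1
        time.sleep(poll_interval)
    # Polling budget exhausted while the primary was still running.
    return 1
```

The worker entry point could then pass the result to `sys.exit`, so the container's exit status reflects what happened on the primary.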

@apacker
Contributor

apacker commented Feb 3, 2021

What impact does the worker not exiting with an error have? Presumably if the primary goes down prematurely it will exit with error and cause the job to fail. Is that not the case?

@larroy
Contributor Author

larroy commented Feb 12, 2021

@xgchena

@larroy
Contributor Author

larroy commented Feb 12, 2021

I think it results in false success messages in the worker algos, which can cause confusion. For the primary, yes, it will exit with an error as you say. I think we should also fail the worker containers, though.

@apacker
Contributor

apacker commented Mar 4, 2021

Understood, and agreed this causes confusion. We'll have to update our shutdown logic to ensure the driver node communicates to the workers to exit successfully.
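As a rough illustration of what the driver side of that could look like, here is a sketch under assumptions: `run_primary`, the `publish` callback and the status strings are hypothetical names, not the existing shutdown code. The driver publishes a final status before it exits, so that workers have something to check.

```python
from typing import Callable


def run_primary(job: Callable[[], None], publish: Callable[[str], None]) -> int:
    """Run the job and publish a final status for the workers to read.

    publish is a hypothetical hook that makes the status visible to
    workers (for example through the status endpoint they already poll).
    """
    publish("RUNNING")
    try:
        job()
    except Exception:
        # Tell workers the job failed before the primary exits non-zero.
        publish("FAILED")
        return 1
    publish("FINISHED")
    return 0
```

A primary that is killed abruptly never reaches either final `publish` call, so workers would still need to treat an unreachable primary with no final status as a failure.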

Thanks for the feedback, we're working this into our roadmap internally.
