Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Job control to continue, restart, cancel distributed jobs #47

Open
agocorona opened this issue Jan 28, 2020 · 2 comments
Open

Job control to continue, restart, cancel distributed jobs #47

agocorona opened this issue Jan 28, 2020 · 2 comments

Comments

@agocorona
Copy link
Collaborator

agocorona commented Jan 28, 2020

A jobcontrol primitive that can get Cloud computations as parameter and let the user decide what to do when an non-handled exception happens: either continue after the problem is fixed, using the log facility, restart anew or cancel the job. That control will work regardless of the node where currently it is executing.

For example A program would invoke some distributed facility that is not running/not installed. The user can retry and if it continue failing he can check if it is down, has a failure or is not installed. He can stop the computation, install it and resume execution without re-execting possibly heavy tasks already done at that point.

Messages to the user will appear in the console of the node that initiated the cloud computation and will be managed with console primitives like option and input

This comes from cloudshell

@agocorona
Copy link
Collaborator Author

Also, add an option to see online the execution log of each job

@agocorona
Copy link
Collaborator Author

The motivation of this is because although there is job control using services (see the executor service) it is not able by design to execute a sequence of distributed computations and make them optionally restart/continue on failure. For some heavy processeses it is good to log/cache results and avoid to re-execute what was already done.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant