Fault Tolerance #1

Open
fauh45 opened this issue Mar 19, 2024 · 0 comments


fauh45 commented Mar 19, 2024

I know the code doesn't mention fault tolerance and isn't really designed for it. Still, I think there's a problem if it's going to be used on a cluster where fault tolerance might be a requirement.

Suppose the calls happen like this:
node 1 (count 1) --> check for count --> not equal --> exit
    node 2 (count 2) --> check for count --> not equal --> exit
        node 3 (count 3) --> (crash)

Note: time is denoted by the amount of horizontal space; moving right means time is advancing.

In this instance, no node will ever call the callback, which leaves the process stuck.
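To make the scenario concrete, here's a minimal Python sketch of it. It assumes the mechanism is "each node increments a shared counter, then runs the callback only if the value it saw equals the expected count"; that is my reading of the behavior, not a copy of the project's code. The names `CountingBarrier`, `run_node`, and `simulate` are made up for illustration, and the in-memory counter stands in for whatever shared store the cluster would use.

```python
class CountingBarrier:
    """Hypothetical stand-in for the shared counter (not the project's API)."""

    def __init__(self, expected: int) -> None:
        self.expected = expected
        self.count = 0
        self.callback_fired = False

    def increment(self) -> int:
        # Each node bumps the counter and remembers the value it saw.
        self.count += 1
        return self.count

    def check(self, seen: int) -> None:
        # Only the node that saw the final count runs the callback.
        if seen == self.expected:
            self.callback_fired = True


def run_node(barrier: CountingBarrier, crash_before_check: bool = False) -> None:
    seen = barrier.increment()
    if crash_before_check:
        return  # simulated crash: the node dies after incrementing
    barrier.check(seen)


def simulate(crash_last: bool) -> bool:
    """Run three nodes in order; optionally crash the third before its check."""
    barrier = CountingBarrier(expected=3)
    run_node(barrier)  # node 1: sees 1, not equal, exits
    run_node(barrier)  # node 2: sees 2, not equal, exits
    run_node(barrier, crash_before_check=crash_last)  # node 3
    return barrier.callback_fired


if __name__ == "__main__":
    print("callback fired without crash:", simulate(crash_last=False))
    print("callback fired with crash:", simulate(crash_last=True))
```

In the crash case the counter has already reached the expected value, but the only node that could have noticed is gone, so nothing is left to trigger the callback.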

I'm still thinking about a solution to this problem. Do you have any ideas on how to make it more fault tolerant?
