-
Notifications
You must be signed in to change notification settings - Fork 345
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CDAP-21091: Increasing timeout for task workers to recover when app fabric restarts #15855
base: develop
Are you sure you want to change the base?
Conversation
private static final int NUMBER_OF_RETRIES = 20; | ||
private static final int MIN_WAIT_TIME_MILLISECOND = 2000; | ||
private static final int MAX_WAIT_TIME_MILLISECOND = 60000; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can we please move these to cconf
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let's move only the NUMBER_OF_RETRIES
to cconf.
That's the value we would need to modify mainly to configure the retry behavior.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we should keep min / max wait time the same and make NUMBER_OF_RETRIES
configurable in cconf.
If later we figure out that the number of retries are insufficient, we won't have to make data plane changes for it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
please add JIRA title in PR description
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
PR description should be of the form: data-integrations/google-cloud#1473 (comment)
|
Increasing timeout for task workers to recover when app fabric restarts
Jira: CDAP-21091
Description
This change lets the task workers restart gracefully when app fabric pod is restarted.
Code change
ComputeEngineCredentials.java