-
Notifications
You must be signed in to change notification settings - Fork 177
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix accidental ignition and stuck graceful termination #2212
Conversation
@tnozicka: GitHub didn't allow me to request PR reviews from the following users: tnozicka. Note that only scylladb members and repo collaborators can review this PR, and authors cannot review their own PRs. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
d9caa3b
to
995e3a5
Compare
(the e2e test on it's own failed as expected - the run was https://prow.scylla-operator.scylladb.com/view/gs/scylla-operator-prow/pr-logs/pull/scylladb_scylla-operator/2212/pull-scylla-operator-master-e2e-gke-serial/1862139318282227712#1:test-build-log.txt%3A994)
|
995e3a5
to
b51b4b3
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
one typo
lgtm
/assign zimnx
o.HaveKeyWithValue(naming.ScyllaContainerName, false), | ||
o.HaveKeyWithValue(naming.ScyllaDBIgnitionContainerName, false), | ||
), | ||
fmt.Sprintf("container in Pod %q don't match the expected state", pod.Name), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit
fmt.Sprintf("container in Pod %q don't match the expected state", pod.Name), | |
fmt.Sprintf("containers in Pod %q don't match the expected state", pod.Name), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ok, I suppose container(s)
as it's 1-N
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
either works
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done
now that we actually wait for ignition - the tuning took over 9 minutes which made the job timeout |
blocked on #2216 |
d98bd00
to
15d00e2
Compare
911d5b0
to
3533f6f
Compare
a491ba3
to
4b18dbf
Compare
2c8f717
to
baa2b8a
Compare
#2246 landed |
2d63a42
to
42ca696
Compare
42ca696
to
c98af31
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One comment, otherwise lgtm. Please fix PR description, one sentence contains duplicated parts.
c98af31
to
978875b
Compare
I fixed the typos and added more context, but one fix is for graceful termination before ignition and the other fix is after ignition. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm
/assign rzetelskik
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: rzetelskik, tnozicka, zimnx The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
one comment/question, rest lgtm
/lgtm |
bug #2249 |
Description of your changes:
This PR fixes the ignition touching the file before it's actually ignited and adds e2e coverage to prevent it from happening again. (Looks like it got broken in the early iterations.)
It also fixes stuck graceful termination before ignition is done because fixing one surfaces the other is broken and they can't be split without disabling some tests. (For this reason the ignition file is changed twice - best viewed as an overall diff then "by commit"). It also fixes graceful termination after ignition that was broken because of the manager agent container wasn't using
exec
. (@rzetelskik found that previously and it also failed in the test I wrote.)Which issue is resolved by this Pull Request:
Resolves #2213