Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow configuration MaxGracefulTerminationSec flag on ClusterAutoscaler #4697

Merged

Conversation

prashanth26
Copy link

@prashanth26 prashanth26 commented Sep 21, 2021

How to categorize this PR?

/area auto-scaling
/kind enhancement

What this PR does / why we need it:
During scale down of nodes in the cluster, the cluster autoscaler waits for a maximum of 10 minutes for any pod(s) to wait for graceful termination. However, this is a configurable flag at the cluster autoscaler to let the shoot owners decide the maximum time to wait while draining the node.

Which issue(s) this PR fixes:
Fixes #4695

Special notes for your reviewer:
The eventual plan would be to delegate the task of draining completely to the cluster autoscaler. In the meanwhile, this enhancement would give some breathing space for end-users to have a handle.

Release note:

Allows configuration of `MaxGracefulTerminationSeconds` flag on ClusterAutoscaler. This allows end-users to configure maximum graceful termination (drain) seconds beyond which the node is force deleted during scale-down of cluster nodes. The default value is 600 seconds. 

@prashanth26 prashanth26 requested a review from a team as a code owner September 21, 2021 09:40
@gardener-robot gardener-robot added kind/api-change API change with impact on API users needs/second-opinion area/auto-scaling Auto-scaling (CA/HPA/VPA/HVPA, predominantly control plane, but also otherwise) related kind/enhancement Enhancement, improvement, extension size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels Sep 21, 2021
pkg/apis/core/types_shoot.go Outdated Show resolved Hide resolved
example/90-shoot.yaml Outdated Show resolved Hide resolved
pkg/apis/core/v1alpha1/types_shoot.go Outdated Show resolved Hide resolved
pkg/apis/core/types_shoot.go Outdated Show resolved Hide resolved
pkg/apis/core/v1alpha1/types_shoot.go Outdated Show resolved Hide resolved
pkg/apis/core/v1beta1/types_shoot.go Outdated Show resolved Hide resolved
rfranzke
rfranzke previously approved these changes Sep 27, 2021
Copy link
Member

@rfranzke rfranzke left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm
@prashanth26 do you plan to tackle @ialidzhikov's suggestion?

@prashanth26
Copy link
Author

/lgtm
@prashanth26 do you plan to tackle @ialidzhikov's suggestion?

Apologies. I somehow forgot about the PR. Took in suggested changes.
/squash

@rfranzke
Copy link
Member

@prashanth26 Can you check why the failing verify step is a flake or related to this PR?

@prashanth26
Copy link
Author

@prashanth26 Can you check why the failing verify step is a flake or related to this PR?

Will check it out.

@prashanth26
Copy link
Author

@prashanth26 Can you check why the failing verify step is a flake or related to this PR?

One of the test variables wasn't renamed. I made the change and now the pipeline passes. PTAL. Also squashed all the changes.

Copy link
Member

@rfranzke rfranzke left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm

@rfranzke rfranzke merged commit 095a07f into gardener:master Sep 28, 2021
Kristian-ZH pushed a commit to Kristian-ZH/gardener that referenced this pull request Sep 29, 2021
krgostev pushed a commit to krgostev/gardener that referenced this pull request Apr 21, 2022
krgostev pushed a commit to krgostev/gardener that referenced this pull request Jul 5, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/auto-scaling Auto-scaling (CA/HPA/VPA/HVPA, predominantly control plane, but also otherwise) related kind/api-change API change with impact on API users kind/enhancement Enhancement, improvement, extension size/XL Denotes a PR that changes 500-999 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Keep CA max-graceful-termination-sec configurable
7 participants