Allow configuration MaxGracefulTerminationSec flag on ClusterAutoscaler #4697

prashanth26 · 2021-09-21T09:40:00Z

How to categorize this PR?

/area auto-scaling
/kind enhancement

What this PR does / why we need it:
During scale down of nodes in the cluster, the cluster autoscaler waits for a maximum of 10 minutes for any pod(s) to wait for graceful termination. However, this is a configurable flag at the cluster autoscaler to let the shoot owners decide the maximum time to wait while draining the node.

Which issue(s) this PR fixes:
Fixes #4695

Special notes for your reviewer:
The eventual plan would be to delegate the task of draining completely to the cluster autoscaler. In the meanwhile, this enhancement would give some breathing space for end-users to have a handle.

Release note:

Allows configuration of `MaxGracefulTerminationSeconds` flag on ClusterAutoscaler. This allows end-users to configure maximum graceful termination (drain) seconds beyond which the node is force deleted during scale-down of cluster nodes. The default value is 600 seconds.

pkg/apis/core/types_shoot.go

example/90-shoot.yaml

pkg/apis/core/v1alpha1/types_shoot.go

pkg/apis/core/types_shoot.go

pkg/apis/core/v1alpha1/types_shoot.go

pkg/apis/core/v1beta1/types_shoot.go

pkg/apis/core/validation/shoot_test.go

rfranzke

/lgtm
@prashanth26 do you plan to tackle @ialidzhikov's suggestion?

prashanth26 · 2021-09-27T07:13:26Z

/lgtm
@prashanth26 do you plan to tackle @ialidzhikov's suggestion?

Apologies. I somehow forgot about the PR. Took in suggested changes.
/squash

rfranzke · 2021-09-27T08:43:02Z

@prashanth26 Can you check why the failing verify step is a flake or related to this PR?

prashanth26 · 2021-09-27T10:07:45Z

@prashanth26 Can you check why the failing verify step is a flake or related to this PR?

Will check it out.

Co-authored-by: Ismail Alidzhikov <[email protected]>

prashanth26 · 2021-09-28T05:14:26Z

@prashanth26 Can you check why the failing verify step is a flake or related to this PR?

One of the test variables wasn't renamed. I made the change and now the pipeline passes. PTAL. Also squashed all the changes.

rfranzke

/lgtm

…ardener#4697) Co-authored-by: Ismail Alidzhikov <[email protected]> Co-authored-by: Ismail Alidzhikov <[email protected]>

prashanth26 requested a review from a team as a code owner September 21, 2021 09:40

gardener-robot-ci-2 added reviewed/ok-to-test and removed reviewed/ok-to-test labels Sep 21, 2021

rfranzke requested changes Sep 22, 2021

View reviewed changes

gardener-robot added the needs/changes label Sep 22, 2021

gardener-robot-ci-1 added reviewed/ok-to-test and removed reviewed/ok-to-test labels Sep 22, 2021

ialidzhikov reviewed Sep 23, 2021

View reviewed changes

pkg/apis/core/validation/shoot_test.go Outdated Show resolved Hide resolved

rfranzke previously approved these changes Sep 27, 2021

View reviewed changes

gardener-robot added reviewed/lgtm and removed needs/changes labels Sep 27, 2021

prashanth26 dismissed rfranzke’s stale review via af0bf15 September 27, 2021 07:11

gardener-robot added needs/changes and removed needs/review labels Sep 27, 2021

gardener-robot-ci-3 added reviewed/ok-to-test and removed reviewed/ok-to-test labels Sep 27, 2021

gardener-robot added the merge/squash label Sep 27, 2021

Allow configuration MaxGracefulTerminationSec on ClusterAutoscaler

e9bf065

Co-authored-by: Ismail Alidzhikov <[email protected]>

prashanth26 force-pushed the update-max-terminating-seconds branch from af0bf15 to e9bf065 Compare September 28, 2021 04:50

gardener-robot-ci-1 added the reviewed/ok-to-test label Sep 28, 2021

gardener-robot-ci-2 removed the reviewed/ok-to-test label Sep 28, 2021

rfranzke approved these changes Sep 28, 2021

View reviewed changes

gardener-robot added reviewed/lgtm and removed needs/changes labels Sep 28, 2021

rfranzke merged commit 095a07f into gardener:master Sep 28, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow configuration MaxGracefulTerminationSec flag on ClusterAutoscaler #4697

Allow configuration MaxGracefulTerminationSec flag on ClusterAutoscaler #4697

prashanth26 commented Sep 21, 2021 •

edited

Loading

rfranzke left a comment

prashanth26 commented Sep 27, 2021

rfranzke commented Sep 27, 2021

prashanth26 commented Sep 27, 2021

prashanth26 commented Sep 28, 2021

rfranzke left a comment

Allow configuration MaxGracefulTerminationSec flag on ClusterAutoscaler #4697

Allow configuration MaxGracefulTerminationSec flag on ClusterAutoscaler #4697

Conversation

prashanth26 commented Sep 21, 2021 • edited Loading

rfranzke left a comment

Choose a reason for hiding this comment

prashanth26 commented Sep 27, 2021

rfranzke commented Sep 27, 2021

prashanth26 commented Sep 27, 2021

prashanth26 commented Sep 28, 2021

rfranzke left a comment

Choose a reason for hiding this comment

prashanth26 commented Sep 21, 2021 •

edited

Loading