This repository has been archived by the owner on Jul 22, 2024. It is now read-only.

draining stuck on hard instance failure #8

Open

svenwltr opened this issue Sep 24, 2018 · 0 comments

Labels

bug

Member

svenwltr commented Sep 24, 2018 •

edited

Loading

The node-drainer gets stuck on a hard instance failure. In this case:

the node gets NotReady in Kubernetes
the pods on this node get either Unknown or NodeLost
the instance is in Terminating:Wait state

It gets stuck, because:

The node gets not removed from Kubernetes, because it is still in the ASG.
The instance does not get removed from the ASG, because it waits for the draining.
The draining cannot happen, because the node is actually gone.

Also the node is nor reachable via SSH nor ping.

The text was updated successfully, but these errors were encountered:

svenwltr added the bug label

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.