volume leak during deployment rollout

The controller is designed such that it collects information about volumes from nodes as the nodes register themselves. This implies that the controller cannot know about existing volumes for nodes that haven't registered (yet).

This leads to the following problem:
- DeleteVolume is called for an existing volume that the controller doesn't know about at the moment.
- The controller cannot distinguish between "volume already deleted" (idempotency!) and "need to wait for some node with that volume".
- It assumes that the volume is gone and returns success without doing anything, after a misleading log message about "Volume pvc-bd-adc62b1395a868c243a74ee138e313a19c72211c5fbc0d5f2706e486 not created by this controller".
- external-provisioner removes PV

=> volume leak

This problem was triggered by the new version skew tests which restart the driver while volumes exist, then does some operations (including removal) with them right after the driver deployment comes up again.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

volume leak during deployment rollout #733

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

volume leak during deployment rollout #733

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions