Centralized leaked ENI cleanup - refactor periodic cleanup & add node termination cleaner #505
base: eni-cleanup
Conversation
func (e *ClusterENICleaner) Start(ctx context.Context) error {
	e.Log.Info("starting eni clean up routine")

	// Start routine to listen for shut down signal, on receiving the signal it set shutdown to true
I don't think we need these comments that just describe the code.
Makes sense, this was just copied over from the existing eni_cleanup file.
VpcCniAvailableClusterENICnt = prometheus.NewGauge(
	prometheus.GaugeOpts{
		Name: "vpc_cni_created_available_eni_count",
		Help: "The number of available ENIs created by VPC-CNI that will tried to be deleted by the controller",
Can we rename this to vpc_cni_created_pending_deletion_eni_count? As written, it looks like just the number of ENIs created by VPC-CNI.
available here indicates the state of the network interface created by the corresponding component. "pending delete" is not accurate, as the ENI may not necessarily be deleted if it is re-attached to an instance.
VpcRcAvailableClusterENICnt = prometheus.NewGauge(
	prometheus.GaugeOpts{
		Name: "vpc_rc_created_available_eni_count",
		Help: "The number of available ENIs created by VPC-RC that will tried to be deleted by the controller",
same here
}()

// Perform ENI cleanup after fixed time intervals till shut down variable is set to true on receiving the shutdown
// signal
for !e.shutdown {
This can still result in a race condition: the loop check can happen first, and then shutdown gets set to true. It would be better to use the for-select pattern here, as sketched below.
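Roughly the shape being suggested, assuming a ticker-driven loop; cleanupInterval and DeleteLeakedResources are placeholder names, not necessarily what the PR uses:

func (e *ClusterENICleaner) Start(ctx context.Context) error {
	e.Log.Info("starting eni clean up routine")
	ticker := time.NewTicker(e.cleanupInterval) // cleanupInterval: assumed field
	defer ticker.Stop()
	for {
		select {
		case <-ctx.Done():
			// context cancellation replaces the shared shutdown flag, so there is
			// no read/write race between this loop and the signal handler
			return nil
		case <-ticker.C:
			e.DeleteLeakedResources() // assumed name for the per-run cleanup
		}
	}
}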
})
if err != nil {
	if !strings.Contains(err.Error(), ec2Errors.NotFoundInterfaceID) { // ignore InvalidNetworkInterfaceID.NotFound error
		// append err and continue, we will retry deletion in the next period/reconcile
Hopefully we can move to comparing error types once we are on aws-sdk-go-v2.
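For reference, a sketch of what typed error matching could look like with aws-sdk-go-v2 and smithy-go (not part of this PR; the helper name is made up):

import (
	"errors"

	"github.com/aws/smithy-go"
)

// isENINotFound reports whether err is EC2's InvalidNetworkInterfaceID.NotFound
// error, matched by typed error code instead of substring search.
func isENINotFound(err error) bool {
	var apiErr smithy.APIError
	return errors.As(err, &apiErr) && apiErr.ErrorCode() == "InvalidNetworkInterfaceID.NotFound"
}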
if !strings.Contains(err.Error(), ec2Errors.NotFoundInterfaceID) { // ignore InvalidNetworkInterfaceID.NotFound error
	// append err and continue, we will retry deletion in the next period/reconcile
	leakedENICount += 1
	errors = append(errors, fmt.Errorf("failed to delete leaked network interface %v:%v", *nwInterface.NetworkInterfaceId, err))
This will count ENIs that are being retried too.
Can you expand on this? In each periodic cleanup run, this is the number of leaked ENIs that failed to be deleted.
If these ENIs are still in a leaked state, then yes, deletion will be retried on the next cleanup, and if deletion fails again, they will be counted as leaked ENIs, which is the intended behavior. Do you have any concerns about this?
OK, I see we set the metric instead of adding to it after every run. We should be good then; it will only count the leaked ENIs for a particular run.
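i.e., something along these lines at the end of each run (a sketch; the counter variable names are assumed):

// Set (not Add) the gauges so they reflect only the ENIs found leaked in this run.
VpcCniAvailableClusterENICnt.Set(float64(vpcCniLeakedCount))
VpcRcAvailableClusterENICnt.Set(float64(vpcRcLeakedCount))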
@@ -238,12 +238,14 @@ func (b *branchENIProvider) ProcessAsyncJob(job interface{}) (ctrl.Result, error

 // DeleteNode deletes all the cached branch ENIs associated with the trunk and removes the trunk from the cache.
 func (b *branchENIProvider) DeleteNode(nodeName string) (ctrl.Result, error) {
-	trunkENI, isPresent := b.getTrunkFromCache(nodeName)
+	_, isPresent := b.getTrunkFromCache(nodeName)
 	if !isPresent {
 		return ctrl.Result{}, fmt.Errorf("failed to find node %s", nodeName)
Will this error be requeued? If yes, I wonder if this is a recoverable error?
It will be re-queued since it returns an error, but jobs are retried a maximum of 5 times and then forgotten.

> I wonder if this is a recoverable error

That's a good point: if the trunk is removed from the cache, it will not be added back unless the node is initialized again. I will check whether this is a valid case. Btw, this is existing logic; the only change is that we omit calling DeleteAllBranchENIs in this workflow, since the finalizer routine will delete the ENIs on node termination.
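For context, the retry-then-forget behavior mentioned above follows the usual client-go workqueue pattern; a sketch only (the helper name and max-requeue handling are assumptions, the controller's worker package may differ):

import "k8s.io/client-go/util/workqueue"

// handleRequeue retries a failed job with rate limiting up to maxRequeue times,
// then forgets it so it is not retried again.
func handleRequeue(queue workqueue.RateLimitingInterface, job interface{}, maxRequeue int) {
	defer queue.Done(job)
	if queue.NumRequeues(job) < maxRequeue {
		queue.AddRateLimited(job)
		return
	}
	queue.Forget(job)
}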
@@ -822,3 +909,19 @@ func (e *ec2Wrapper) getRegionalStsEndpoint(partitionID, region string) (endpoin
 	}
 	return res, nil
 }
+
+func (e *ec2Wrapper) DisassociateTrunkInterface(input *ec2.DisassociateTrunkInterfaceInput) error {
Hmm, the naming is a bit confusing. It looks like it is about disassociating a branch interface from a trunk interface?
That's right, this is the API call to remove the association between branch and trunk network interfaces. Adding this per the EC2 team's recommendation to call DisassociateTrunkInterface before deleting the ENIs.
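A rough sketch of the intended ordering, assuming the association ID was captured when AssociateTrunkInterface attached the branch; the helper and parameter names are illustrative, not verbatim PR code:

import (
	"fmt"

	"github.com/aws/aws-sdk-go/aws"
	"github.com/aws/aws-sdk-go/service/ec2"
	"github.com/aws/aws-sdk-go/service/ec2/ec2iface"
)

// deleteBranchENI disassociates the branch ENI from its trunk first, then deletes it.
// assocID is assumed to be the association ID returned by AssociateTrunkInterface.
func deleteBranchENI(client ec2iface.EC2API, assocID, branchENIID string) error {
	if _, err := client.DisassociateTrunkInterface(&ec2.DisassociateTrunkInterfaceInput{
		AssociationId: aws.String(assocID),
	}); err != nil {
		return fmt.Errorf("failed to disassociate branch from trunk: %w", err)
	}
	_, err := client.DeleteNetworkInterface(&ec2.DeleteNetworkInterfaceInput{
		NetworkInterfaceId: aws.String(branchENIID),
	})
	return err
}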
@@ -126,6 +124,8 @@ type trunkENI struct {
 	uidToBranchENIMap map[string][]*ENIDetails
 	// deleteQueue is the queue of ENIs that are being cooled down before being deleted
 	deleteQueue []*ENIDetails
+	// nodeName tag is the tag added to trunk and branch ENIs created on the node
+	nodeNameTag []*awsEC2.Tag
Can you remind me why we need node name tag for trunk ENI?
In the node termination cleaner (finalizer routine), we use the node name tag to reduce the scope of ENIs to delete per node.
Per offline discussion, this should be updated to use subnet IDs instead, and we can remove the nodeName tag.
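Roughly what scoping by subnet could look like (a sketch only; the filter-based approach and nodeSubnetIDs are assumptions, not code from this PR; awsEC2 is the aws-sdk-go v1 ec2 package alias used elsewhere in the file):

// Describe only the available ENIs in the node's subnet(s) instead of matching a
// per-node name tag.
input := &awsEC2.DescribeNetworkInterfacesInput{
	Filters: []*awsEC2.Filter{
		{
			Name:   aws.String("status"),
			Values: []*string{aws.String(awsEC2.NetworkInterfaceStatusAvailable)},
		},
		{
			Name:   aws.String("subnet-id"),
			Values: aws.StringSlice(nodeSubnetIDs),
		},
	},
}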
start := time.Now()
// Using the instance role
_, err := e.instanceServiceClient.DisassociateTrunkInterface(input)
ec2APICallLatencies.WithLabelValues("disassociate_branch_from_trunk").Observe(timeSinceMs(start))
We should be able to get SDK's metrics for free. Maybe a feature for v2?
	VPCID: vpcID,
}).SetupWithManager(ctx, mgr, healthzHandler); err != nil {
}
cleaner.ENICleaner = &eniCleaner.ENICleaner{
Is ClusterENICleaner for the cluster-wide 30-minute cleanup? I am trying to understand the pattern here.
That is right. ClusterENICleaner is used in the periodic cleanup routine. In the next PR, I will add the NodeTerminationCleaner, which is used in the finalizer routine.
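My mental model of the intended layout, as a schematic only (field names and the NodeTerminationCleaner shape are guesses, since that part lands in the follow-up PR):

// ENICleaner holds the shared logic for finding and deleting leaked ENIs.
type ENICleaner struct {
	ClusterName string
}

// ClusterENICleaner runs the shared logic periodically (every 30 minutes) across the cluster.
type ClusterENICleaner struct {
	*ENICleaner
}

// NodeTerminationCleaner (follow-up PR) reuses the same logic from the node
// finalizer, scoped to the ENIs of a single terminating node.
type NodeTerminationCleaner struct {
	*ENICleaner
	NodeName string
}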
Description of changes:
PR 2, adding support for centralized leaked ENI cleanup in the controller to delete leaked ENIs provisioned by the controller and VPC-CNI:
DisassociateTrunkInterface: call DisassociateTrunkInterface to remove the association between branch and trunk ENI before deleting the branch ENI, as per the EC2 team's recommendation.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.