Egress: There is downtime on startup new NAT clients #287

Closed
4 tasks
terassyi opened this issue May 24, 2024 · 0 comments · Fixed by #288

What

When a new NAT client is created, there is a brief period of downtime caused by the time lag in configuring the link and the route between the client and the NAT pods.

The downtime is caused by the following flow.

  1. A new NAT client pod is created.
  2. CNI add is called, and coild on the scheduled node configures the pod’s address and routes.
  3. pod_watcher on the NAT pod also configures a FoU link for the new client, but has not yet configured the route.
  4. The client side finishes its configuration and starts sending traffic.
  5. The traffic is dropped because the NAT pod has not finished configuring yet!
  6. The NAT pod finishes configuring the route to the client.

How

To minimize the downtime, we change the order so that the FoU link is set up just before the route is added.
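
The intended ordering can be sketched in Go as follows. This is only an illustration, not the actual coil code: the `linkConfigurator` interface, the `recorder` fake, the `addClient` function, and the address `10.0.0.5` are all hypothetical names made up for this example. The real change (#288) operates on kernel links and routes. The sketch only shows the link being brought up as the last step before the route is added, so that the window in which the link is up without a route is as short as possible.

```go
package main

import "fmt"

// linkConfigurator is a hypothetical abstraction over the link and route
// operations. It is NOT an interface from coil; it exists only for this sketch.
type linkConfigurator interface {
	CreateLink(peer string) error // create the FoU link, leaving it down
	SetLinkUp(peer string) error  // bring the FoU link up
	AddRoute(peer string) error   // add the route to the peer via the link
}

// recorder is a fake configurator that records the order of operations
// instead of touching the kernel.
type recorder struct {
	ops []string
}

func (r *recorder) CreateLink(peer string) error {
	r.ops = append(r.ops, "create:"+peer)
	return nil
}

func (r *recorder) SetLinkUp(peer string) error {
	r.ops = append(r.ops, "up:"+peer)
	return nil
}

func (r *recorder) AddRoute(peer string) error {
	r.ops = append(r.ops, "route:"+peer)
	return nil
}

// addClient configures a link and a route for a new peer. The link is created
// down, and it is set up only immediately before the route is added.
func addClient(c linkConfigurator, peer string) error {
	if err := c.CreateLink(peer); err != nil {
		return fmt.Errorf("create link for %s: %w", peer, err)
	}
	// Any other slow preparation would go here, while the link is still down.
	if err := c.SetLinkUp(peer); err != nil {
		return fmt.Errorf("set link up for %s: %w", peer, err)
	}
	if err := c.AddRoute(peer); err != nil {
		return fmt.Errorf("add route for %s: %w", peer, err)
	}
	return nil
}

func main() {
	r := &recorder{}
	if err := addClient(r, "10.0.0.5"); err != nil {
		panic(err)
	}
	// Print the recorded order of operations.
	fmt.Println(r.ops)
}
```

Hiding the operations behind an interface lets a test check the ordering without root privileges or a real network namespace.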

Checklist

  • Finish implementation of the issue
  • Test all functions
  • Have enough logs to trace activities
  • Notify developers of necessary actions