Add hacluster integration to the ops version of kubernetes-control-plane. WIP.
This adds the `ha` relation endpoint and two config options: `ha-cluster-vip` and `ha-cluster-dns`. The implementation in this PR is able to register VIPs or DNS records with hacluster. If used, those VIPs/hostnames are then used as the Kubernetes API endpoints in the kubeconfigs consumed by kubelet, kube-proxy, and end users.

However, this work is so far missing any failover mechanism. If kube-apiserver goes down on the unit holding the VIP, that unit will continue to hold the VIP. In the reactive charm, failover was handled by the charm during update-status hooks, where it checked the status of the control-plane systemd services and updated the Pacemaker node status accordingly. See here. This is obviously not ideal, since it can mean failovers take up to 5 minutes with the default Juju configuration, and they could stop occurring entirely if the charm is in a bad state.
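For illustration, here is a minimal sketch of what the VIP registration could look like on the ops side. The relation-data keys (`json_resources`, `json_resource_params`) follow the format the reactive interface-hacluster layer uses; the class, handler, and resource names below are placeholders rather than the actual code in this PR:

```python
# Hypothetical sketch of VIP registration over the "ha" relation.
# Relation-data keys mirror the reactive interface-hacluster wire format;
# names are illustrative, not taken from this PR.
import json

import ops


class KubernetesControlPlaneCharm(ops.CharmBase):
    def __init__(self, *args):
        super().__init__(*args)
        self.framework.observe(self.on.config_changed, self._configure_hacluster)
        self.framework.observe(self.on["ha"].relation_changed, self._configure_hacluster)

    def _configure_hacluster(self, event):
        relation = self.model.get_relation("ha")
        if not relation:
            return
        vip = self.config.get("ha-cluster-vip", "")
        if not vip:
            return
        # One ocf:heartbeat:IPaddr2 resource per configured VIP.
        resources = {}
        resource_params = {}
        for i, address in enumerate(vip.split()):
            name = f"res_kube_api_vip_{i}"
            resources[name] = "ocf:heartbeat:IPaddr2"
            resource_params[name] = f'params ip="{address}" op monitor interval="10s"'
        data = relation.data[self.unit]
        data["json_resources"] = json.dumps(resources)
        data["json_resource_params"] = json.dumps(resource_params)
```

`ocf:heartbeat:IPaddr2` is the standard resource agent Pacemaker uses for floating VIPs; `ha-cluster-dns` registration would follow the same pattern with DNS resource entries instead.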
Prior to that, we registered the systemd services themselves with hacluster. However, that was removed because of a long history of bugs where Pacemaker took control of the systemd services and failed to run them when it should. See here.
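To make the failover gap concrete, the reactive charm's update-status handling boiled down to roughly the following pattern (a sketch only; the systemd unit name and crmsh invocations are illustrative, not code from this PR):

```python
# Sketch of update-status-driven failover: if the local kube-apiserver is
# unhealthy, put this Pacemaker node in standby so the VIP moves elsewhere;
# bring it back online once the service recovers. Unit name is illustrative.
import subprocess


def service_is_active(unit: str) -> bool:
    result = subprocess.run(["systemctl", "is-active", "--quiet", unit], check=False)
    return result.returncode == 0


def update_pacemaker_node_status() -> None:
    if service_is_active("snap.kube-apiserver.daemon.service"):
        subprocess.run(["crm", "node", "online"], check=True)
    else:
        subprocess.run(["crm", "node", "standby"], check=True)
```

Because this only runs from update-status, failover latency is bounded by the update-status interval, which is where the up-to-5-minute window mentioned above comes from.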
At this point, I see three potential ways to resolve this:
I suspect it may also be worth talking to the OpenStack team to figure out what the trajectory of the hacluster charm is. Will it be getting an ops uplift?