- IPs: Creating a simple HTTP Ingress will allocate an ephemeral IP. Creating an Ingress with a TLS section will allocate a static IP.
- Latency: GLBC is not built for performance. Creating many Ingresses at a time can overwhelm it. It won't fall over, but will take its own time to churn through the Ingress queue.
- Quota: By default, GCE projects are granted a quota of 3 Backend Services. This is insufficient for most Kubernetes clusters.
- Oauth scopes: By default GKE/GCE clusters are granted "compute/rw" permissions. If you setup a cluster without these permissions, GLBC is useless and you should delete the controller as described in the section below. If you don't delete the controller it will keep restarting.
- Default backends: All L7 loadbalancers created by GLBC have a default backend. If you don't specify one in your Ingress, GLBC will assign the 404 default backend mentioned above.
- Load Balancing Algorithms: The ingress controller doesn't support fine grained control over loadbalancing algorithms yet.
- Large clusters: Ingress on GCE isn't supported on large (>1000 nodes), single-zone clusters.
- Teardown: The recommended way to tear down a cluster with active Ingresses is to either delete each Ingress, or hit the
/delete-all-and-quit
endpoint on GLBC, before invoking a cluster teardown script (eg: kube-down.sh). You will have to manually cleanup GCE resources through the cloud console or gcloud CLI if you simply tear down the cluster with active Ingresses. - Changing UIDs: You can change the UID used as a suffix for all your GCE cloud resources, but this requires you to delete existing Ingresses first.
- Cleaning up: You can delete loadbalancers that older clusters might have leaked due to premature teardown through the GCE console.
Before you can receive traffic through the GCE L7 Loadbalancer Controller you need:
- A Working Kubernetes cluster >= 1.1
- At least 1 Kubernetes NodePort Service (this is the endpoint for your Ingress)
- A single instance of the L7 Loadbalancer Controller pod, if you're running Kubernetes < 1.3 (the GCP ingress controller runs on the master in later versions)
GLBC is not aware of your GCE quota. As of this writing users get 5 GCE Backend Services by default. If you plan on creating Ingresses for multiple Kubernetes Services, remember that each one requires a backend service, and request quota. Should you fail to do so the controller will poll periodically and grab the first free backend service slot it finds. You can view your quota:
$ gcloud compute project-info describe --project myproject
See GCE documentation for how to request more.
It takes ~1m to spin up a loadbalancer (this includes acquiring the public IP), and ~5-6m before the GCE API starts healthchecking backends. So as far as latency goes, here's what to expect:
Assume one creates the following simple Ingress:
apiVersion: extensions/v1beta1
kind: Ingress
metadata:
name: test-ingress
spec:
backend:
# This will just loopback to the default backend of GLBC
serviceName: default-http-backend
servicePort: 80
- time, t=0
$ kubectl get ing
NAME RULE BACKEND ADDRESS
test-ingress - default-http-backend:80
$ kubectl describe ing
No events.
- time, t=1m
$ kubectl get ing
NAME RULE BACKEND ADDRESS
test-ingress - default-http-backend:80 130.211.5.27
$ kubectl describe ing
target-proxy: k8s-tp-default-test-ingress
url-map: k8s-um-default-test-ingress
backends: {"k8s-be-32342":"UNKNOWN"}
forwarding-rule: k8s-fw-default-test-ingress
Events:
FirstSeen LastSeen Count From SubobjectPath Reason Message
───────── ──────── ───── ──── ───────────── ────── ───────
46s 46s 1 {loadbalancer-controller } Success Created loadbalancer 130.211.5.27
- time, t=5m
$ kubectl describe ing
target-proxy: k8s-tp-default-test-ingress
url-map: k8s-um-default-test-ingress
backends: {"k8s-be-32342":"HEALTHY"}
forwarding-rule: k8s-fw-default-test-ingress
Events:
FirstSeen LastSeen Count From SubobjectPath Reason Message
───────── ──────── ───── ──── ───────────── ────── ───────
46s 46s 1 {loadbalancer-controller } Success Created loadbalancer 130.211.5.27
GCE has a concept of ephemeral and static IPs. A production website would always want a static IP, which ephemeral IPs are cheaper (both in terms of quota and cost), and are therefore better suited for experimentation.
- Creating a HTTP Ingress (i.e an Ingress without a TLS section) allocates an ephemeral IP, because we don't believe HTTP is the right way to deploy an app.
- Creating an Ingress with a TLS section allocates a static IP, because GLBC assumes you mean business.
- Modifying an Ingress and adding a TLS section allocates a static IP, but the IP will change.
- You can promote an ephemeral to a static IP by hand, if required.
Right now, a kube-proxy NodePort service is a necessary condition for Ingress on GCP. This is because the cloud lb doesn't understand how to route directly to your pods. Incorporating kube-proxy and cloud lb algorithms so they cooperate toward a common goal is still a work in progress. If you really want fine grained control over the algorithm, you should deploy the nginx ingress controller.
Ingress is not yet supported on single zone clusters of size > 1000 nodes (issue). If you'd like to use Ingress on a large cluster, spread it across 2 or more zones such that no single zone contains more than a 1000 nodes. This is because there is a limit to the number of instances one can add to a single GCE Instance Group. In a multi-zone cluster, each zone gets its own instance group.
To completely stop the Ingress controller on GCE/GKE, please see this FAQ.
The Ingress controller configures itself to add the UID it stores in a configmap in the kube-system
namespace.
$ kubectl --namespace=kube-system get configmaps
NAME DATA AGE
ingress-uid 1 12d
$ kubectl --namespace=kube-system get configmaps -o yaml
apiVersion: v1
items:
- apiVersion: v1
data:
uid: UID
kind: ConfigMap
...
You can pick a different UID, but this requires you to:
- Delete existing Ingresses
- Edit the configmap using
kubectl edit
- Recreate the same Ingress
After step 3 the Ingress should come up using the new UID as the suffix of all cloud resources. You can't simply change the UID if you have existing Ingresses, because renaming a cloud resource requires a delete/create cycle that the Ingress controller does not currently automate. Note that the UID in step 1 might be an empty string, if you had a working Ingress before upgrading to Kubernetes 1.3.
A note on setting the UID: The Ingress controller uses the token --
to split a machine generated prefix from the UID itself. If the user supplied UID is found to
contain --
the controller will take the token after the last --
, and use an empty string if it ends with --
. For example, if you insert foo--bar
as the UID,
the controller will assume bar
is the UID. You can either edit the configmap and set the UID to bar
to match the controller, or delete existing Ingresses as described
above, and reset it to a string bereft of --
.
If you deleted a GKE/GCE cluster without first deleting the associated Ingresses, the controller would not have deleted the associated cloud resources. If you find yourself in such a situation, you can delete the resources by hand:
- Navigate to the cloud console and click on the "Networking" tab, then choose "LoadBalancing"
- Find the loadbalancer you'd like to delete, it should have a name formatted as:
k8s-um-ns-name--UUID
- Delete it, check the boxes to also cascade the deletion down to associated resources (eg: backend-services)
- Switch to the "Compute Engine" tab, then choose "Instance Groups"
- Delete the Instance Group allocated for the leaked Ingress, it should have a name formatted as:
k8s-ig-UUID