The aim of this project is to add trace points to applications running in Kubernetes and understand applications' behavior by end-to-end tracing.
As we find that Nginx takes response of Kubernetes external traffic, Our MVP changes to tracing Nginx with Jaeger.
We will implement end-to-end tracing for Kubernetes on the data-plane. This includes instrumenting one or more subsystems within the Kubernetes structure. As data-plane features become a part of the flow and behavior of an application, implementing trace points within them allows for a cross-layer, enabling a complex and deeper understanding of Kubernetes behavior. If issues come up in a distributed cluster operated by Kubernetes, tracing is an idea method to resolve said issues.
Tracing Nginx with Jaeger.
● Pre 1: Understand tracing(Read papers), get familiar with Kubernetes and MOC(Create cluster). https://github.com/BU-NU-CLOUD-SP18/Cross-Layer-Tracing-in-Kubernetes/blob/master/pre%201.pdf
● Pre 2: Explore a one node cluster(Minikube) locally(logging). https://github.com/BU-NU-CLOUD-SP18/Cross-Layer-Tracing-in-Kubernetes/blob/master/pre%202.pdf
● Pre 3: Figure out Kubernetes datapath(Internal: OVS, External: Nginx). https://github.com/BU-NU-CLOUD-SP18/Cross-Layer-Tracing-in-Kubernetes/blob/master/e2e%20tracing%20pre%203.pptx.pdf
● Pre 4: Figure out where to add trace points(Look inside Kubernetes source code, Nginx source code). https://github.com/BU-NU-CLOUD-SP18/Cross-Layer-Tracing-in-Kubernetes/blob/master/pre%204.pdf
● Pre 5: Figure out how to add these trace points(Open tracing framework, recomple Jaeger into C++). https://github.com/BU-NU-CLOUD-SP18/Cross-Layer-Tracing-in-Kubernetes/blob/master/pre%205.pptx
- Create a cluster on MOC: https://github.com/BU-NU-CLOUD-SP18/Cross-Layer-Tracing-in-Kubernetes/blob/master/instructions/Instruction%20of%20create%20a%20cluster%20on%20moc.md
- Deploy Kubernetes on MOC cluster: https://github.com/BU-NU-CLOUD-SP18/Cross-Layer-Tracing-in-Kubernetes/blob/master/instructions/Instruction%20of%20install%20Kubernetes%20on%20MOC.md
- Set up Jaeger on MOC instance and show GUI on local computer: https://github.com/BU-NU-CLOUD-SP18/Cross-Layer-Tracing-in-Kubernetes/blob/master/instructions/Install%20Jaeger%20on%20MOC%20node.md
- Kubernetes logging: https://github.com/BU-NU-CLOUD-SP18/Cross-Layer-Tracing-in-Kubernetes/blob/master/instructions/Kubernetes%20logging.md
- Containerize an application and make it a service in Kubernetes cluster: https://github.com/BU-NU-CLOUD-SP18/Cross-Layer-https://github.com/BU-NU-CLOUD-SP18/Cross-Layer-Tracing-in-Kubernetes/blob/master/pre%205.pdf
● Tracing: Tracing is a method to understand the performance/correctness of deploying distributed applications. When an app makes a request, it will generate a unique request ID and we name it as ‘Traceing ID’. Tracing ID propagate through the system with the request. Every time the request calls a component of the system, an event labeled by request's tracing ID is triggered. If we collect all the events and sort them into different requests by tracing ID, we will see series of events like 'threads' belongs to different requests, from which we can understand apps' behavior.
Three steps of tracing:
1.Adding trace points to datapath: Let commponents on datapth be 'tracable'.
2.Tracing ID propagation: Figure out a method to make sure tracing ID can propagate though layers.
3.Huge repository storage to store tracing information: Massive tracing information will be generated so we need huge storage to hold them.
● End to end tracing: End to End (e2e) Tracing is to follow the execution of requests infrastructure along the entirety of its “PATH” of propagation (including its dataplane), to provide detailed tracing for capacity planning and performance analysis
● Open tracing: Since developers and engineers have started to trade in the old monolithic systems for today’s microservice architectures tasks that were easy before have become difficult. Modern day distributed tracing systems aim to address these issues but they do so using application level instrumentation using incompatible APIs. Open Tracing aims to address these issues using consistent, vendor neutral APIs for popular platforms.
● Jaeger: Jaeger is a distributed tracing system released as open source by Uber Technologies. It can be used for monitoring microservices-based distributed systems. Jaeger backend, Web UI, and instrumentation libraries have been designed from ground up to support the OpenTracing standard.
Jaeger tracing example
HotRod: a sample application that has had tracing implemented by Jaeger. Tutorial link: https://medium.com/opentracing/take-opentracing-for-a-hotrod-ride-f6e3141f7941
● Distributed system: A distributed system is a network that consists of autonomous computers that are connected using a distribution middleware. They help in sharing different resources and capabilities to provide users with a single and integrated coherent network. The computers interact with each other in order to achieve a common goal.
● Container: A container provides an isolated environment with a unique namespace in which an app can run with its environment. The environment satisfies a description of a set of resources required by an app. App can only use the resources defined in this namespace and doesn’t know what is outside its container.
● Kubernetes: Kubernetes is an orchestration system coordinating a highly available cluster of computers for deploying, scaling and managing containerized components of a distributed application in a datacenter. It makes these applications agnostic and isolated from other containers deployed on the same node e.g. machine.
What Kubernetes does:
1.It allocates resources to containers to have them meet the required description
2.It offers a unique namespace to each container
3.It can scale in & scale out an application and make replication) when required
4.Provides an abstraction through which each container is able to communicate with the outside world
● Control plane: The Control Plane maintains a record of all of the Kubernetes Objects in the system, and runs continuous control loops to manage those objects’ state. At any given time, the Control Plane’s control loops will respond to changes in the cluster and work to make the actual state of all the objects in the system match the desired state that you provided.
● Data plane: The decision making part of Kubernetes which decides what has to be done with each containers according to its description. Data Plane is the part which enforces all of control plane’s decisions and it allows applications to remain agnostic to their surroundings.
● Services: A microservice; an abstraction which defines a set of Kubernetes pods and a policy to access them.
● Ingress: Typically, services and pods have IPs only routable by the cluster network. All traffic that ends up at an edge router is either dropped or forwarded elsewhere. Conceptually, this might look like:
Internet
|
------------
[ Services ]
An Ingress is a collection of rules that allow inbound connections to reach the cluster services.
Internet
|
[ Ingress ]
--|-----|--
[ Services ]
● Ingress controller: An Ingress Controller is a daemon, deployed as a Kubernetes Pod, that watches the apiserver's /ingresses endpoint for updates to the Ingress resource. Its job is to satisfy requests for Ingresses. You can choose any load balancer that provides an Ingress controller, which is software you deploy in your cluster to integrate Kubernetes and the load balancer.
● Niginx is a chooseable http headers load balancer for Kubernetes. It take response of Kubernetes external traffic. So this is where we are going to add tracing points.
From NGINX website: NGINX is a free, open-source, high-performance HTTP server and reverse proxy, as well as a proxy server. NGINX is known for its high performance, stability, rich feature set, simple configuration, and low resource consumption.
● Tracing OVS.
● Tracing Kubernetes data plane.
● Tracing Kubernetes control plane.
● Tracing application running in Kubernetes, on MOC.
● Compare the Kubernetes tracing and application tracing, have a deeper understand of Kubernetes behavior.
● Collect performance matrices, such as latency and bottlenecks.