Skip to content

Commit

Permalink
blog post
Browse files Browse the repository at this point in the history
  • Loading branch information
michaelfitzo committed Nov 6, 2024
1 parent abfd949 commit 3eacd84
Show file tree
Hide file tree
Showing 3 changed files with 89 additions and 12 deletions.
17 changes: 5 additions & 12 deletions gen3/docs/blog/authors.yml
Original file line number Diff line number Diff line change
Expand Up @@ -3,15 +3,8 @@ authors:
name: Jawad Qureshi
description: Lead Platform Engineer
avatar: https://github.com/jawadqur.png

# yangshun:
# name: Yangshun Tay
# title: Front End Engineer @ Facebook
# url: https://github.com/yangshun
# image_url: https://github.com/yangshun.png

# slorber:
# name: Sébastien Lorber
# title: Docusaurus maintainer
# url: https://sebastienlorber.com
# image_url: https://github.com/slorber.png
elise:
name: Elise Castle
description: Platform Engineer - Gen3
url: https://github.com/EliseCastle23
avatar: https://github.com/EliseCastle23.png
Binary file added gen3/docs/blog/posts/Grafana.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
84 changes: 84 additions & 0 deletions gen3/docs/blog/posts/gen3-observability.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,84 @@
---
draft: false
slug: Observability
authors:
- elise
tags:
- Observability
- Helm Chart
- Grafana
- Loki
- Mimir
- Alloy
- Faro Collector
- Real User Monitoring (RUM)
- Metrics
- Log Aggregation
- Dashboards
- Kubernetes Monitoring
- Time-Series Database
- Alerting
- Frontend Monitoring
- Grafana Dashboards
- Open Source Monitoring
date: 2024-10-15
categories:
- Operator

---

# Deploying a Comprehensive Observability Stack with Helm
Monitoring and observability are essential for maintaining modern infrastructure and applications. With the new Observability Helm Chart, setting up a robust monitoring system is easier than ever. This chart provides an integrated stack featuring Grafana for visualizations, Loki for log aggregation, and Mimir for metrics storage and querying. Alloy can then be deployed in any cluster to collect logs and metrics to foward to Loki and Mimir. Additionally, you can optionally deploy the Faro Collector Helm Chart to further enhance observability by supporting Real User Monitoring (RUM) via the Fence Service.

## Overview of the Observability Helm Chart
The Observability Helm Chart deploys a complete observability solution to your Kubernetes cluster. It bundles three core components:

### Grafana:
An industry-leading visualization platform that allows users to create dashboards, track metrics, and set alerts.
### Mimir:
A scalable time-series database optimized for efficiently storing and querying metrics across applications and infrastructure.
### Loki:
A log aggregation system designed to index and query logs with minimal resource usage, seamlessly integrating with Grafana.

## General Architecture

In this setup, Loki and Mimir are configured with internal ingress resources, enabling Alloy to send metrics and logs securely via VPC peering connections. Both Loki and Mimir write the ingested data to Amazon S3 for scalable and durable storage. This data can be queried and visualized through Grafana, which is hosted behind an internet-facing ingress. Access to Grafana can be restricted using CIDR ranges defined through the ALB ingress annotation: alb.ingress.kubernetes.io/inbound-cidrs: "cidrs". Additionally, the chart supports SAML authentication for Grafana, configured through the grafana.ini field, ensuring secure user access.

<p align="center">
<img src="Grafana.png" alt="Grafana Architecture" />
</p>


### Fips compliant images

Gen3 provides FIPS-compliant images, which are set as the default in the values file for Grafana, Mimir, and Loki. These images are self-hosted and maintained by the Gen3 Platform Team, ensuring secure and compliant operations. The Platform Team is responsible for managing image upgrades, and service versions will be updated as deemed necessary by the team.

### Built-in Gen3 Alerts

This Helm chart comes equipped with built-in Gen3 alerts, defined in the 'alerting' section of the values.yaml. These alerts enable you to immediately leverage your logs and metrics as soon as Grafana is up and running.

### Built-in Gen3 Dashboards

We'll soon be releasing Gen3 dashboards, providing users with Gen3-specific visualizations. Please check back here to see if they have been released.

## Alloy and Faro: Enhancing Observability

### Alloy:
Collects logs and metrics from your services and sends them to Loki and Mimir for storage and analysis. Alloy acts as a bridge between your services and the observability stack, ensuring data flows smoothly to the right destinations.
### Faro Collector:
A specialized configuration of Alloy designed to collect Real User Monitoring (RUM) data from Grafana Faro. This setup captures frontend metrics.

## Helm Charts Overview
Observability Helm Chart: Deploys Grafana, Loki, and Mimir as the foundation of your observability platform.

Alloy Helm Chart: Configures Alloy to collect logs and metrics and forward them to Loki and Mimir. Alloy can be deployed in a separate cluster or VPC or it can be deployed in multiple clusters/vpcs.

Faro Collector Helm Chart: Adds RUM data collection to the stack by configuring Alloy to receive frontend metrics from Grafana Faro.

## Conclusion
This new suite of Helm charts provides everything you need to monitor your Gen3 instance.

To see detailed instructions on how to set up these charts, please refer to the following links:
- [observability.md](https://github.com/uc-cdis/gen3-docs/blob/main/docs/tutorials/observability.md)
- [alloy.md](https://github.com/uc-cdis/gen3-docs/blob/main/docs/tutorials/alloy.md)
- [faro.md](https://github.com/uc-cdis/gen3-docs/blob/main/docs/tutorials/faro.md)

0 comments on commit 3eacd84

Please sign in to comment.