You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Automation Tooling, CI/CD, SDKs, reference and example projects, guides and API documentation for developers working on managed services and their dependencies.
29
+
Addons, Automation Tooling, CI/CD, SDKs, reference and example projects, guides and API documentation for developers working on managed services and their dependencies.
Copy file name to clipboardexpand all lines: content/en/docs/creating-addons/monitoring/deadmanssnitch_integration.md
+16-13
Original file line number
Diff line number
Diff line change
@@ -5,22 +5,24 @@ date: 2022-12-15T00:53:51+01:00
5
5
---
6
6
7
7
## Overview
8
+
8
9
[Dead Man's Snitch (DMS)](https://deadmanssnitch.com/) is essentially a constantly firing prometheus alert and an external receiver
9
10
(called a snitch) that will alert should the monitoring stack go down and stop sending alerts.
10
11
The generation of "snitch" urls is dynamic on cluster/add-on installed.
11
12
It's done via the [DMS operator](https://github.com/openshift/deadmanssnitch-operator),
12
13
which runs on hive and is owned by SREP.
13
14
14
15
## Usage
16
+
15
17
The Add-On metadata file (`addon.yaml`) allows you to provide a `deadmanssnitch` field (see `deadmansnitch` field in
16
18
the Add-On metadata file [schema documentation](https://github.com/mt-sre/managed-tenants-cli/blob/main/docs/tenants/zz_metadata_schema_generated.md)
17
19
for more information).
18
20
This field allows you to provide the required Dead Man's Snitch integration configuration. A `DeadmansSnitchIntegration`
19
21
resource is then created and applied to Hive alongside the Add-On [SelectorSyncSet (SSS)](https://github.com/openshift/hive/blob/master/docs/syncset.md#selectorsyncset-object-definition).
20
22
21
-
22
23
### DeadmansSnitchIntegration Resource
23
-
The default DMS configurations which will be created if you specify the bare minimum fields under 'deadmanssnitch'
24
+
25
+
The default DMS configurations which will be created if you specify the bare minimum fields under 'deadmanssnitch'
24
26
field in addon metadata:
25
27
26
28
```yaml
@@ -30,7 +32,7 @@ field in addon metadata:
30
32
name: addon-{{ADDON.metadata['id']}}
31
33
namespace: deadmanssnitch-operator
32
34
spec:
33
-
clusterDeploymentSelector: ## can be overriden by .deadmanssnitch.clusterDeploymentSelector field in addon metadata
35
+
clusterDeploymentSelector: ## can be overridden by .deadmanssnitch.clusterDeploymentSelector field in addon metadata
34
36
matchExpressions:
35
37
- key: {{ADDON.metadata['label']}}
36
38
operator: In
@@ -41,13 +43,13 @@ field in addon metadata:
41
43
name: deadmanssnitch-api-key
42
44
namespace: deadmanssnitch-operator
43
45
44
-
snitchNamePostFix: {{ADDON.metadata['id']}} ## can be overriden by .deadmanssnitch.snitchNamePostFix field in addon metadata
46
+
snitchNamePostFix: {{ADDON.metadata['id']}} ## can be overridden by .deadmanssnitch.snitchNamePostFix field in addon metadata
- [RHOAM addon: extraResources](https://gitlab.cee.redhat.com/service/managed-tenants/-/blob/09cf5112e7dc5588c14f158d6490f7f1e7051c6a/addons/managed-api-service-internal/metadata/production/addon.yaml#L40) field in `addon.yaml`
for more details on how this test harness will be run and how it is expected to
12
17
report results.
13
18
14
19
## Primer into OSD E2E tests and prow jobs
15
-
To ensure certain things such as validating that the addon can be easily and successfully installed on a customer’s cluster,
20
+
21
+
To ensure certain things such as validating that the addon can be easily and successfully installed on a customer’s cluster,
16
22
we have prow jobs setup which run e2e tests (one test suite per addon) every 12 hours.
17
23
If the e2e tests corresponding to any addon fail, then automated alerts/notifications are sent to the addon team.
18
-
Every addon's e2e tests are packaged in an image called “testHarness”, which is built and pushed to [quay.io](https://quay.io)
24
+
Every addon's e2e tests are packaged in an image called “testHarness”, which is built and pushed to [quay.io](https://quay.io)
19
25
by the team maintaining the addon.
20
-
Once the "testHarness" image is built and pushed, the team must register their addon to testHarness image’s e2e tests
21
-
by making a PR against this file as it holds the configuration about all the registered e2e tests. For example, this section. ??
26
+
Once the "testHarness" image is built and pushed, the team must register their addon to testHarness image’s e2e tests
27
+
by making a PR against [this file](https://github.com/openshift/release/blob/master/ci-operator/jobs/openshift/osde2e/openshift-osde2e-main-periodics.yaml).
22
28
23
-
You can access the portal for prow jobs [here](https://prow.ci.openshift.org). The prow jobs follow the below steps to
29
+
You can access the portal for prow jobs [here](https://prow.ci.openshift.org). The prow jobs follow the below steps to
24
30
run the e2e tests. For every e2e test defined inside [this file](https://github.com/openshift/release/blob/master/ci-operator/jobs/openshift/osde2e/openshift-osde2e-main-periodics.yaml):
25
-
* An OSD cluster is created and the addon, which is being tested, is installed. Openshift API is used to perform these operations via the API definition provided at https://api.openshift.com // TODO: Last sentence not needed?
26
-
* The e2e prow job definition, specifically for the addon from [this file](https://github.com/openshift/release/blob/master/ci-operator/jobs/openshift/osde2e/openshift-osde2e-main-periodics.yaml), is parsed and hence, the parameters required to run its e2e tests will be recognized as well.
27
-
* The "testHarness" image for the addon is parsed and executed against the parameters fetched from the above step.
28
-
* If an MT-SRE team member notices those tests failing, they should notify the respective team to take a look at them and fix them.
29
31
32
+
* An OSD cluster is created and the addon, which is being tested, is installed. Openshift API is used to perform these
33
+
operations via the API definition provided at <https://api.openshift.com>
34
+
* The e2e prow job definition, specifically for the addon from [this file](https://github.com/openshift/release/blob/master/ci-operator/jobs/openshift/osde2e/openshift-osde2e-main-periodics.yaml),
35
+
is parsed and hence, the parameters required to run its e2e tests will be recognized as well.
36
+
* The "testHarness" image for the addon is parsed and executed against the parameters fetched from the above step.
37
+
* If an MT-SRE team member notices those tests failing, they should notify the respective team to take a look at them and fix them.
0 commit comments