Fix Creation of AWS Sandboxes + Containerize #103

fridim · 2025-01-15T21:08:26Z

The creation of new AWS sandboxes is currently broken and is a manual process that is run from different places.

ansible playbooks to create the accounts
cypress automation to enable the accounts in access.redhat.com
playbook again to validate the sandboxes and make sure the GOLD images there

Instead, make it possible to run it as a single OpenShift job.

Tasks:

- [X] Create a container image that has all the software needed for the creation of a new sandbox - [X] Fix Locales in Containerfile.admin - [X] Create a wrapper script for automation => Python - [X] Automatically guess the next sandbox number from all different DB (prod and dev) - [X] Add profiling callback to ansible creation playbook runs. - [X] Enable Gold images by using the new HCC (console) APIs instead of Cypress + access.redhat.com There is a transition from access.redhat.com web page to HCC (console.redhat.com) Advantages: - much much faster - less dependencies in the images (roughly -500MB) - [X] Status script `creation_status.py` -- list creation in progress freshly created sandbox - [X] Add a `--retry sandbox123` capability - [ ] Add an Org Policy to p protect anything that is required by HCC (role, ...) - [ ] make slow task async in the playbook - [X] New feature: provide the reservation name, by default new sandboxes end up in a 'new' reservation - [X] sandboxes are created in a 'untested' reservation first. After the functional tests, if successful, we move the new sandboxes to the target reservation (default 'new') - [ ] Create monitoring dashboard or at least scripts for the creation - [ ] Add a test to ensure Vault value is correct. Try to read one key with the passed vault secret. If it doesn't work, exit. That will prevent accidentally creating sandboxes with a vault different that the one currently in use for the 'target DB' - [ ] allow to change the target OU - [ ] document (upstream and confluence) - [ ] Package everything for OpenShift: use OpenShift job to run the creation

jkupferer · 2025-01-15T22:11:25Z

playbooks/roles/infra-aws-sandbox/tasks/route53.yml

-      retries: 5
-      delay: "{{ 60|random(start=3, step=1) }}"
+      retries: 10
+      delay: "{{ 10|random(start=3, step=1) }}"


This random delay does not work as expected. Unfortunately Ansible only evaluates the delay value once when starting the task rather than for each retry. So if it gets 5 it will be a 5 second delay it will be the same delay for each retry.

fridim added 2 commits January 15, 2025 21:38

Add argument to toggle playbook output easily

8a2364e

jkupferer reviewed Jan 15, 2025

View reviewed changes

fridim added 6 commits January 16, 2025 15:39

Add sec, operation and billing info to account

b008ec7

Add option to disable hcc and validation. Fix guessing

17a8a44

Add the ability to skip playbook/validation/hcc

a8abe44

Handle concurrency better by backing off

933c282

Improve retry validation

212ac4c

Improve performance + validation

74ec314

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Creation of AWS Sandboxes + Containerize #103

Fix Creation of AWS Sandboxes + Containerize #103

fridim commented Jan 15, 2025 •

edited

Loading

jkupferer Jan 15, 2025

Fix Creation of AWS Sandboxes + Containerize #103

Are you sure you want to change the base?

Fix Creation of AWS Sandboxes + Containerize #103

Conversation

fridim commented Jan 15, 2025 • edited Loading

jkupferer Jan 15, 2025

Choose a reason for hiding this comment

fridim commented Jan 15, 2025 •

edited

Loading