tests: refactor build and deploy test case #179

DnPlas · 2024-02-12T14:59:45Z

Remove the assertion that tests the kubeflow-dashboard-operator goes to Blocked when the relation with kubeflow-profiles is not there. This assertion is already covered in unit tests, and it is only adding noise at deployment time.

~~This commit ensures the dependencies like charms and relations are deployed on time which will give time for everything to be set up before moving ahead with other test cases.~~

Remove the assertion that tests the kubeflow-dashboard-operator goes to Blocked when the relation with kubeflow-profiles is not there. This assertion is already covered in unit tests, and it is only adding noise at deployment time. This commit ensures the dependencies like charms and relations are deployed on time which will give time to everything to be setup before moving ahead with other test cases.

ca-scribner · 2024-02-12T19:51:25Z

@DnPlas can you give more context? afaict 178 is failing for legitimate reasons. That charm is hitting error during a relation-changed event, not blocked.

In principle I think I'm with you about how a unit test is sufficient to test if we go Blocked when missing a relation. But now I'm wondering if by removing this wait_for_idle we end up hiding another error entirely.

DnPlas · 2024-02-13T10:32:06Z

@DnPlas can you give more context? afaict 178 is failing for legitimate reasons. That charm is hitting error during a relation-changed event, not blocked.

Yes, the reason why #178 is failing (and potentially any other CI) is because the kubeflow-profile-controller is in maintenance mode, in fact Waiting for pod startup to complete, which can cause the kubeflow-dashboard to fail with hook failed: "kubeflow-profiles-relation-changed" because the dashboard charm depends on the profiles info (see here). By deploying both kubeflow-dashboard and kubeflow-profiles at the same time and wait for idle for BOTH charms, we avoid this.

In principle I think I'm with you about how a unit test is sufficient to test if we go Blocked when missing a relation. But now I'm wondering if by removing this wait_for_idle we end up hiding another error entirely.

It's still there, we are actually waiting for BOTH charms to be active and idle with a timeout of 600s.

ca-scribner · 2024-02-13T13:49:22Z

The CI is terminating because kubeflow-dashboard is in error:

kubeflow-dashboard/0* error idle 10.1.34.139 hook failed: "kubeflow-profiles-relation-changed"

juju.errors.JujuUnitError: Unit in error: kubeflow-dashboard/0

It should not be possible that this sort of error be caused by kubeflow-profiles in any way (whether it is in maintenance mode, etc). So what I'm saying is that the the CI is catching a legitimate error here where kubeflow-dashboard fails to handle the relation-changed event, and changing the CI to suppress it is not what we should do

ca-scribner

After some discussions and looking into #178, this now lgtm. #178 had a separate error so there is nothing the current PR hides on us that we need to worry about, and having an extra wait_for_idle/status check here isn't needed as @DnPlas mentions since we can check this in unit tests.

The only thing I'd change here is the model config setting. Let's handle that as a separate PR, so it is easier to trace if something goes wrong.

DnPlas · 2024-02-15T09:19:17Z

Thanks @ca-scribner for the review, agreed on the model config setting. Will update the PR shortly.

This reverts commit 09e5935.

DnPlas requested a review from a team as a code owner February 12, 2024 14:59

github-actions bot added the Libraries: Out of sync label Feb 12, 2024

DnPlas mentioned this pull request Feb 12, 2024

chore: use charmedkubeflow/kubeflow-central-dashboard as oci-image #178

Merged

skip: automatically-retry-hooks false

09e5935

ca-scribner suggested changes Feb 14, 2024

View reviewed changes

DnPlas added 3 commits February 15, 2024 10:21

Revert "skip: automatically-retry-hooks false"

c72607c

This reverts commit 09e5935.

skip: bring back active

d0431cb

Merge branch 'main' into KF4392-refactor-integration-tests

5a5def6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tests: refactor build and deploy test case #179

tests: refactor build and deploy test case #179

DnPlas commented Feb 12, 2024 •

edited

Loading

ca-scribner commented Feb 12, 2024

DnPlas commented Feb 13, 2024

ca-scribner commented Feb 13, 2024

ca-scribner left a comment

DnPlas commented Feb 15, 2024

tests: refactor build and deploy test case #179

Are you sure you want to change the base?

tests: refactor build and deploy test case #179

Conversation

DnPlas commented Feb 12, 2024 • edited Loading

ca-scribner commented Feb 12, 2024

DnPlas commented Feb 13, 2024

ca-scribner commented Feb 13, 2024

ca-scribner left a comment

Choose a reason for hiding this comment

DnPlas commented Feb 15, 2024

DnPlas commented Feb 12, 2024 •

edited

Loading