feat: sanitization of resources for matching #2042

csviri · 2023-09-04T12:01:57Z

No description provided.

csviri · 2023-09-04T12:11:22Z

@metacosm @shawkins this is how we could fill in the data that currently causes trouble with SSA matching (by preprocessing the desired resource). Please let me know what you think; we can discuss whether we want to go this way.
The PR also adds some checks for known issues.

This is also something we would be able to turn off with a feature flag.
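
To illustrate the preprocessing idea, here is a minimal sketch, assuming resources are modeled as plain maps rather than real Kubernetes types. The class `DesiredDefaultsFiller` and its method are hypothetical and are not the code in this PR; `volumeMode: Filesystem` is used only as an example of a field the API server defaults.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical illustration, not JOSDK code: a resource fragment is modeled as a plain map.
final class DesiredDefaultsFiller {

  private DesiredDefaultsFiller() {}

  /**
   * Returns a copy of the desired fragment in which every top-level key that is
   * missing, but known to be defaulted by the server, is filled in.
   * Values set explicitly in the desired fragment always win.
   */
  static Map<String, Object> fillDefaults(
      Map<String, Object> desired, Map<String, Object> serverDefaults) {
    Map<String, Object> sanitized = new LinkedHashMap<>(serverDefaults);
    sanitized.putAll(desired); // explicit desired values override the defaults
    return sanitized;
  }
}
```

With a desired fragment of `{storageClassName=standard}` and an actual fragment of `{storageClassName=standard, volumeMode=Filesystem}`, a whole-fragment equality check fails before sanitization and succeeds after `fillDefaults(desired, Map.of("volumeMode", "Filesystem"))`.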

shawkins · 2023-09-05T16:39:57Z

I'd still advocate for something like that update filter which will check to see if old and new are the same to help prevent reconciliation loops - with logging it could also help to capture additional cases that would need to be addressed by something like this pr.

Another consideration is making the use of SSA type based - in particular for Secrets. Upstream they certainly didn't feel there was much need to use SSA for every kind.
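
The update filter described above could look roughly like the following sketch. It assumes a simplified resource model; `ResourceSnapshot` and `MeaningfulChangeFilter` are hypothetical names and not JOSDK or fabric8 types.

```java
import java.util.Map;
import java.util.Objects;
import java.util.function.BiPredicate;
import java.util.logging.Logger;

// Hypothetical stand-in for a Kubernetes resource; not a fabric8 or JOSDK type.
final class ResourceSnapshot {
  final String resourceVersion; // changes on every write, so it is ignored by the filter
  final Map<String, String> labels;
  final Map<String, Object> spec;

  ResourceSnapshot(String resourceVersion, Map<String, String> labels, Map<String, Object> spec) {
    this.resourceVersion = resourceVersion;
    this.labels = labels;
    this.spec = spec;
  }
}

// Accepts an update event only when something other than bookkeeping metadata changed.
final class MeaningfulChangeFilter implements BiPredicate<ResourceSnapshot, ResourceSnapshot> {
  private static final Logger log = Logger.getLogger(MeaningfulChangeFilter.class.getName());

  @Override
  public boolean test(ResourceSnapshot oldResource, ResourceSnapshot newResource) {
    boolean unchanged = Objects.equals(oldResource.labels, newResource.labels)
        && Objects.equals(oldResource.spec, newResource.spec);
    if (unchanged) {
      // Logging skipped events helps surface applies that changed nothing meaningful.
      log.fine(() -> "Skipping update " + oldResource.resourceVersion
          + " -> " + newResource.resourceVersion);
    }
    return !unchanged;
  }
}
```

Which fields count as bookkeeping is a design choice; this sketch compares only labels and spec, which is an assumption made for brevity.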

metacosm · 2023-09-06T10:27:28Z

...io/javaoperatorsdk/operator/processing/dependent/kubernetes/KubernetesDependentResource.java

@@ -189,7 +192,11 @@ protected void addMetadata(boolean forMatch, R actualResource, final R target, P
    addReferenceHandlingMetadata(target, primary);
  }

-  private boolean useSSA(Context<P> context) {
+  protected void sanitizeDesired(R desired, R actual, P primary, Context<P> context) {
+    DesiredResourceSanitizer.sanitizeDesired(desired, actual, primary, context);


Design-wise, this shouldn't be a static call, but rather a per-type instance so that we use polymorphism instead of having a big if-else cascade based on type with a default no-op being used in most cases.

csviri · 2023-09-06

I would not complicate this for now while it is still small; we can refactor when there is more of it.
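
For reference, the per-type design suggested in this thread could be sketched as follows. The names `Sanitizer` and `SanitizerRegistry` are hypothetical; this is not how the PR implements it.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical design sketch: one sanitizer instance per resource type.
interface Sanitizer<R> {
  void sanitize(R desired, R actual);
}

final class SanitizerRegistry {
  // Default used for every type that has no registered sanitizer.
  private static final Sanitizer<Object> NO_OP = (desired, actual) -> {};

  private final Map<Class<?>, Sanitizer<?>> byType = new HashMap<>();

  <R> void register(Class<R> type, Sanitizer<? super R> sanitizer) {
    byType.put(type, sanitizer);
  }

  // The lookup replaces an if-else cascade on the resource type.
  @SuppressWarnings("unchecked")
  <R> Sanitizer<R> forType(Class<R> type) {
    return (Sanitizer<R>) byType.getOrDefault(type, NO_OP);
  }
}
```

A caller would register sanitizers once and then call `registry.forType(type).sanitize(desired, actual)`; types without a registration fall through to the no-op.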

csviri · 2023-09-06T10:35:27Z

> I'd still advocate for something like that update filter which will check to see if old and new are the same to help prevent reconciliation loops - with logging it could also help to capture additional cases that would need to be addressed by something like this pr.

Yes, this is also something we should consider, but these are not necessarily mutually exclusive, are they? I mean, the reconciliation will be triggered by other sources, and then matching will fail.

> Another consideration is making the use of SSA type based - in particular for Secrets. Upstream they certainly didn't feel there was much need to use SSA for every kind.

This is a good question/point: how should we approach the case where we don't really advise using SSA for certain resource types? Should this be pre-configured, or just documented?

shawkins · 2023-09-06T11:34:38Z

> Yes, this is also something we should consider, but these are not necessarily mutually exclusive, are they? I mean, the reconciliation will be triggered by other sources, and then matching will fail.

I'm not saying they are exclusive - just want to make sure it's included in comprehensively addressing the problems.

There is the problem behavior of performing an SSA when one is not needed. There are two possible consequences:

- Nothing happens on the server side, which the Kubernetes folks say should have about the same performance as a get call. Avoiding this is good, but it will only really cause problems for users when there is a lot of API server load.
- A new revision is created (the behavior that has been noted with at least Secrets and Ingresses). This is dangerous, as it can cause the operator SDK to simply keep reconciling.

metacosm · 2023-09-06T12:17:13Z

To me, all this is sounding like SSA is not really ready for mainstream usage. It's too fraught with subtleties and edge cases that can get you in trouble.

csviri · 2023-09-06T12:26:58Z

> I'm not saying they are exclusive - just want to make sure it's included in comprehensively addressing the problems.

Yes, I think there is a separate PR for that, and I agree with that part. We only have to look at how to add such a filter automatically and put it behind a feature flag.

shawkins · 2023-09-06T14:32:49Z

> This is a good question/point: how should we approach the case where we don't really advise using SSA for certain resource types? Should this be pre-configured, or just documented?

I'd opt for pre-configured in the case of Secrets rather than defaulting to an error condition if stringData is used. Out of an abundance of caution, ConfigMaps should be treated similarly.

> To me, all this is sounding like SSA is not really ready for mainstream usage. It's too fraught with subtleties and edge cases that can get you in trouble.

Right and the feedback upstream is just to not use it when it's not working as expected.

An alternative is to change SSA to opt-in per DR rather than opt-out.

csviri · 2023-09-06T15:04:55Z

> I'd opt for pre-configured in the case of Secrets rather than defaulting to an error condition if stringData is used. Out of an abundance of caution, ConfigMaps should be treated similarly.

I think those are two different things again. I agree that we could say that Secrets and ConfigMaps should not use SSA by default. But if we introduce a feature flag, someone could turn it on and still use stringData; then we can still throw the exception, and if someone does not use stringData, it should work as expected.

In other words, I prefer such documentation as code, as long as it does not add too much complexity.
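
A minimal sketch of such a "documentation as code" check follows. `SecretSketch` and `StringDataCheck` are hypothetical names, not the types used in this PR. The check relies on the fact that `stringData` is a write-only convenience field: the API server merges it into `data` and does not return it on reads, so a desired Secret that sets it cannot be compared field by field with the actual one.

```java
import java.util.Map;

// Hypothetical stand-in for a Secret; not the fabric8 Secret type.
final class SecretSketch {
  final Map<String, String> data;       // base64-encoded values, returned by the API server
  final Map<String, String> stringData; // write-only plain-text values, never returned on reads

  SecretSketch(Map<String, String> data, Map<String, String> stringData) {
    this.data = data;
    this.stringData = stringData;
  }
}

final class StringDataCheck {

  private StringDataCheck() {}

  /** Fails fast when SSA is enabled and the desired Secret uses stringData. */
  static void check(SecretSketch desired, boolean ssaEnabled) {
    boolean usesStringData = desired.stringData != null && !desired.stringData.isEmpty();
    if (ssaEnabled && usesStringData) {
      throw new IllegalStateException(
          "stringData cannot be matched when SSA is enabled; use data or turn SSA off");
    }
  }
}
```

The exception message doubles as guidance for the user, which is the point of encoding the rule in code rather than only in the docs.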

shawkins · 2023-09-06T15:22:41Z

> I think those are two different things again.

Not entirely. In practical terms, what is the advantage of using SSA with a Secret? I'd argue that it's very minimal.

csviri · 2023-09-07T10:42:53Z

> > I think those are two different things again.

> Not entirely. In practical terms, what is the advantage of using SSA with a Secret? I'd argue that it's very minimal.

I agree, although I can imagine multiple controllers independently filling in values in a Secret or ConfigMap.

So basically I would have both: don't use SSA by default for Secrets and ConfigMaps, but if somebody turns it on, we can still throw the exception if stringData is used.

Or do you think this approach is overkill?
I see dependent resources as the high-level approach, where the framework handles such common mistakes instead of the user. That also helps users understand the issues, and by default things work as intended. (Although what is intended is sometimes buried in k8s.)

shawkins · 2023-09-07T10:56:15Z

> I agree, although I can imagine multiple controllers independently filling in values in a Secret or ConfigMap.

They can, but the existing merging strategies are pretty straight-forward for doing that.

csviri · 2023-09-07T11:02:53Z

> > I agree, although I can imagine multiple controllers independently filling in values in a Secret or ConfigMap.

> They can, but the existing merging strategies are pretty straight-forward for doing that.

So what are the alternatives? Should we not allow CM/Secret to be used with SSA?

Or just set it by default non-SSA, and that is it?

shawkins · 2023-09-08T09:56:05Z

> Or just set it by default non-SSA, and that is it?

I'm fine with the operator framework being opinionated - whatever is likely to be the best / simplest solution should be the default. For Secrets that would be to not use SSA. That way no one migrating from an earlier JOSDK version will be surprised to find that they need to change their logic away from using stringData or have to turn off SSA.

csviri · 2023-09-11T12:50:33Z

@metacosm @shawkins I added an additional config that opts Secret and ConfigMap out of SSA by default. The additional checks are still there, so for now this takes the "cover every aspect" approach: check everything we can, since it does not increase complexity much. Most of these checks are trivial to read and understand.
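
The resulting decision logic can be sketched as below, assuming a global SSA flag, a set of kinds that default to non-SSA, and an optional per-dependent override. `SsaPolicy` and its members are hypothetical names chosen for illustration and do not reflect the actual configuration API.

```java
import java.util.Set;

// Hypothetical sketch of the opt-out-by-default policy; not the JOSDK configuration API.
final class SsaPolicy {
  private final boolean ssaEnabledGlobally;
  private final Set<String> defaultNonSsaKinds;

  SsaPolicy(boolean ssaEnabledGlobally, Set<String> defaultNonSsaKinds) {
    this.ssaEnabledGlobally = ssaEnabledGlobally;
    this.defaultNonSsaKinds = defaultNonSsaKinds;
  }

  /** SSA on in general, but Secret and ConfigMap are opted out unless overridden. */
  static SsaPolicy defaults() {
    return new SsaPolicy(true, Set.of("Secret", "ConfigMap"));
  }

  /**
   * @param kind the resource kind, for example "Secret"
   * @param perDependentOverride explicit choice for one dependent resource, or null if unset
   */
  boolean useSsa(String kind, Boolean perDependentOverride) {
    if (perDependentOverride != null) {
      return perDependentOverride; // an explicit user choice always wins
    }
    return ssaEnabledGlobally && !defaultNonSsaKinds.contains(kind);
  }
}
```

With the defaults, `useSsa("Deployment", null)` is true and `useSsa("Secret", null)` is false, while `useSsa("Secret", true)` turns SSA back on for that one dependent, which is where a stringData check would still apply.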

shawkins

LGTM, just one nit. Do you want me to refine #2028 into something that will automatically detect/prevent reconciliation loops with SSA?

openshift-ci bot added the do-not-merge/work-in-progress label Sep 4, 2023

csviri self-assigned this Sep 4, 2023

csviri force-pushed the automatic-corrections-dr branch from c0724cf to 3a352e1 Compare September 4, 2023 12:02

csviri linked an issue Sep 4, 2023 that may be closed by this pull request

Issues with SSA Matching and Approach to Handle Them #2038

Closed

csviri requested review from shawkins and metacosm September 4, 2023 12:03

metacosm reviewed Sep 6, 2023

openshift-merge-robot added the needs-rebase label Sep 9, 2023

csviri force-pushed the next branch from 39bc7af to a3e2c24 Compare September 11, 2023 08:27

csviri force-pushed the automatic-corrections-dr branch from 3a352e1 to f31b5ce Compare September 11, 2023 12:15

openshift-merge-robot removed the needs-rebase label Sep 11, 2023

csviri requested a review from metacosm September 11, 2023 12:49

csviri marked this pull request as ready for review September 11, 2023 12:49

openshift-ci bot removed the do-not-merge/work-in-progress label Sep 11, 2023

openshift-ci bot requested a review from adam-sandor September 11, 2023 12:49

shawkins approved these changes Sep 11, 2023

...va/io/javaoperatorsdk/operator/processing/dependent/kubernetes/DesiredResourceSanitizer.java (outdated)

csviri added 9 commits September 12, 2023 13:37

test

ddf02dc

Signed-off-by: Attila Mészáros <[email protected]>

feat: sanitization of resources for matching

3306281

Signed-off-by: Attila Mészáros <[email protected]>

add default resources

ee1970e

Signed-off-by: Attila Mészáros <[email protected]>

format

5faf9d7

Signed-off-by: Attila Mészáros <[email protected]>

generic configs for default non SSA resources

4266e6c

Signed-off-by: Attila Mészáros <[email protected]>

format

128328a

Signed-off-by: Attila Mészáros <[email protected]>

docs

62f6b91

Signed-off-by: Attila Mészáros <[email protected]>

remove unnecessary line

81f9830

Signed-off-by: Attila Mészáros <[email protected]>

fix it

34874e0

Signed-off-by: Attila Mészáros <[email protected]>

csviri force-pushed the automatic-corrections-dr branch from 926ae47 to 34874e0 Compare September 12, 2023 11:37

fix IT

c2fbacf

Signed-off-by: Attila Mészáros <[email protected]>

metacosm reviewed Sep 12, 2023

...va/io/javaoperatorsdk/operator/processing/dependent/kubernetes/DesiredResourceSanitizer.java

metacosm force-pushed the automatic-corrections-dr branch from 08a0f4a to 5fb89ea Compare September 12, 2023 14:16

metacosm added 3 commits September 12, 2023 16:18

refactor: make intent more explicit

ec9c355

Signed-off-by: Chris Laprun <[email protected]>

fix: remove duplicated line

43056e0

Signed-off-by: Chris Laprun <[email protected]>

docs: improve javadoc

0c4db58

Signed-off-by: Chris Laprun <[email protected]>

metacosm force-pushed the automatic-corrections-dr branch from 5fb89ea to 0c4db58 Compare September 12, 2023 14:19

metacosm approved these changes Sep 12, 2023

csviri merged commit 2827206 into next Sep 12, 2023

csviri deleted the automatic-corrections-dr branch September 12, 2023 15:00

This was linked to issues Sep 14, 2023

A dependent Statefulset resource is always updated when reconcile is triggered #1989

Closed

NullPointerException in SSABasedGenericKubernetesResourceMatcher #2032

Closed

metacosm pushed a commit that referenced this pull request Sep 15, 2023

feat: sanitization of resources for matching (#2042)

3fe0f8e

Signed-off-by: Attila Mészáros <[email protected]>

csviri added a commit that referenced this pull request Sep 18, 2023

feat: sanitization of resources for matching (#2042)

ec96cd5

Signed-off-by: Attila Mészáros <[email protected]>

csviri added a commit that referenced this pull request Sep 18, 2023

feat: sanitization of resources for matching (#2042)

a42c8a0

Signed-off-by: Attila Mészáros <[email protected]>

csviri added a commit that referenced this pull request Oct 3, 2023

feat: sanitization of resources for matching (#2042)

50511e6

Signed-off-by: Attila Mészáros <[email protected]>

shawkins pushed a commit to shawkins/java-operator-sdk that referenced this pull request Oct 4, 2023

feat: sanitization of resources for matching (operator-framework#2042)

e2283a1

Signed-off-by: Attila Mészáros <[email protected]>

csviri added a commit that referenced this pull request Oct 4, 2023

feat: sanitization of resources for matching (#2042)

002a25c

Signed-off-by: Attila Mészáros <[email protected]>

csviri added a commit that referenced this pull request Oct 4, 2023

feat: sanitization of resources for matching (#2042)

857ba1c

Signed-off-by: Attila Mészáros <[email protected]>

csviri added a commit that referenced this pull request Oct 18, 2023

feat: sanitization of resources for matching (#2042)

50f844b

Signed-off-by: Attila Mészáros <[email protected]>
