-
Notifications
You must be signed in to change notification settings - Fork 249
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add retry support to resolving in opm alpha render-template
#1513
Comments
So you're saying that we should, in the case of the 403 that you give, retry? How can we tell the difference between a real 403, and this one that you say is being made up erroneously by the registry? If it's a real 403, retrying is a waste of time and if a tool output "Error 403, retrying.", I'd really wonder about the tool. Can you give a more relevant example, like a server-side fail where we don't properly retry? Because it looks like here:
we use the standard Go retry code and you're just hitting a non-retriable error, no? |
Unfortunately, yes 😢 According to the data from the image prefetching service for our CI clusters (not directly related to our A quick calculation shows that with a catalog template of 100 bundles (roughly the size of ours at the moment) there is a ~0.7% chance of the
I think you cannot! 😭
Actually looking at the code you pointed at, you already seem to retry fetching on all errors, retriable or not! The only non-retried case is the sentinel error for docker schema v1 manifest. So this is good from my PoV.
I actually took a closer look, and judging from the error message I pasted, and from the registry code you pointed at, it seems like the error happens during resolving, here, which happens before the retry handling you pointed out. There do not seem to be any retries around the Could I ask for opt-in retrying of all errors around that call? |
opm alpha render-template
opm alpha render-template
@porridge you're right, the code I highlighted isn't the code in question. @grokspawn I think we could add (optional?) retries at resolve time? |
With a larger catalog, this command might want to pull a large number of images.
In my experience even the public production image registries are not reliable enough to return all images successfully.
What is important, even some errors which seem to be client-side issues (such as 403s, see following example) need to be retried, as in my experience they are regularly reported as intermittent issues by (apparently) confused registries.
While it is possible to retry the whole
opm
command, this is not terribly effective, since everything is re-fetched from scratch, so the probability of success does not increase.The text was updated successfully, but these errors were encountered: