-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Don't write GND-lastSuccessfulUpdate.txt and send alert mail if exceptions were caught #338
Comments
It seems this is not actually related. I was thinking of ed7e70a, but the thing I saw in the log [1] seems unrelated, since it's happening before running the Metafacture workflow, in ConvertUpdates:
Also, this happened for the 2022-12-15 update, so it's unrelated to the problem that triggered this issue (which happened on 2022-12-19). For this 2022-12-15 problem, there seems to be some issue with the dates used when retrying:
It seems like after an error when getting 2022-12-16, we're getting 2022-12-15 to recover. Looking into the issue on 2022-12-19, there also seems to be a problem related to the date, though different (calling from 2022-12-20 until 2022-12-19):
So I think these are the tasks here:
[1] Looking at output of e.g. |
Looks like these were the days with errors during updates since 2022-10-13 (the data state of the latest baseline, see https://data.dnb.de/opendata/, "Stand:"), collected by looking at the output of
|
So it seems the interval setting for multi-day updates was broken since 6e554aa ("Tweak end date for updates to avoid redundant data", 2022-12-09), resulting in the end date not being incremented. Fixed that in 0d7602f. It seems the result was that when a single day update failed for some other reason, the following multi-day update failed too, for this reason. After my recent manual updates, the GND-lastSuccessfulUpdate.txt was set to some date in the past (the end date of my last manual update), triggering this problem, resulting in missing updates again. I've deployed 0d7602f to quaoar1 and ran the multi-day update since 2022-12-20 to restore the data and test the fix, looks good:
The reason we did not get an alert for the second issue (the follow-up error, caused by the faulty end date) seems to be that we only retry and alert if we got no data at all, so if e.g. the first day of multi-day updates worked, but the second fails, we get no alert. Opened #339 to improve that. Assigning @dr0i for review, won't open a PR, changes are only 07d7b3b & 0d7602f. |
Since we don't abort our transformations on errors any more, but skip them instead, we have to make sure that the
GND-lastSuccessfulUpdate.txt
is written only if there were no exceptions, in order for the updates to be retried the next day (we noticed this due to a report about missing updates, seems it was a temporary 502 from the OAI-PMH proxy, worked when re-running the updates now). The existing retry-and-mail logic doesn't seem to work, probably due to the exception handling (not thrown, no retry, no mail?).The text was updated successfully, but these errors were encountered: