-
Notifications
You must be signed in to change notification settings - Fork 130
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Verify sha before cache write #2112
Verify sha before cache write #2112
Conversation
works, must be seen if this resolves https://tests.stockfishchess.org/actions?max_actions=1&action=&user=&text=validate&before=1720333358.645925&run_id= where a corrupted net ends up in the cache. Not clear how this can happen, at least this check will exclude the download part of the issue |
It should solve the issue. The request package may create corrupt downloads. |
but can it do without triggering an error (i.e. we call raise_for_status)? Edit we do similar stuff for the zipballs we get from github, and somehow we can't as easily verify those. Edit, we could probably try to unzip them. |
I think it can although the mechanics are not clear to me. I suppose the library receives the correct http headers but then the download is aborted. But in principle the library should be able to verify that the correct number of bytes has been received. |
Or maybe there is a bug in the error handling in the worker (I.e. an error is raised but not handled correctly). Can't check now. Traveling. |
It seems older versions of the requests package did not check content length. Could it be that the requests package in the worker is outdated? |
|
That request version was the last compatible one with python 3.6.8 used by noob's workers. Lines 30 to 33 in 12981ff
|
Last version on pypi is 2.32.3. |
so yeah, I think we're running into this. |
Of course we could check it ourselves since we are using our own wrapper.
…On Sun, Jul 7, 2024, 12:37 PM Joost VandeVondele ***@***.***> wrote:
psf/requests#4956 <psf/requests#4956>
so yeah, I think we're running into this.
—
Reply to this email directly, view it on GitHub
<#2112 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABSOQ34LRN7ZMZJHOI7HATTZLELADAVCNFSM6AAAAABKPEOQG2VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDEMJSGQYDEOJUHE>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
looking into that... it is non-trivial. |
giving up on checking the content-length, I don't get it to work (seems like it can depend on encoding, etc). |
I am surprised by this... https://stackoverflow.com/questions/36194656/http-content-length-and-content-encoding |
Ok I see. You cannot check it with the length of the downloaded file. |
extend the original PR by moving the write of zipballs to after they have been unzipped and thus effectively been verified. Now both blobs written to cache are known to be OK. |
9384306
to
64cd8fe
Compare
The bundled request package doesn't verify the content downloaded is complete. Hence verify that that content is correct before storing it to cache. For the net perform a sha verify, for the zipball unzip first. Raise worker version to 246 (also server side).
64cd8fe
to
60b0c07
Compare
For an invalid net the code:
Is it the intended workflow? EDIT: there is a successive validation of the net, I'm doing a bit of refactoring to simplify the code. |
fd77775
to
d6967e6
Compare
- write in testing dir and cache only validated net - delete an invalid net from the cache - improve the readibility of the code
d6967e6
to
6f1b875
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good on DEV
Workers upgraded, thank you @vondele :) |
No description provided.