-
Notifications
You must be signed in to change notification settings - Fork 561
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Spurious failure extracting zip archive #171
Comments
Ok so that error originates from inside zip's ...but yeah, I'm not really sure how we'd get an invalid zip file here unless the HTTP download failed somehow? I wonder if we could hash the zip file and store that digest as a header when storing in S3, and compare on download? Seems like something that shouldn't happen, in any event. |
Oh for some reason I thought that's what happened already but apparently not! I think that we uploaded a valid zip archive b/c we're not getting 100% failure rate on MSVC right now. Presuambly they're all getting the same cached value and later builds succeed after one fails. In that sense I think that this is a download failure of some form. As to what kind of download failure... unsure! We're checking for a I'm not sure if S3 could serve us invalid content? In any case though, one thing we could do is to detect a failed extraction of the archive and just count it as a cache miss maybe? That may be difficult to thread through. |
We've seen this error too, on a different project: https://ci.pytorch.org/jenkins/job/pytorch-builds/job/py2-devtoolset7-rocmrpm-centos7.5-build/2203//console
It's durable, so it definitely looks like there is something corrupted inside the cache. |
I think it doesn't happen anymore |
We've run into
Invalid checksum
errors a few times when working on rust-lang/rust, for example at https://ci.appveyor.com/project/rust-lang/rust/build/1.0.4224/job/ow4l9bb15wy56sht. This string apparently appears in thezip
crate and comes from an invalid crc32 checksum.How that actually managed to happen I'm not entirely sure! I'm not sure if this is a corrupt entry in the cache or a failed download, or if the download failed why it wasn't caught sooner...
The text was updated successfully, but these errors were encountered: