-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dependency Issue for Kerchunk -> Icechunk via Virtualizarr #321
Comments
Matthew Iannucci has been helpfully tracking the compatibility between Icechunk, VirtualiZarr, and Kerchunk in earth-mover/icechunk#197. The tl;dr is that until fsspec/kerchunk#516 is completed/merged to add Zarr-Python v3 compatibility in Kerchunk one would need to work off Matt's fork and branch to use these libraries in the same environment, since icechunk requires Zarr-Python v3. |
@maxrjones is correct that we're waiting on kerchunk to support zarr-python v3, and as icechunk requires zarr-python v3 currently the released versions of kerchunk and icechunk are incompatible. But as you're working with netCDF4 data you actually have a few different options:
|
Hi both, thanks very much for the suggestions! I will attempt both the first and second solutions in the next few days, I'm just trying to get a picture of how it all works for now. Solution 3 suggested above has an additional issue that it can't be used where some variables/dimensions are written inline in the kerchunk file (i.e base64 encoded) - this is listed as a ToDo in the error message. Most of the kerchunk files we've produced have inline components for some dimensions as this is more performant than having to make many very small requests to compose a dimension. Thanks again for the suggestions! |
Hi all, I'm relatively new to using virtualizarr but have been developing tools using Kerchunk for some time, specifically a package around large-scale conversions in parallel for thousands of datasets in our data archives.
I'm attempting to use the Virtualizarr library to concatenate some NetCDF4 data into a virtual dataset, then write out as an Icechunk store to disk. My issue is that it seems Icechunk requires the new zarr v3 pre-release, but Kerchunk (used to create the virtual dataset) needs Zarr < 3. I've so far been unable to resolve this dependency issue. Any suggestions for how to go about solving this would be appreciated, thanks!
My example code:
I either get an issue importing kerchunk (if I uninstall that to reinstall the zarr v3 pre-release) or an issue with zarr when trying to create the Icechunk store.
The text was updated successfully, but these errors were encountered: