-
Notifications
You must be signed in to change notification settings - Fork 1
Description
Via @jpivarski (and @martindurant) I learned about the Awkward Array project. Development has been mainly driven by high energy physics use cases, but the need for variable-length data structures and other "awkard" structures is common to other domains, and work in Awkward Array could inform if/how Zarr might support some of these features. A recent update from @jpivarski:
I've finished a lot of development on my side, and Awkward is ready to use (front page; Doxygen C++ and Sphinx Python documentation is done; tutorials are not). I remember that the interaction between regular-sized arrays and variable-sized arrays was important for your data, so RegularArrays (C++, Python) are worth taking a look at.
I'd also like to know if Zarr is taking a more columnar approach to ragged arrays. Even if it's not, I could write a C++ function to de-interlace list sizes from list contents, which could then be exposed to the Python layer for Zarr → Awkward for analysis. (There's an awkward1._io extension module for these sorts of things, including some special cases for ROOT.)