Skip to content

Connecting with Awkward Array #29

@alimanfoo

Description

@alimanfoo

Via @jpivarski (and @martindurant) I learned about the Awkward Array project. Development has been mainly driven by high energy physics use cases, but the need for variable-length data structures and other "awkard" structures is common to other domains, and work in Awkward Array could inform if/how Zarr might support some of these features. A recent update from @jpivarski:

I've finished a lot of development on my side, and Awkward is ready to use (front page; Doxygen C++ and Sphinx Python documentation is done; tutorials are not). I remember that the interaction between regular-sized arrays and variable-sized arrays was important for your data, so RegularArrays (C++, Python) are worth taking a look at.

I'd also like to know if Zarr is taking a more columnar approach to ragged arrays. Even if it's not, I could write a C++ function to de-interlace list sizes from list contents, which could then be exposed to the Python layer for Zarr → Awkward for analysis. (There's an awkward1._io extension module for these sorts of things, including some special cases for ROOT.)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions