
TODOs to get zarr arrow in a reasonable, usable state #21

Open
maximedion2 opened this issue Jun 30, 2024 · 2 comments

Comments

@maximedion2
Collaborator

maximedion2 commented Jun 30, 2024

This will be a list of TODOs for the overall project of writing a query engine for Zarr files (and eventually other raster formats... maybe). I'm going to split the overall project into 3 phases, numbered 0, 1 and 2. Each TODO on the list will eventually be assigned an issue with more details and a PR for the implementation.

@maximedion2
Collaborator Author

maximedion2 commented Jun 30, 2024

Phase 0:
This phase is about implementing the foundation for a query engine that shamelessly leverages two facts: 1) Zarr is a heavily chunked-up storage format, and 2) raster data typically includes arrays that represent some sort of coordinates, with most queries involving filtering on those coordinates. As I'm making this list, I already have the basics implemented; what's left is:

  • Use io_uring for reading local files. Issue for the task.
  • Implement reading 1D arrays and broadcasting them to 2D or 3D arrays to minimize I/O. Issue for the task.
  • Fix bug with chunk alignment and sharding. Issue for the task.
  • Implement a proper non-blocking async reader. Issue for the task.
  • Implement metrics for filter push down optimization.
  • Somehow test queries against something other than local files, e.g. AWS S3.
  • Investigate the performance profile of the whole process, in particular why reading the data seemingly only takes a small fraction of the time it takes for a simple "select *" query (what is taking up most of the time?!). Issue for the task.
  • Implement a cache mechanism based on spatial indices to improve the performance of some filter push downs.
  • Clean up/refactor code after I implement everything else. The overall design and goals have changed a bit since I started writing the code.
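
The 1D-broadcast item above can be sketched in a few lines. This is a minimal illustration of the idea (the function name and shapes are hypothetical, not the crate's actual API): a 1D coordinate array is read once and expanded into a 2D row-major buffer in memory, so the coordinate data never has to be stored or fetched fully materialized.

```rust
/// Broadcast a 1D array to a row-major 2D buffer, either so that each value
/// fills an entire row (`values.len() x other_dim`) or so that the whole
/// array is repeated as every row (`other_dim x values.len()`).
fn broadcast_1d(values: &[f64], other_dim: usize, along_rows: bool) -> Vec<f64> {
    if along_rows {
        // each value is repeated `other_dim` times to fill one row
        values
            .iter()
            .flat_map(|&v| std::iter::repeat(v).take(other_dim))
            .collect()
    } else {
        // the whole 1D array becomes one row, tiled `other_dim` times
        (0..other_dim).flat_map(|_| values.iter().copied()).collect()
    }
}

fn main() {
    let lats = [10.0, 20.0];
    // 2 x 3 grid where each latitude fills one row
    let grid = broadcast_1d(&lats, 3, true);
    assert_eq!(grid, vec![10.0, 10.0, 10.0, 20.0, 20.0, 20.0]);
    println!("{:?}", grid);
}
```

The point of doing this at read time, rather than storing 2D coordinate arrays, is that the I/O cost stays proportional to the 1D array's length.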

Phase 1:
This phase will be about implementing a more generic version of the query engine that can support various raster formats. The broad steps will be:

  • Define a trait with all the methods needed for a "raster reader", which file/store wrappers will need to implement.
  • Implement a "raster reader".
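
As a rough sketch of what that trait could look like (all names here are hypothetical, nothing in the crate is defined yet), the engine would only depend on a handful of methods that any format-specific wrapper can provide:

```rust
/// A chunk of raster data decoded into a flat buffer plus its shape.
struct RasterChunk {
    shape: Vec<usize>,
    data: Vec<f64>,
}

/// What a file/store wrapper would implement so the generic query engine
/// can drive it, regardless of the underlying format (Zarr, etc.).
trait RasterReader {
    /// Names of the variables (arrays) available in the store.
    fn variables(&self) -> Vec<String>;
    /// Shape of one variable, e.g. [time, y, x].
    fn shape(&self, variable: &str) -> Vec<usize>;
    /// Read one chunk of one variable by its chunk index.
    fn read_chunk(&self, variable: &str, chunk_index: &[usize]) -> RasterChunk;
}

/// Toy in-memory store with a single 2x2 variable, just to show the trait in use.
struct InMemoryStore;

impl RasterReader for InMemoryStore {
    fn variables(&self) -> Vec<String> {
        vec!["temperature".to_string()]
    }
    fn shape(&self, _variable: &str) -> Vec<usize> {
        vec![2, 2]
    }
    fn read_chunk(&self, _variable: &str, _chunk_index: &[usize]) -> RasterChunk {
        RasterChunk { shape: vec![2, 2], data: vec![1.0, 2.0, 3.0, 4.0] }
    }
}

fn main() {
    let store = InMemoryStore;
    let chunk = store.read_chunk("temperature", &[0, 0]);
    assert_eq!(chunk.shape, vec![2, 2]);
    assert_eq!(chunk.data.len(), 4);
    println!("read {} values", chunk.data.len());
}
```

In practice the real trait would presumably be async and return Arrow arrays rather than plain `Vec<f64>`, but the chunk-oriented shape of the interface is the part that carries over across formats.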

Phase 2:
This phase will be about implementing efficient geospatial queries that will work off of WKT strings. Realistically, I'm not going to implement a completely new data type in DataFusion; I will have to rely on passing strings to geospatial functions, or on transforming data (like the 2 floats for a point) into a string that can then be passed to geospatial functions. The steps would be:

  • Make sure that the geo Rust library supports spatial indexing and operations on multiple geometries at once; if not, see if I can help out the project and implement it.
  • Implement within and intersect operations.
  • Implement distance and intersection operations.
  • Implement more "exotic" geospatial operations.
  • Implement a hypothetical "nearest" operation to allow for a "join nearest...".
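
The "transform data into a string" idea above is simple enough to sketch directly (the function name is hypothetical): two floats for a point get encoded as a WKT string, which is what would then be handed to the geospatial functions.

```rust
/// Encode an (x, y) pair as a WKT POINT string, so that geospatial
/// functions operating on WKT can consume it without a native geometry type.
fn point_to_wkt(x: f64, y: f64) -> String {
    format!("POINT({} {})", x, y)
}

fn main() {
    let wkt = point_to_wkt(-73.56, 45.5);
    assert_eq!(wkt, "POINT(-73.56 45.5)");
    println!("{}", wkt);
}
```

The cost of this approach is a parse/format round trip on every call, which is part of why relying on strings rather than a native DataFusion geometry type is framed as a pragmatic compromise here.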

@maximedion2
Collaborator Author

@tshauck feel free to add anything here of course.
