DataFusion is a very fast, extensible query engine for building high-quality data-centric systems in Rust, using the Apache Arrow in-memory format. Python Bindings are also available. DataFusion offers SQL and Dataframe APIs, excellent performance, built-in support for CSV, Parquet, JSON, and Avro, extensive customization, and a great community.
Here are links to some important information
- Project Site
- Rust Getting Started
- Rust DataFrame API
- Rust API docs
- Rust Examples
- Python DataFrame API
- Architecture
DataFusion is great for building projects and products like SQL interfaces, time series platforms, and domain specific query engines. Click Here to see a list known users.
The developer’s guide contains information on how to contribute.