Highlights
- Pro
Stars
Turning PySpark Into a Universal DataFrame API
Cross-platform lib for process and system monitoring in Python
A hyper-fast local vector database for use with LLM Agents. Now accepting SAFEs at $135M cap.
🔥 Blazing fast bulk data transfers between any cloud 🔥
Always know what to expect from your data.
Duelyst is a digital collectible card game and turn-based strategy hybrid, developed by Counterplay Games.
Aqueduct is no longer being maintained. Aqueduct allows you to run LLM and ML workloads on any cloud infrastructure.
do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.
A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton
Low-code framework for building custom LLMs, neural networks, and other AI models
RISE: "Live" Reveal.js Jupyter/IPython Slideshow Extension
A light-weight, flexible, and expressive statistical data testing library
modin-project / modin-spreadsheet
Forked from quantopian/qgridAn interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks
Fastest library to load data from DB to DataFrames in Rust and Python
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
Run your code in the cloud, with technology so advanced, it feels like magic!
The source code that powers readthedocs.org
Platform to build admin panels, internal tools, and dashboards. Integrates with 25+ databases and any API.
git-for-windows / git
Forked from git/gitA fork of Git containing Windows-specific patches.
DuckDB is an analytical in-process SQL database management system
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks
A reactive Python kernel for Jupyter notebooks.