Stars
Lakekeeper: A Rust native Iceberg REST Catalog
Lightweight and extensible compatibility layer between dataframe libraries!
Chapter: Open Source GTM for Developer Tools.
Next generation compute platform for the post-modern data stack
Simple framework of typical data app actions (data readers, transforms, inference, data writers) for Python.
Scheduling infrastructure for absolutely everyone.
Video Clip Analytics By Using Dataflow and Video AI For Object Tracking
Streaming Anomaly Detection Solution by using Pub/Sub, Dataflow, BQML & Cloud DLP
[Moved to https://github.com/standardnotes/app] A free, open-source, and end-to-end encrypted notes app. https://standardnotes.com
An Apache Beam source to connect and consume data from TREP using the Websocket API.
Model analysis tools for TensorFlow
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
A Scala API for Apache Beam and Google Cloud Dataflow.
This project contains a basic pipeline for migrating a MS SQL Server catalog to a BigQuery dataset.
Opinion Analysis of News, Threaded Conversations, and User Generated Content