Data Engineering examples for Airflow, Prefect, and Mage.ai; dbt for BigQuery, Redshift, ClickHouse, PostgreSQL; Spark/PySpark for Batch processing; and Kafka for Stream processing
Updated Feb 9, 2025 - Python
A simple demo showing how to use Ably and FastAPI to route messages into Kafka for stream processing
Current 2022 Confluent Keynote Demo covering Stream Designer, Stream Catalog, and Stream Sharing.
For recreational use. Just a playground of Kafka+Spark+MQTT+ksqlDB+others
Interactive ksqlDB command line client with autocompletion and syntax highlighting written in Python
Pythonic KSQL REST API - Next Gen.
Real-time Coinbase market data streaming pipeline with visualizations. Much appreciation to DataTalks.Club Data Engineering Zoom Camp: https://github.com/DataTalksClub/data-engineering-zoomcamp
Kafka Connect and ksqlDB with Oracle
An app that keeps track of YouTube videos and sends a notification to a Telegram bot whenever anyone comments on them
Streaming event pipeline around Apache Kafka and its ecosystem, simulating real-time data streaming
Free and simple way to interact with ksqlDB using a UI
This project demonstrates a modern ETL (Extract, Transform, Load) streaming pipeline using various open-source technologies.
Kubernetes demo
Building a streaming Kafka application to push live notifications for updates to views, likes, favorites and comments on YouTube videos.
Real-time fraud analysis using Kafka Streams
This repository contains a ksqlDB setup that streams the average ETH gas estimate, using the Ethereum Gas Estimate API as source data.