Skip to content

Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake, BigQuery, Redshift, Databricks) in real-time.

License

Notifications You must be signed in to change notification settings

artie-labs/transfer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

0075d1e · Mar 26, 2025
Mar 17, 2025
Mar 25, 2025
Jan 29, 2025
Mar 24, 2025
Mar 26, 2025
Mar 13, 2025
Feb 13, 2025
May 9, 2024
Aug 23, 2024
Jan 16, 2023
Mar 21, 2025
Feb 14, 2025
Mar 21, 2025
Mar 21, 2025
Dec 18, 2024
Mar 15, 2025

Repository files navigation

Artie Transfer

Artie Transfer

⚡️ Blazing fast data replication between OLTP and OLAP databases ⚡️


Learn more »

Artie Transfer is a real-time data replication solution for databases and data warehouses/lakes.

Typical ETL solutions rely on batched processes or schedulers (i.e. DAGs, Airflow), which means the data in the downstream data warehouse is often several hours to days old. This problem is exacerbated as data volumes grow, as batched processes take increasingly longer to run.

Artie leverages change data capture (CDC) and stream processing to perform data syncs in a more efficient way, which enables sub-minute latency.

Benefits of Artie Transfer:

  • Sub-minute data latency: always have access to live production data.
  • Ease of use: just set up a simple configuration file, and you're good to go!
  • Automatic table creation and schema detection: Artie infers schemas and automatically merges changes to downstream destinations.
  • Reliability: Artie has automatic retries and processing is idempotent.
  • Scalability: handle anywhere from 1GB to 100+ TB of data.
  • Monitoring: built-in error reporting along with rich telemetry statistics.

Take a look at this guide to get started!

Architecture

Examples

To run Artie Transfer's stack locally, please refer to the examples folder.

Getting started

Getting started guide

What is currently supported?

Transfer is aiming to provide coverage across all OLTPs and OLAPs databases. Currently Transfer supports:

  • Message Queues

    • Kafka (default)
  • Destinations:

    • BigQuery
    • Databricks
    • Iceberg (through S3 Tables)
    • Microsoft SQL Server
    • Redshift
    • S3
    • Snowflake
  • Sources:

    • DocumentDB
    • DynamoDB
    • Microsoft SQL Server
    • MongoDB
    • MySQL
    • Oracle
    • PostgreSQL

If the database you are using is not on the list, feel free to file for a feature request.

Configuration File

Telemetry

Artie Transfer's telemetry guide

Tests

Transfer is written in Go and uses counterfeiter to mock. To run the tests, run the following commands:

make generate
make test

Release

Artie Transfer is released through GoReleaser, and we use it to cross-compile our binaries on the releases as well as our Dockerhub. If your operating system or architecture is not supported, please file a feature request!

License

Artie Transfer is licensed under ELv2. Please see the LICENSE file for additional information. If you have any licensing questions please email hi@artie.com.

About

Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake, BigQuery, Redshift, Databricks) in real-time.

Topics

Resources

License

Stars

Watchers

Forks

Languages