slilichenko/dataflow-bigquery

Beam to BigQuery

This repo contains sample code for testing different BigQuery persistence methods in Apache Beam pipelines.

Environment setup

export PROJECT_ID=<project_id>
export GCP_REGION=<region>
export BIGQUERY_REGION=us
source setup-env.sh
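
Before sourcing the setup script, it can help to confirm that the three variables above are actually set. The following is a minimal sketch, not part of the repository: the `check_env` function name is hypothetical, and it only assumes that `setup-env.sh` reads `PROJECT_ID`, `GCP_REGION` and `BIGQUERY_REGION` as listed above.

```shell
# Hypothetical pre-flight check (not part of this repo).
# Prints "ok" when all three variables are non-empty; otherwise
# names the first missing variable and returns a non-zero status.
check_env() {
  for var in PROJECT_ID GCP_REGION BIGQUERY_REGION; do
    # Portable indirect lookup of the variable named in $var.
    eval "val=\${$var:-}"
    if [ -z "$val" ]; then
      echo "missing: $var"
      return 1
    fi
  done
  echo "ok"
}

# Possible usage:
#   check_env && source setup-env.sh
```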

Start event generation for streaming pipelines

./start-event-generation.sh <events-per-second>

Run a pipeline

cd pipeline
./run-dataflow-<method>.sh <params>

For the Storage Write API:

./run-dataflow-storage-streaming.sh <number-of-streams> <triggering-frequency-in-seconds>
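
Both arguments of the Storage Write API script are numeric, so a small guard can catch swapped or malformed values before a job is launched. This is a sketch under stated assumptions: `validate_storage_args` is a hypothetical helper that is not in the repository, and rejecting zero is an assumption about what the script accepts. The values `3` and `5` in the usage comment are illustrative only.

```shell
# Hypothetical helper (not part of this repo): checks that both
# arguments are positive integers before the pipeline is launched.
validate_storage_args() {
  if [ "$#" -ne 2 ]; then
    echo "usage: validate_storage_args <number-of-streams> <triggering-frequency-in-seconds>" >&2
    return 1
  fi
  for arg in "$1" "$2"; do
    case "$arg" in
      # Reject empty strings and anything containing a non-digit.
      ''|*[!0-9]*)
        echo "not a positive integer: $arg" >&2
        return 1
        ;;
    esac
    # Assumption: zero streams or a zero-second frequency is not meaningful.
    if [ "$arg" -eq 0 ]; then
      echo "must be greater than zero: $arg" >&2
      return 1
    fi
  done
  return 0
}

# Possible usage (illustrative values):
#   validate_storage_args 3 5 && ./run-dataflow-storage-streaming.sh 3 5
```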

About

Dataflow to BigQuery IO comparison
