Skip to content

godatadriven/data-pipelines-with-airflow-2nd-ed

 
 

Repository files navigation

Data Pipelines with Apache Airflow

Code accompanying the Manning book Data Pipelines with Apache Airflow.

Structure

Overall, this repository is structured as follows:

├── chapter01                # Code examples for Chapter 1.
├── chapter02                # Code examples for Chapter 2.
├── ...
├── .pre-commit-config.yaml  # Pre-commit config for the CI.
├── CHANGELOG.md             # Changelog detailing updates to the code.
├── LICENSE                  # Code license.
├── README.md                # This readme.
└── requirements.txt         # CI dependencies.

The chapterXX directories contain the code examples for each specific Chapter.

Code for each Chapter is generally structured something like follows:

├── dags                  # Airflow DAG examples (+ other code).
├── docker-compose.yml    # Docker-compose file used for running the Chapter's containers.
└── readme.md             # Readme with Chapter-specific details, if any.

Usage

Details for running specific chapter examples are available in the corresponding chapter's readme. In general, most code examples are run using docker-compose, together with the provided docker-compose.yml file in each chapter. This docker-compose file will take care of spinning up the required resources and start an Airflow instance for you. Once everything is running, you should be able to run the examples in Airflow using your local browser.

Some later Chapters (such as Chapters 11 and 13) may require a bit more setup. The details for doing so are described in the corresponding readme's and in the Chapter's themselves.

About

Code for the second edition of Data Pipelines with Apache Airflow Book

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 91.8%
  • Shell 4.2%
  • Dockerfile 2.4%
  • Other 1.6%