Skip to content

nemusa/ecommerce-pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A sample ELT pipeline for a public ecommerce dataset

The project uses Kaggle ecommerce dataset to demonstrate a simple ELT pipeline using Apache Airflow and Google Cloud Platform.

Running the project

Follow the instructions in Development Environment to set up the development environment.

Source data

The source data structure is documented in dataset.md

IDE setup

To set up a local virtualenv for the IDE suggestions, use the following commands:

python3.10 -m venv venv/
pip install --upgrade pip
pip install 'apache-airflow==2.8.1' \
    --constraint "https://raw.githubusercontent.com/apache/airflow/constraints-2.8.1/constraints-3.10.txt"
pip install -r dev/requirements.txt

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published