Skip to content

Mag1ck/AWS_YouTube_Data_Engineering_Analysis_Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

AWS YouTube Data Engineering Analysis Project

Project Overview

This project demonstrates how to build a data engineering pipeline using AWS services to analyze YouTube data. The pipeline includes data ingestion, transformation, visualization, and automation. The goal is to provide insights into YouTube metrics and trends through interactive dashboards.

Features

  • Data Ingestion: Collect data from various YouTube sources using AWS services.
  • Data Transformation: ETL processes to clean and preprocess the data.
  • Data Visualization: Interactive dashboards and visualizations created with AWS QuickSight.
  • Automation: Scheduled pipelines for continuous data updates and analysis.

Usage

  • Data Ingestion: Upload your data sources to the specified S3 bucket.
  • Data Transformation: The ETL process will be executed to clean and preprocess the data. Monitor AWS Glue jobs for status and logs.
  • Data Visualization: Access AWS QuickSight to view and interact with the dashboards.

Data Sources

https://www.kaggle.com

Technologies Used

  • AWS Athena: Query service for analyzing data.
  • AWS Lambda: Serverless compute for data processing tasks.
  • AWS Glue: ETL service for data preparation.
  • AWS S3: Storage service for data storage.
  • AWS QuickSight: Data visualization service for creating dashboards.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published