Song Play Analysis

This project is a data analysis for songs dataset pertaining a startup called Sparkify. The analytics particularly concern the songs in addition to the user activity on Sparkify new music streaming app. Sparkify aims to understand what songs users are listening to. To fulfill this purpose their data needs to be modeled in order to easily querying the data.

In this project:

Converting raw data into well structured data warehouse.
Creating a database schema of Facts and Dimensions tables.
Building the ETL pipelines.

Prerequisites:

Python3

How to run it:

Run create_tables.py.
Run etl.py.
Start tweaking etl.ipynd if you like.

The project contains:

Test.ipynb > to check the update on the warehouse.
sql_queries.py > contians basic (Drop tables, Create tables, Insert and Select) quieries.
etl.py > etl pipelines.
etl.ipynb > nontebook for the same goal as etl.py except it gives a closer look to the data and not considering the whole process.
Create_tables.py > where connections to database and queries execution take place.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
create_tables.py		create_tables.py
etl.ipynb		etl.ipynb
etl.py		etl.py
sql_queries.py		sql_queries.py
test (1).ipynb		test (1).ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Song Play Analysis

In this project:

Prerequisites:

How to run it:

The project contains:

About

Releases

Packages

Languages

deema-A/Song-Play-Analysis

Folders and files

Latest commit

History

Repository files navigation

Song Play Analysis

In this project:

Prerequisites:

How to run it:

The project contains:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages