Skip to content

Data Wrangling Project using tweet archive of an user @dog_rates with Python (Numpy, Pandas, Matplotlib)

Notifications You must be signed in to change notification settings

AsunaMasuda/data_wrangling_twitter

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 

Repository files navigation

Wrangle and Analyze Twitter Data

This repository holds a Jupyter notebook where I performed data cleaning, analysing and visualization on tweet data of @dog_rates also known as WeRateDogs, with Python.

Overview

WeRateDogs is a Twitter account that rates people's dogs with a humorous comment about the dog. These ratings almost always have a denominator of 10. The numerators, though, almost always greater than 10. 11/10, 12/10, 13/10, etc. Why? Because "they're good dogs Brent." WeRateDogs has over 4 million followers and has received international media coverage.

Description

I analysed this tweet archive data by following steps.

  1. Using python and Pandas, I read csv, tsv and json files which contain the archives of @dog_rates account. Next, I accessed each dataframes and defined the data's quality and tidiness issues.
  2. I clearned those quality and tidiness issues using python and Pandas for further analysis, removing unneeded columns/rows and wrongly captured data in the wrangling process, and tyding up by combining those three dataframes.
  3. Using matplotlib and python, I conducted exploratory and explanatory data analysis.

About

Data Wrangling Project using tweet archive of an user @dog_rates with Python (Numpy, Pandas, Matplotlib)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published