Web scraping and data cleaning practice using the WeRateDogs twitter account. Main report in wrangle_act.html. Tweet data scraped via tweepy and twitter api.
- image-predictions.tsv - raw dataset generated by image predictions algorithm, explained in more detail in report
- image-archives-master.csv - cleaned images dataset
- resources.txt - sources for borrowed code
- tweet-json.txt - raw json data from pulled tweets, included because using the code to pull data takes about 30 minutes
- twitter-archive-enhanced.csv - raw tweet dataset
- twitter-archive_master.csv - cleaned tweet dataset
- wrangle_act.html - open with your browser to read report
- wrangle_act.ipynb - open with jupyter notebook to view source code