Skip to content

This project uses news, social media and polling data to get an idea about the public opinions on upcoming US presidential elections 2024 using Hadoop, MapReduce, Hive and Tableau

Notifications You must be signed in to change notification settings

rahul-m-patel/US_Presidential_elections_analytics

Repository files navigation

US Presidential elections analytics

This project uses news, social media and polling data to get an idea about the public opinions on upcoming US presidential elections 2024 using Hadoop, MapReduce, Hive and Tableau

Data Collection

We used Google News API, Reddit API and FiveThirtyEight.com to collect data related to US presidential elections

Data Cleaning using Mapreduce

We used Hadoop and MapReduce programs to explore, clean and profile our datasets

Sentiment analysis

We used GPT 3.5 API to get public opinions on political parties and candidates, and got a score between 0 to 1 which corresponds to favorability of particular party or the candidate

Hive Querying

We used Hive to get aggregated scores of all the news articles and social media posts, grouped by day and visualized our results using Tableau

About

This project uses news, social media and polling data to get an idea about the public opinions on upcoming US presidential elections 2024 using Hadoop, MapReduce, Hive and Tableau

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published