This project uses news, social media and polling data to get an idea about the public opinions on upcoming US presidential elections 2024 using Hadoop, MapReduce, Hive and Tableau
We used Google News API, Reddit API and FiveThirtyEight.com to collect data related to US presidential elections
We used Hadoop and MapReduce programs to explore, clean and profile our datasets
We used GPT 3.5 API to get public opinions on political parties and candidates, and got a score between 0 to 1 which corresponds to favorability of particular party or the candidate
We used Hive to get aggregated scores of all the news articles and social media posts, grouped by day and visualized our results using Tableau