The OpenRhetoric project aims to analyse the public communication of prominent figures from across the global political spectrum.
The project involves using Python to obtain from reliable sources content such as speechs and articles delivered by these figures and run Natural Language Processing (NLP) algorithms on them.
Figure | Data Source |
---|---|
Joseph Goebbels | Calvin College German Propaganda Archive |
OpenRhetoric is built with Python 3. The following Python libraries are in use and required to run it:
- Requests
- BeautifulSoup
- NLTK
- Matplotlib
- WordCloud, an amazing visualization library written in Python by Andreas Mueller
Every analysed figure has a program which will fetch, analyse and visualize data related to him/her.
- Download repository and unzip it to desired location
- Navigate to unzipped folder and open a terminal window
- Run the
.sh
file related to the figure you want to analyse. For example:
bash filename.sh