Author: Rodrigo Zamith
Version: 1.0
Just edit the 'start_url' variable in downloader.py
and execute the scripts in the following order:
- downloader.py (Downloads each page from the online conference program)
- scraper.py (Scrapes the locally-stored pages)
- longform.py (Converts the data into long form, with each author as a case, which facilitates certain analyses in R)
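
To illustrate what "each author as a case" means in the last step, here is a minimal sketch of a wide-to-long conversion. It is not the code in longform.py: the field names ("title", "authors"), the ";" separator, and the sample records are all assumptions made up for this example.

```python
def to_long_form(rows, author_field="authors", sep=";"):
    """Return one output row per author for each input row.

    `author_field` and `sep` are hypothetical; adjust them to match
    whatever the scraped data actually looks like.
    """
    long_rows = []
    for row in rows:
        # Split the combined author string and drop empty pieces.
        authors = [a.strip() for a in row[author_field].split(sep) if a.strip()]
        for position, author in enumerate(authors, start=1):
            # Copy every other field unchanged onto the per-author row.
            new_row = {k: v for k, v in row.items() if k != author_field}
            new_row["author"] = author
            new_row["author_order"] = position
            long_rows.append(new_row)
    return long_rows


# Made-up sample records, one row per paper (wide form).
papers = [
    {"title": "Paper A", "authors": "Ada Smith; Ben Jones"},
    {"title": "Paper B", "authors": "Cara Lee"},
]

long_rows = to_long_form(papers)
for r in long_rows:
    print(r["title"], r["author_order"], r["author"])
```

A paper with two authors becomes two rows that share the same title, so the author, rather than the paper, is the unit of analysis once the data is loaded into R.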
These scripts require Python 2 (urllib2 is part of the Python 2 standard library) and the BeautifulSoup library.
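
One quick way to check the requirements before running anything is to test whether the modules can be imported. The sketch below is an illustration only; the import names "urllib2" and "BeautifulSoup" are assumptions (newer BeautifulSoup releases are imported as "bs4"), so change the list to match your installation.

```python
import importlib


def missing_modules(names):
    """Return the subset of `names` that cannot be imported."""
    missing = []
    for name in names:
        try:
            importlib.import_module(name)
        except ImportError:
            missing.append(name)
    return missing


# Assumed import names; replace "BeautifulSoup" with "bs4" if that is
# the version you have installed.
required = ["urllib2", "BeautifulSoup"]
print(missing_modules(required))
```

An empty list means every module in `required` was found by the interpreter that ran the check.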
These scripts are licensed under the Mozilla Public License Version 2.0 (see LICENSE file in root folder). TL;DR: feel free to use them commercially, modify them, and distribute them, provided you disclose both the source code and any modifications you make to it. Attribution, where appropriate, is appreciated.