A Python Wikipedia Scraper using BeautifulSoup4! Wikipedia-Scraper automatically scrapes all text from a specific Wikipedia page and removes all of those unnecessary "[x]" within it.
Download a .zip file of the repository and extract it where you would like to keep the files.
Install the requirements by running:
pip install -r requirements.txt
In the console of your choice, run:
cd /path/to/directory
python3 scraper.py
Or if you are on Windows:
cd C:\Users\Path\To\Directory
python scraper.py
BeautifulSoup4
Requests
Wikipedia
You can install these requirements by running:
pip install -r requirements.txt
- English
- Dutch
- French
- Spanish
- Italian
- Swedish
- Portuguese
- Fix Bugs
- Optimize
- GUI
- Tables and Images
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.
Please make sure to update tests as appropriate.