Wikipedia-Scraper

A Python Wikipedia scraper built on BeautifulSoup4. Wikipedia-Scraper automatically extracts all text from a given Wikipedia page and strips the bracketed "[x]" markers from it.
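
The marker removal described above can be sketched with a regular expression. This is a minimal illustration, not the code in scraper.py: the function name `strip_citation_markers` is hypothetical, and it assumes the "[x]" markers are bracketed numbers such as [1] or [23].

```python
import re

# Assumption: the markers to remove are bracketed numbers like [1] or [23].
_MARKER = re.compile(r"\[\d+\]")


def strip_citation_markers(text: str) -> str:
    """Return the text with bracketed numeric markers removed."""
    return _MARKER.sub("", text)


if __name__ == "__main__":
    print(strip_citation_markers("Python[1] was released in 1991.[23]"))
```

Other bracketed notes, such as "[citation needed]", would need a broader pattern than the one assumed here.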

Installation

Download a .zip file of the repository and extract it where you would like to keep the files.

Install the requirements by running:

pip install -r requirements.txt

Usage

In the console of your choice, run:

cd /path/to/directory
python3 scraper.py

Or, if you are on Windows:

cd C:\Users\Path\To\Directory
python scraper.py

Requirements

BeautifulSoup4

Requests

Wikipedia

You can install these requirements by running:

pip install -r requirements.txt
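
To confirm the requirements installed correctly, a quick import check may help. The sketch below is not part of the repository; the helper name `missing_modules` is hypothetical. It relies on the import names of the three packages being `bs4` (for BeautifulSoup4), `requests`, and `wikipedia`.

```python
from importlib.util import find_spec

# Import names for the packages listed under Requirements.
REQUIRED_MODULES = ["bs4", "requests", "wikipedia"]


def missing_modules(names):
    """Return the top-level module names that cannot be found."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All requirements are installed.")
```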

Supported Languages

English
Dutch
French
Spanish
Italian
Swedish
Portuguese
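
Each supported language corresponds to its own Wikipedia subdomain. As a rough sketch of how a page address could be built per language (the names `LANGUAGE_CODES` and `page_url` are hypothetical and may not match how scraper.py selects a language):

```python
from urllib.parse import quote

# Wikipedia subdomain codes for the supported languages.
LANGUAGE_CODES = {
    "English": "en",
    "Dutch": "nl",
    "French": "fr",
    "Spanish": "es",
    "Italian": "it",
    "Swedish": "sv",
    "Portuguese": "pt",
}


def page_url(title: str, language: str = "English") -> str:
    """Build the URL of a Wikipedia page in one of the supported languages."""
    code = LANGUAGE_CODES[language]  # raises KeyError for unsupported languages
    # Wikipedia page paths use underscores in place of spaces.
    return f"https://{code}.wikipedia.org/wiki/{quote(title.replace(' ', '_'))}"


if __name__ == "__main__":
    print(page_url("Alan Turing", "Dutch"))
```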

To Do

Fix Bugs
Optimize
GUI
Tables and Images

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to update tests as appropriate.

License

MIT
