Skip to content

wither/Wikipedia-Scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Wikipedia-Scraper

A Python Wikipedia Scraper using BeautifulSoup4! Wikipedia-Scraper automatically scrapes all text from a specific Wikipedia page and removes all of those unnecessary "[x]" within it.

Installation

Download a .zip file of the repository and extract it where you would like to keep the files.

Install the requirements by running:

pip install -r requirements.txt

Usage

In the console of your choice, run:

cd /path/to/directory
python3 scraper.py

Or if you are on Windows:

cd C:\Users\Path\To\Directory
python scraper.py

Requirements

BeautifulSoup4

Requests

Wikipedia

You can install these requirements by running:

pip install -r requirements.txt

Supported Languages

  • English
  • Dutch
  • French
  • Spanish
  • Italian
  • Swedish
  • Portuguese

To Do

  1. Fix Bugs
  2. Optimize
  3. GUI
  4. Tables and Images

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to update tests as appropriate.

License

MIT

About

A Python Wikipedia Scraper using BeautifulSoup4!

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages