A Python script that scrapes data about Pokemon from the Serebii.net website. It collects each Pokemon's name, number, image, and stats. The script uses the Python libraries requests and BeautifulSoup to scrape the data from the website, and pandas to store the data in a DataFrame. The gspread library, which works through Google's Drive and Sheets APIs, is then used to upload this information to a Google Sheet, with each sheet named after the information it holds.
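The scrape-and-store step described above can be sketched roughly as follows. This is a minimal illustration, not the script's actual code: the HTML snippet, the column names, and the `parse_pokemon_table` helper are all hypothetical, and the real Serebii.net pages use different markup. In a live run the HTML would come from something like `requests.get(url, timeout=10).text` rather than an inline string.

```python
import pandas as pd
from bs4 import BeautifulSoup

# Hypothetical markup for illustration only; the real Serebii.net pages
# are laid out differently and would be fetched with requests.
SAMPLE_HTML = """
<table>
  <tr><th>No.</th><th>Name</th><th>HP</th><th>Attack</th></tr>
  <tr><td>#001</td><td>Bulbasaur</td><td>45</td><td>49</td></tr>
  <tr><td>#004</td><td>Charmander</td><td>39</td><td>52</td></tr>
</table>
"""


def parse_pokemon_table(html: str) -> pd.DataFrame:
    """Turn a simple HTML table into a DataFrame (hypothetical helper)."""
    soup = BeautifulSoup(html, "html.parser")
    rows = soup.find_all("tr")

    # The first row holds the column headers.
    headers = [th.get_text(strip=True) for th in rows[0].find_all("th")]

    records = []
    for row in rows[1:]:
        cells = [td.get_text(strip=True) for td in row.find_all("td")]
        # Skip rows that do not line up with the header row.
        if len(cells) == len(headers):
            records.append(dict(zip(headers, cells)))

    df = pd.DataFrame(records, columns=headers)

    # Scraped text is always a string, so convert the stat columns to numbers.
    for col in ("HP", "Attack"):
        df[col] = pd.to_numeric(df[col])
    return df


df = parse_pokemon_table(SAMPLE_HTML)
print(df)
```

Keeping the parsing in a function that takes an HTML string makes it easy to test against saved pages without hitting the website on every run.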
To use the web scraper, you need Python 3 installed on your local machine. Python 3 can be installed from the official Python website.
You will also need to install the Python libraries requests, beautifulsoup4, pandas, and gspread. The libraries can be installed by running the following command in a terminal:
pip install requests beautifulsoup4 pandas gspread
First, follow the instructions for Google Drive's API to obtain the credentials. After obtaining the credentials, create a Google Sheet with the desired file name, and change the sheet_name variable in upload.py to match the file name. Open the Google Sheet and share it with the email address listed in the credentials. Finally, run upload.py to populate the Google Sheet with the data.
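For orientation, the upload step might look something like the sketch below. It is not the contents of upload.py. It assumes the credentials are a service account JSON key file, and the function names, the `credentials.json` path, and the worksheet title are hypothetical placeholders.

```python
import pandas as pd


def dataframe_to_rows(df: pd.DataFrame) -> list:
    """Convert a DataFrame to a header row plus data rows (hypothetical helper)."""
    # astype(object) turns NumPy scalars into plain Python values,
    # which can be serialized to JSON for the Sheets API.
    return [df.columns.tolist()] + df.astype(object).values.tolist()


def upload_dataframe(df, sheet_name, worksheet_title, credentials_path):
    """Write a DataFrame to one worksheet of an existing Google Sheet."""
    # Imported here so the helper above can be used without gspread installed.
    import gspread

    # Assumption: credentials_path points to a service account JSON key file.
    client = gspread.service_account(filename=credentials_path)

    # The Google Sheet must already exist and be shared with the service account.
    spreadsheet = client.open(sheet_name)

    try:
        worksheet = spreadsheet.worksheet(worksheet_title)
    except gspread.exceptions.WorksheetNotFound:
        worksheet = spreadsheet.add_worksheet(
            title=worksheet_title, rows=len(df) + 1, cols=len(df.columns)
        )

    # Replace whatever is on the worksheet with the current data.
    worksheet.clear()
    worksheet.update(dataframe_to_rows(df))


# Example call (placeholder names, needs valid credentials and network access):
# stats = pd.DataFrame({"Name": ["Bulbasaur"], "HP": [45]})
# upload_dataframe(stats, "My Pokemon Sheet", "Stats", "credentials.json")
```

If `client.open` raises `SpreadsheetNotFound`, the usual cause is that the Google Sheet has not been shared with the credentials' email address, or that `sheet_name` does not match the file name exactly.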
If you would like to contribute to this project, feel free to fork the repository and submit a pull request.
Thanks to Serebii.net for providing the data used in this project.