MGHTextbookScraper

A McGrawHill (MGH) Textbook Scraper that is made in python.

Requirements:

Python (Tested on 3.9, should work on anything recent)
McGrawHill Cookies

Getting Started:

Clone project

user@User-Machine~$ git clone https://github.com/Sendeky/MGHTextbookScraper
user@User-Machine~$ cd MGHTextbookScraper

Install requirements

user@User-Machine~$ pip install requirements.txt

Getting cookies from McGrawHill:

Navigate and open your textbook
Open your browser's Web Inspector (Ctrl+Shift+I for Chrome and most browsers)
Find a link that starts with "epub-factory-cdn.mheducation.com" (Ctrl+F to open find menu)
Open the link

There are 3 cookies that are necessary for this to work

Click on the Cookies Tab of the Web Inspector
Get 3 Cookies:
*CloudFront-Policy
*CloudFront-Signature
*CloudFront-Key-Pair-Id

Congrats! Now put all 3 cookies into the cookies.txt file

Running the Scraper:

It's super simple! Just run the main file like this

user@User-Machine~$ python TextbookScraperProject.py

The retrieved text will be in the project folder in data.json

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.gitignore		.gitignore
README.md		README.md
TextbookScraperProject.py		TextbookScraperProject.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MGHTextbookScraper

Requirements:

Getting Started:

Getting cookies from McGrawHill:

Congrats! Now put all 3 cookies into the cookies.txt file

Running the Scraper:

About

Releases

Packages

Languages

Sendeky/MGHTextbookScraper

Folders and files

Latest commit

History

Repository files navigation

MGHTextbookScraper

Requirements:

Getting Started:

Getting cookies from McGrawHill:

Congrats! Now put all 3 cookies into the cookies.txt file

Running the Scraper:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages