WebArchiver

Downloads the HTML of a URL provided at runtime, then parses that HTML, extracts the embedded links, and downloads the HTML of each linked page.
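The link-extraction step might look something like the sketch below. The README does not say how web_archiver.py parses links, so this uses the standard library's `html.parser` as an assumption; `LinkExtractor` and `extract_links` are hypothetical names, and the HTML is a hard-coded sample rather than a live download.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects href values from <a> tags (hypothetical helper, not from the repo)."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links against the page they were found on.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Return absolute URLs for every <a href> found in html."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    return parser.links


# Sample markup standing in for a downloaded page.
sample = '<a href="/about">About</a> <a href="https://docs.python.org/3/">Docs</a>'
print(extract_links(sample, "https://python.org/"))
```

Resolving each href with `urljoin` means relative links such as `/about` can be fetched the same way as absolute ones.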

The requests library is used to GET the HTML from which embedded links are parsed.
Each URL is normalised by removing the scheme (http:// or https://) and replacing every non-alphanumeric character with _, e.g. https://python.org/ => python_org_
The main URL and embedded URLs are saved as .html files, and a lookup.json is produced so the original URLs can be reconstructed from the normalised filenames.
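As a rough illustration of the normalisation and lookup steps, the sketch below reproduces the python.org example. `normalise_url` and `build_lookup` are hypothetical names, and the shape of the lookup (filename mapped to original URL) is an assumption about what lookup.json contains; the actual code in web_archiver.py may differ.

```python
import json
import re


def normalise_url(url):
    """Drop the http(s):// scheme, then replace each non-alphanumeric char with _."""
    without_scheme = re.sub(r"^https?://", "", url)
    return re.sub(r"[^A-Za-z0-9]", "_", without_scheme)


def build_lookup(urls):
    """Map each normalised .html filename back to its original URL (assumed layout)."""
    return {normalise_url(url) + ".html": url for url in urls}


lookup = build_lookup(["https://python.org/", "https://docs.python.org/3/"])

# In the real tool this would presumably be written to lookup.json;
# here it is only printed so the sketch has no side effects.
print(json.dumps(lookup, indent=2))
```

A lookup file is needed because normalisation is lossy: `.`, `/`, and `-` all become `_`, so the original URL cannot be recovered from the filename alone.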
