Skip to content

Very ugly python script to scrape webcomics and create a static html page for easy viewing

Notifications You must be signed in to change notification settings

tuxx/webcomic-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

51 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Description

HTML scraping script that gets some online web comics and displays them on a single page, because i am lazy.

Example of the webpage that is generated

Install

  • Copy web/ files to your webserver folder.
  • edit config.py and set the correct webserver folder.
  • pip install -r requirements.txt
  • python main.py

Todo

Donezell Washington

  • Make all current comics as plugins in the comics/ directory
  • Actually output HTML (currently it just prints the scraped html)
  • Discard old.py
  • Config parser so that OUTPUT_FILE and OUTPUT_DIR are not defined in 2 files.
  • Do not stop when one comic fails to scrape, just ignore that comic.
  • Make instagram scraper for instagram comics

To be done

  • Make it possible to scrape 1 comic for testing purposes
  • Add index generator
  • Check when a comic has been updated and display that (or put it on the top of the page, so you dont have to scroll so much)

General life todo

  • Drink beer
  • Raise hell
  • Pet dogs

Instagram toons

About

Very ugly python script to scrape webcomics and create a static html page for easy viewing

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published