Python scripts for downloading comics from readcomicsonline.to and comicextra.net
This script is for ripping comics from readcomicsonline.ru and comicextra.net.
In addition to downloading the images for the selected comic it also processes the downloaded images, crops out the black bar "Read more at" text that some uploaded comic files have and removes any of the commonly used ripper pages (generally added to the end of the comic.)
Once processed, the folder is converted into a .cbz file.
Usage
python3 comic_grabber.py -c [-a / --alt 1 (to search comicextra) | -i / --img (to ignore file cropping process) | -b / --ban (to ignore checking watermark pages)
This does require the installation of Tesseract-OCR if you want to take advantage of the cropping process.
A config file will be created upon first run to set your downloads directory and tesseract-ocr path.
Keep in mind, this is still a work in progress. Would love to add a gui to it or implement some other functions. I am planning on adding the ability to rip from readcomiconline.li as well.
Note -- Add any of the water mark uploader files (such as zone, empire etc.) to the banned folder, I am sure there are plenty more files out there that I have yet to stumble across