-
Notifications
You must be signed in to change notification settings - Fork 15
imageboard scraper
recrm edited this page Jan 19, 2015
·
1 revision
Imageboard-scraper.py is a simple script designed to interact with image boards based on the 4chan API. Running the program collects all posts made since the script was last run. If the script has not been run before it collects all current posts.
python3 imageboard-scraper.py trv
Running the script with a single option will download all posts in the associated board. The above command will download all posts in the 4chan /trv (travel) board.
-
-h
- Outputs the command line help screen.
- example: python3 imageboard-scraper.py -h
-
-output
- Changes the directory the scraped posts will be saved to.
- example: python3 imageboard-scraper.py -output /path/to/save/folder
- (Will save results in /path/to/save/folder)
-
-image
- Boolean value, if set will also download images.
- Stores the images in a /images folder inside the output folder.
-
-url
- Changes the internal URL's of the website to scrap.
- Currently only accepts '4chan' or '8chan'.
- Defaults to '4chan'.