recrm edited this page Jan 19, 2015 · 1 revision

imageboard-scraper.py

imageboard-scraper.py is a simple script for interacting with imageboards based on the 4chan API. Running the program collects all posts made since the script was last run. If the script has not been run before, it collects all current posts.
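The "since the script was last run" behaviour can be pictured with a small sketch. This is not the script's actual code: the state file and the helper names `load_last_seen`, `select_new_posts`, and `save_last_seen` are hypothetical. It assumes only that each post carries its post number in the `no` field, as in the 4chan API's JSON.

```python
import json
from pathlib import Path


def load_last_seen(state_file):
    """Return the highest post number stored by a previous run, or 0 on a first run."""
    path = Path(state_file)
    if not path.exists():
        return 0  # no earlier run: every current post counts as new
    return json.loads(path.read_text())["last_post_no"]


def select_new_posts(posts, last_seen):
    """Keep only the posts whose number is greater than the last one seen."""
    return [post for post in posts if post["no"] > last_seen]


def save_last_seen(state_file, posts, last_seen):
    """Record the highest post number seen so far for the next run."""
    newest = max((post["no"] for post in posts), default=last_seen)
    state = {"last_post_no": max(newest, last_seen)}
    Path(state_file).write_text(json.dumps(state))
```

On a first run `load_last_seen` returns 0, so `select_new_posts` keeps everything; later runs keep only posts numbered above the stored value.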


Basic Usage


python3 imageboard-scraper.py trv

Running the script with only a board name downloads all posts on that board. The command above downloads all posts on the 4chan /trv/ (travel) board.
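For orientation, here is a rough sketch of how a board name maps onto the 4chan read-only JSON API. How the script itself builds its requests is not documented on this page, so the helper names `catalog_url`, `thread_url`, and `thread_numbers` are illustrative assumptions. The endpoints are the public `a.4cdn.org` catalog and thread URLs.

```python
API_BASE = "https://a.4cdn.org"  # host of 4chan's read-only JSON API


def catalog_url(board):
    """URL of the catalog listing every live thread on a board."""
    return f"{API_BASE}/{board}/catalog.json"


def thread_url(board, thread_no):
    """URL of the JSON document holding all posts in one thread."""
    return f"{API_BASE}/{board}/thread/{thread_no}.json"


def thread_numbers(catalog):
    """Flatten a parsed catalog (a list of pages, each with a 'threads' list)
    into a flat list of thread numbers."""
    return [thread["no"] for page in catalog for thread in page["threads"]]
```

For the `trv` board, `catalog_url("trv")` gives `https://a.4cdn.org/trv/catalog.json`. Each number returned by `thread_numbers` can then be passed to `thread_url` to fetch that thread's posts.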


Other Arguments


  • -h

    • Outputs the command line help screen.
    • example: python3 imageboard-scraper.py -h
  • -output

    • Changes the directory the scraped posts will be saved to.
    • example: python3 imageboard-scraper.py -output /path/to/save/folder
    • (Will save results in /path/to/save/folder)
  • -image

    • Boolean flag. If set, images are downloaded as well.
    • Images are stored in an images folder inside the output folder.
  • -url

    • Changes the internal URLs of the website to scrape.
    • Currently only accepts '4chan' or '8chan'.
    • Defaults to '4chan'.
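
The options above could be declared with `argparse` roughly as follows. This is a minimal sketch, not the script's actual source: the positional name `board`, the default output directory `.`, and the helper `image_dir` are assumptions made for illustration. `-h` is provided by `argparse` automatically.

```python
import argparse
from pathlib import Path


def build_parser():
    """Build a parser mirroring the documented options (a sketch, not the real source)."""
    parser = argparse.ArgumentParser(
        prog="imageboard-scraper.py",
        description="Collect posts from a 4chan-API imageboard.",
    )
    # Assumed positional name; the wiki only shows the board passed as a bare argument.
    parser.add_argument("board", help="board to scrape, e.g. trv")
    # Assumed default: the current directory. The real default is not documented here.
    parser.add_argument("-output", default=".", help="directory to save scraped posts in")
    parser.add_argument("-image", action="store_true", help="also download images")
    parser.add_argument("-url", choices=["4chan", "8chan"], default="4chan",
                        help="which site's URLs to use")
    return parser


def image_dir(args):
    """Images go in an 'images' folder inside the output folder."""
    return Path(args.output) / "images"


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args.board, args.url, args.image, image_dir(args))
```

Using `action="store_true"` makes `-image` a flag that takes no value, and `choices` rejects any `-url` value other than '4chan' or '8chan'.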