Skip to content
This repository has been archived by the owner on Jul 24, 2020. It is now read-only.

Latest commit

 

History

History
9 lines (8 loc) · 353 Bytes

README.md

File metadata and controls

9 lines (8 loc) · 353 Bytes

Web Scrape

  • One master server.
  • Multiple "workers" which supply scraping network bandwidth + processing power.
  • How to update worker code?
    • Server sends the before/after parts of the job processing pipeline to the workers.
    • This code is maintained in normal python code.
  • Insert obtained data into database + create more jobs.