This repository has been archived by the owner on Jul 24, 2020. It is now read-only.

Web Scrape

One master server.
Multiple "workers", which supply the network bandwidth and processing power for scraping.
How to update worker code?
- Server sends the before/after parts of the job processing pipeline to the workers.
- This code is maintained as ordinary Python code.
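
One way the update mechanism could look, as a rough sketch only: the server ships the "before" and "after" stages as Python source text, and the worker compiles them and runs them around its own fetch step. The names `load_stage`, `run_job`, `before`, `after`, and `fake_fetch` are hypothetical and do not come from this repository, and the transport between server and worker is left out.

```python
# Hypothetical sketch: stages arrive from the server as plain Python source.
# In a real setup these strings would come over the network, not be constants.

BEFORE_SRC = '''
def before(job):
    # Prepare the request, e.g. strip stray whitespace from the URL.
    return {"url": job["url"].strip()}
'''

AFTER_SRC = '''
def after(job, body):
    # Turn the raw response into a result record.
    return {"url": job["url"], "length": len(body)}
'''


def load_stage(source, name):
    """Compile source sent by the server and return the function called `name`."""
    namespace = {}
    exec(source, namespace)  # only acceptable if the server is fully trusted
    return namespace[name]


def run_job(job, fetch, before_src=BEFORE_SRC, after_src=AFTER_SRC):
    """Run one job: server-supplied before stage, local fetch, server-supplied after stage."""
    before = load_stage(before_src, "before")
    after = load_stage(after_src, "after")
    request = before(job)
    body = fetch(request["url"])
    return after(request, body)


def fake_fetch(url):
    # Stand-in for the real download step, which these notes do not describe.
    return "<html>hello</html>"


result = run_job({"url": " http://example.com/ "}, fake_fetch)
```

Because the stages are reloaded from source on each job, a worker would pick up new pipeline code without being redeployed. Running `exec` on received code assumes the workers trust the server completely.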
Insert the obtained data into the database and create more jobs.
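
A minimal sketch of that last step, assuming SQLite and a made-up schema (the notes do not say which database or tables are used). The `pages` and `jobs` tables and the functions `init_db`, `store_result`, and `pending_jobs` are illustrative names only.

```python
import sqlite3


def init_db(conn):
    """Create the two hypothetical tables: scraped pages and the job queue."""
    conn.execute("CREATE TABLE IF NOT EXISTS pages (url TEXT PRIMARY KEY, body TEXT)")
    conn.execute(
        "CREATE TABLE IF NOT EXISTS jobs (url TEXT PRIMARY KEY, done INTEGER DEFAULT 0)"
    )


def store_result(conn, url, body, links):
    """Save one scraped page, mark its job done, and queue a job per new link."""
    with conn:  # single transaction: commits on success, rolls back on error
        conn.execute(
            "INSERT OR REPLACE INTO pages (url, body) VALUES (?, ?)", (url, body)
        )
        conn.execute("INSERT OR IGNORE INTO jobs (url) VALUES (?)", (url,))
        conn.execute("UPDATE jobs SET done = 1 WHERE url = ?", (url,))
        # OR IGNORE skips URLs already queued or already scraped.
        conn.executemany(
            "INSERT OR IGNORE INTO jobs (url) VALUES (?)", [(link,) for link in links]
        )


def pending_jobs(conn):
    """Return the URLs that still need to be scraped, in alphabetical order."""
    rows = conn.execute("SELECT url FROM jobs WHERE done = 0 ORDER BY url")
    return [row[0] for row in rows]


conn = sqlite3.connect(":memory:")
init_db(conn)
store_result(
    conn,
    "http://example.com/",
    "<html></html>",
    ["http://example.com/a", "http://example.com/b"],
)
```

Using the URL as the primary key of `jobs` keeps the same page from being queued twice when several pages link to it.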