Skip to content

Latest commit

 

History

History
12 lines (9 loc) · 399 Bytes

README.md

File metadata and controls

12 lines (9 loc) · 399 Bytes

Vietnamese news crawler

Simple crawler utilizing Scrapy library to extract articles from Vietnamese news websites given urls.

Currently support:

How to run

On the command line, navigate to the respective subfolder

  • kenh14_crawler: type scrapy crawl kenh14_content_crawler
  • soha_crawler: type scrapy crawl soha_content_crawler