
Releases: yujiosaka/headless-chrome-crawler

1.2.5

03 Jan 13:34
001053d

Added

Changed

  • Make cache a required option for HCCrawler.connect() and HCCrawler.launch().
  • Provide a skipDuplicates option to remember and skip duplicate URLs, instead of passing null to the cache option.
  • Modify BaseCache interface.
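
The idea behind skipDuplicates can be illustrated with a minimal sketch. It does not use the library: DuplicateFilter and accept() are hypothetical names, and an in-memory Set stands in for whatever cache the crawler is configured with.

```javascript
// Hypothetical illustration only; not the library's implementation.
class DuplicateFilter {
  constructor(skipDuplicates = true) {
    this.skipDuplicates = skipDuplicates;
    this.seen = new Set(); // URLs that have already been accepted
  }

  // Returns true if the URL should be requested, false if it is skipped.
  accept(url) {
    if (!this.skipDuplicates) return true;
    if (this.seen.has(url)) return false;
    this.seen.add(url);
    return true;
  }
}

const urls = [
  'https://example.com/',
  'https://example.com/a',
  'https://example.com/', // duplicate of the first entry
];

const filter = new DuplicateFilter(true);
const requested = urls.filter((url) => filter.accept(url));
console.log(requested);
```

With skipDuplicates disabled in this sketch, every URL is accepted, including repeats.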

1.2.4

25 Dec 10:34
934438c

Added

  • Support CSV and JSON Lines formats for exporting results
  • Emit requeststarted, requestskipped, requestfinished, requestfailed, maxdepthreached, maxrequestreached and disconnected events.
  • Improve debug logs by tracing public APIs and events.
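
To show how the two export formats differ, here is a small self-contained sketch. The helpers toJsonLines(), toCsv() and escapeCsv(), and the url and title fields, are assumptions made for illustration; they are not the library's exporter API.

```javascript
// Hypothetical helpers for illustration; not part of the library's API.
function toJsonLines(records) {
  // One JSON document per line, with no enclosing array.
  return records.map((record) => JSON.stringify(record)).join('\n');
}

function escapeCsv(value) {
  const text = String(value);
  // Quote fields containing a comma, quote or line break; double inner quotes.
  return /[",\r\n]/.test(text) ? `"${text.replace(/"/g, '""')}"` : text;
}

function toCsv(records, fields) {
  const header = fields.map(escapeCsv).join(',');
  const rows = records.map((record) =>
    fields.map((field) => escapeCsv(record[field])).join(','));
  return [header, ...rows].join('\n');
}

const results = [
  { url: 'https://example.com/', title: 'Example Domain' },
  { url: 'https://example.com/a', title: 'Hello, "world"' },
];

console.log(toJsonLines(results));
console.log(toCsv(results, ['url', 'title']));
```

JSON Lines keeps nested values intact and can be appended to one record at a time, while CSV needs a fixed list of fields up front.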

Changed

  • Allow onSuccess and evaluatePage options as null.
  • Change crawler.isPaused, crawler.queueSize, crawler.pendingQueueSize and crawler.requestedCount from read-only properties to methods.

Fixed

  • Fix a bug where the maxDepth option was ignored.

1.2.3

17 Dec 05:00
780b4d7

[1.2.3] - 2017-12-17

Changed

  • Refactor by changing the style of requiring the cache directory.

Fixed

  • Fix a bug where more crawlers than maxConcurrency were started when requests failed.

1.2.2

16 Dec 02:07
e7bad81

Added

  • Automatically collect and follow links found in the requested page.
  • Support maxDepth for crawler.queue()'s options.
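
Link following with a depth limit can be sketched as a breadth-first traversal. The example below is not the library's code: the links table is a made-up in-memory graph standing in for real pages, crawl() is a hypothetical function, and counting the start URL as depth 1 is an assumption of this sketch.

```javascript
// Hypothetical in-memory link graph standing in for real pages.
const links = {
  '/': ['/a', '/b'],
  '/a': ['/a/1'],
  '/b': ['/'],
  '/a/1': ['/a/1/x'],
  '/a/1/x': [],
};

// Breadth-first traversal; the start URL counts as depth 1 (an assumption).
function crawl(startUrl, maxDepth) {
  const visited = new Set([startUrl]);
  const queue = [{ url: startUrl, depth: 1 }];
  const order = [];
  while (queue.length > 0) {
    const { url, depth } = queue.shift();
    order.push(url);
    if (depth >= maxDepth) continue; // do not follow links past the limit
    for (const next of links[url] || []) {
      if (!visited.has(next)) {
        visited.add(next);
        queue.push({ url: next, depth: depth + 1 });
      }
    }
  }
  return order;
}

console.log(crawl('/', 2));
console.log(crawl('/', 3));
```

Raising the limit by one lets the traversal reach one more level of links; already visited pages are never queued twice.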

1.2.1

13 Dec 01:46
8ebdb0f

Added

1.2.0

10 Dec 21:57
84300a2

Changed

1.1.2

10 Dec 10:07
2cd6356

Added

1.1.1

09 Dec 04:04

[1.1.1] - 2017-12-09

Added

  • Add CHANGELOG
  • Automatically dismiss dialogs
  • Enrich unit tests

Changed

  • Refactor by separating HCCrawler and Crawler classes
  • Make preparation of pages parallel

1.1.0

07 Dec 18:52
e0e5a7d
  • Refactor by separating HCCrawler and Crawler classes
  • The public API to launch a browser has changed. A browser is now launched with HCCrawler.launch()
  • Rename shouldRequest to preRequest
  • Modify README according to new public API
  • Modify examples according to new public API
  • Support extraHeaders option
  • Refactor handlers for options
  • Add comments in JSDoc style

1.0.0

05 Dec 13:23
9138361