Skip to content

smallela1/stormcrawlerfight

 
 

Repository files navigation

stormcrawlerfight

Benchmarking of StormCrawler with different versions, backends or alternative crawlers.

With Storm installed, you can generate an uberjar:

mvn clean package

and then use the following command to inject URLs into the topology

storm jar target/storm-crawler-fight-2.0-SNAPSHOT.jar  org.apache.storm.flux.Flux --local es-injector.flux

then

storm jar target/storm-crawler-fight-2.0-SNAPSHOT.jar  org.apache.storm.flux.Flux --local es-crawler.flux

Replace '--local' with '--remote' to deploy it on a running Storm cluster.

About

Crawl configuration for benchmarking StormCrawler

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • FLUX 51.3%
  • Shell 48.7%