
Zeavz/web-scrapper


The application is deployed at https://jenis-web-scrapper.herokuapp.com

The web scraper is wrapped as a REST service for ease of testing.

Use curl to trigger the web crawler:

curl --request GET \
  --url 'https://jenis-web-scrapper.herokuapp.com/scrapping/start?url=https%3A%2F%2Fwww.sedna.com%2F'
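The page to crawl is passed in the url query parameter and must be percent-encoded, as in the curl command above. The following is a minimal sketch of building the same request from Java 11+, assuming only the endpoint shown in this README; the class name ScrapeRequest and its methods are hypothetical and are not part of the repository.

```java
import java.net.URI;
import java.net.URLEncoder;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration; not part of the repository.
class ScrapeRequest {
    // Endpoint taken from the curl command in this README.
    static final String ENDPOINT =
            "https://jenis-web-scrapper.herokuapp.com/scrapping/start";

    // Percent-encodes the target page so it can be carried in the "url" query parameter.
    static String buildUrl(String targetPage) {
        return ENDPOINT + "?url=" + URLEncoder.encode(targetPage, StandardCharsets.UTF_8);
    }

    // Sends the GET request and returns the raw response body as a string.
    static String fetch(String targetPage) throws Exception {
        HttpClient client = HttpClient.newHttpClient();
        HttpRequest request = HttpRequest.newBuilder(URI.create(buildUrl(targetPage)))
                .GET()
                .build();
        return client.send(request, HttpResponse.BodyHandlers.ofString()).body();
    }

    public static void main(String[] args) {
        // Only prints the request URL; call fetch(...) to actually contact the service.
        System.out.println(buildUrl("https://www.sedna.com/"));
    }
}
```

Note that URLEncoder applies form encoding, so a space in the target URL would become "+" rather than "%20"; for the example URL this makes no difference.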

The output has the following shape (the descriptions stand in for the actual values):

{[
  "routes": these are the main routes on the domain of this webpage {
    "assets": This is a list of all assets found on the current route,
    "outboundLinks": These are the links that could be navigated to from the current route
  }
]}

To build the project, run ./mvnw clean install

To run the project, run ./mvnw spring-boot:run
