scrape.sh

Wrapper for https://github.com/lawzava/scrape that scrapes a TSV (tab-separated values) file of URLs and produces TSV output with emails.

Ideal scrape.sh usage

./scrape.sh{php} -f 2 file.tsv options

-f email column number (default: 1)

-v print debug information to stderr

Output to stdout:

email     domain    website
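The column selection and the three-column output described above can be sketched in shell. This is only an illustration, not the actual contents of scrape.sh: the function names `column_values` and `output_row` are hypothetical, and it assumes that `-f` is a 1-based column index and that `domain` is the part of the email after the `@`.

```shell
#!/usr/bin/env bash
# Hypothetical helpers; names and behavior are assumptions, not taken from scrape.sh.

# Print the values of one column (1-based, default 1) of a TSV file,
# skipping rows where that cell is empty.
column_values() {
  local file="$1" col="${2:-1}"
  awk -F '\t' -v c="$col" '$c != "" { print $c }' "$file"
}

# Print one tab-separated output row: email, domain, website.
# Assumption: the domain is whatever follows the first "@" in the email.
output_row() {
  local email="$1" website="$2"
  printf '%s\t%s\t%s\n' "$email" "${email#*@}" "$website"
}
```

For example, `column_values file.tsv 2` would list the second column of `file.tsv`, which is what `-f 2` selects under the assumption above.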

Forked from https://github.com/lawzava/scrape

CLI utility to scrape emails from websites

Usage

Sample call:

scrape -w https://lawzava.com

If --js is used, chromium or google-chrome must be available in PATH.
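A wrapper script could check for a browser before passing `--js`. The following is a minimal sketch using the standard `command -v` builtin; the function name `first_available` is hypothetical and not part of scrape or scrape.sh.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the first of the given command names that is
# found in PATH and return 0; return 1 if none of them is found.
first_available() {
  local cmd
  for cmd in "$@"; do
    if command -v "$cmd" >/dev/null 2>&1; then
      printf '%s\n' "$cmd"
      return 0
    fi
  done
  return 1
}

# Possible use before enabling --js:
#   first_available chromium google-chrome >/dev/null || echo "no browser in PATH" >&2
```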

Parameters:

          --async             Scrape website pages asynchronously (default true)
      -d, --depth int         Max depth to follow when scraping recursively (default 3)
          --emails            Scrape emails (default true)
          --follow-external   Follow external 3rd party links within website
      -h, --help              help for scrape
          --js                Enables JS execution await
          --debug             Print debug logs
          --recursively       Scrape website recursively (default true)
      -w, --website string    Website to scrape (default "https://lawzava.com")
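As an illustration of combining these parameters, the sketch below runs the scraper once per URL read from stdin, passing `-w` and `-d`. It is not how scrape.sh is necessarily implemented: the function name `scrape_each` and the `SCRAPE_BIN` variable are hypothetical, the latter added only so the loop can be dry-run with `echo`.

```shell
#!/usr/bin/env bash
# Hypothetical loop: call the scraper for every non-empty URL on stdin.
# $1 is the max depth handed to -d (default 3, matching the help text).
# SCRAPE_BIN is an assumed override; it falls back to "scrape" in PATH.
scrape_each() {
  local depth="${1:-3}" url
  while IFS= read -r url; do
    [ -n "$url" ] || continue
    # </dev/null keeps the child process from consuming the URL list.
    "${SCRAPE_BIN:-scrape}" -w "$url" -d "$depth" </dev/null
  done
}
```

A dry run such as `printf 'https://lawzava.com\n' | SCRAPE_BIN=echo scrape_each 2` prints the arguments that would be passed instead of scraping.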
