__
__ _ _(_ ) __ _ __
/'_ '\/'_' )| | /'__'( '__)
( (_) ( (_| || |( ___| |
'\__ '\__,_(___'\____(_)
( )_) |
\___/' @dwisiswant0
A fast tool to fetch URLs from HTML attributes by crawling. Inspired by a tweet from @omespino, it extracts src, href, url, and action values by evaluating JavaScript through the Chrome DevTools Protocol.
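To make the idea concrete, here is a minimal sketch of the kind of result described above: the unique values of `src`, `href`, `url`, and `action` attributes found in a page. Note the swap in technique: galer evaluates JavaScript in a headless browser through the Chrome DevTools Protocol, whereas this sketch runs a regular expression over a static HTML string so that it needs no browser. The helper name `ExtractAttrURLs` and the sample markup are illustrative assumptions, not part of galer.

```go
package main

import (
	"fmt"
	"regexp"
)

// attrPattern matches a quoted src, href, url, or action attribute.
// The leading \s keeps names such as data-src from matching.
var attrPattern = regexp.MustCompile(`(?i)\s(?:src|href|url|action)\s*=\s*["']([^"']+)["']`)

// ExtractAttrURLs (hypothetical helper) returns the unique attribute values
// in the order they first appear in the HTML.
func ExtractAttrURLs(html string) []string {
	seen := make(map[string]bool)
	var out []string
	for _, m := range attrPattern.FindAllStringSubmatch(html, -1) {
		value := m[1]
		if !seen[value] {
			seen[value] = true
			out = append(out, value)
		}
	}
	return out
}

func main() {
	// Sample markup, made up for illustration.
	page := `<a href="/about">About</a>
<script src="https://cdn.domain.tld/app.js"></script>
<form action="/login"><img data-src="/lazy.png" src="/logo.png"></form>
<a href="/about">About again</a>`

	for _, u := range ExtractAttrURLs(page) {
		fmt.Println(u)
	}
}
```

Unlike a browser-based approach, this static version returns values exactly as written in the markup, and it cannot see attributes that scripts add at runtime.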
Installation is easy. You can download a prebuilt binary from the releases page, unpack it, and run it, or install it with:
▶ (sudo) curl -sSfL https://git.io/galer | sh -s -- -b /usr/local/bin
If you have go1.15+ compiler installed and configured:
▶ GO111MODULE=on go get github.com/dwisiswant0/galer
Or clone the repository and build it manually:
▶ git clone https://github.com/dwisiswant0/galer
▶ cd galer
▶ go build .
▶ (sudo) mv galer /usr/local/bin
In its simplest form, galer can be run with:
▶ galer -u "http://domain.tld"
▶ galer -h
This will display help for the tool. Here are all the switches it supports.
Flag | Description |
---|---|
-u, --url | Target to fetch (a single URL or a file containing a list of URLs) |
-e, --extension | Show only certain extensions (comma-separated, e.g. js,php) |
-c, --concurrency | Concurrency level (default: 50) |
--in-scope | Show only in-scope URLs (same host) |
-o, --output | Save fetched URLs output into file |
-t, --timeout | Maximum time (seconds) allowed for connection (default: 60) |
-s, --silent | Silent mode (suppress errors) |
-v, --verbose | Verbose mode (show error details unless silent mode is enabled) |
-h, --help | Display help |
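The `-e` and `--in-scope` filters in the table above can be pictured with a short sketch. This is not galer's own implementation; it is a minimal illustration, assuming that "extension" means the suffix of the URL path and that "in scope" means an identical host. The helper names `HasExtension` and `InScope` are hypothetical.

```go
package main

import (
	"fmt"
	"net/url"
	"path"
	"strings"
)

// HasExtension (hypothetical helper) reports whether the path of rawURL ends
// with one of the comma-separated extensions, e.g. "js,php".
func HasExtension(rawURL, exts string) bool {
	u, err := url.Parse(rawURL)
	if err != nil {
		return false
	}
	// path.Ext works on the path only, so a query string is ignored.
	ext := strings.TrimPrefix(strings.ToLower(path.Ext(u.Path)), ".")
	if ext == "" {
		return false
	}
	for _, e := range strings.Split(exts, ",") {
		e = strings.TrimPrefix(strings.ToLower(strings.TrimSpace(e)), ".")
		if e == ext {
			return true
		}
	}
	return false
}

// InScope (hypothetical helper) reports whether rawURL has the same host as target.
func InScope(rawURL, target string) bool {
	u, err := url.Parse(rawURL)
	if err != nil {
		return false
	}
	t, err := url.Parse(target)
	if err != nil {
		return false
	}
	return u.Host != "" && strings.EqualFold(u.Host, t.Host)
}

func main() {
	target := "http://domain.tld"
	found := []string{
		"http://domain.tld/static/app.js?v=1",
		"http://domain.tld/index.html",
		"http://cdn.other.tld/lib.js",
	}
	for _, u := range found {
		fmt.Println(u, HasExtension(u, "js,php"), InScope(u, target))
	}
}
```

Parsing with `net/url` rather than matching on the raw string keeps query strings and fragments from being mistaken for part of the extension.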
▶ galer -u "http://domain.tld"
▶ galer -u /path/to/urls.txt
▶ cat urls.txt | galer
If you want to chain it with other tools:
▶ subfinder -d domain.tld -silent | httpx -silent | galer
You can use galer as a library.
▶ go get github.com/dwisiswant0/galer/pkg/galer
For example:
package main

import (
	"fmt"

	"github.com/dwisiswant0/galer/pkg/galer"
)

func main() {
	// Configure the crawler with a timeout.
	cfg := &galer.Config{
		Timeout: 60,
	}
	cfg = galer.New(cfg)

	// Crawl the target and collect the URLs found.
	run, err := cfg.Crawl("https://twitter.com")
	if err != nil {
		panic(err)
	}

	// Print each URL on its own line.
	for _, url := range run {
		fmt.Println(url)
	}
}
- Allow setting extra HTTP headers
- Provide a random User-Agent
- Bypass headless browser
- Add exceptions for specific extensions
If you are still confused or have found a bug, please open an issue. All bug reports are appreciated; some features have not been tested yet due to a lack of free time.
galer is released under the MIT license. See LICENSE for more details.
The current version is 0.0.2 and is still under development.
id_ID
• /gäˈlər/: if you galer, don't sniff the smell, just wash your hands right away, dummy!
- Omar Espino for the idea that inspired this tool!