Skip to content

Latest commit

 

History

History
17 lines (10 loc) · 538 Bytes

README.md

File metadata and controls

17 lines (10 loc) · 538 Bytes

The python script to extract domain names from a URL list, while ensuring the TLD being intact.

It'll strip any sub-domain or path from the URL and creates a new file with the unique domain list.

The script required the TLDextract library by John, for Python 3. More on the library and details at https://github.com/john-kurkowski/tldextract

Command to install: pip install tldextract

Usage:

  • $chmod +x domain.py
  • $./domain.py <input_file> <output_file>

Notes:

  • The input files must contain one URL in each line

Enjoy!!