Skip to content

Simple Python library to scrape email addresses from HTML

License

Notifications You must be signed in to change notification settings

kichik/email-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Python Module for Scraping Email Addresses from HTML

The email_scraper module provides a simple method that extracts email addresses from HTML. It is able to find emails in plain text, links, atob() obfuscation and HTML entities obfuscation.

Available on PyPI.

Usage

>>> from email_scraper import scrape_emails
>>> scrape_emails('<html><body><a href="mailto:hello@world.com">email me</a></body></html>')
{'hello@world.com'}
>>> scarpe_emails('<a href="javascript:window.location.href=atob(\'bWFpbHRvOmVtYWlsQGV4YW1wbGUuY29t\')">E-Mail</a>')
{'email@example.com'}