EmailScraper

A scrapy script to spider a website and scrape all emails using a regex. EmailScraper outputs the email and the url it was found in JSON format. The output is generated as the website is spidered and does not contain duplicates.

Requirements

Scrapy

pip install scrapy

Usage

Scrape all emails from example.com and save the output to emails.json, and only print status of spider (not every GET request).

scrapy runspider EmailScraper.py -a url=http://example.com/ -o emails.json -L INFO

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
EmailScraper.py		EmailScraper.py
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EmailScraper

Requirements

Usage

License

About

Releases

Packages

Languages

License

TheKevinWang/EmailScraper

Folders and files

Latest commit

History

Repository files navigation

EmailScraper

Requirements

Usage

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages