DomainSpiderSE

 一款基于国内两大SEO搜索引擎爱站和站长之家的子域名爬取工具。

环境

安装依赖：

 pip install -r requirement.txt

使用selenium模块依赖chromedriver，根据自己Chrome版本下载对应版本。

chromedriver国内镜像下载地址：http://npm.taobao.org/mirrors/chromedriver/

下载完后解压，将chromedriver.exe文件复制到Chrome的Google/Chrome/Application目录下和Python的安装目录。

用法

python spiderSE.py -h

************************************************************
        github：https://github.com/ltfafei
          CSDN: afei00123.blog.csdn.net
               公众号：网络运维渗透

************************************************************

usage: spiderSE.py [-h] -d URL -f FILES [--option {a,z}]

SearchEngine domain spider Script

optional arguments:
  -h, --help            show this help message and exit
  -d URL, --url URL     please input master domain. eg: xxx.com
  -f FILES, --files FILES
                        Please input url_file path to save. eg: c:\urls.txt
  --option {a,z}        --option a：使用爱站爬虫；--option z：使用站长之家爬虫（默认使用站长之家爬虫）

python spiderSE.py -d xxx.cn -f urls.txt

python spiderSE.py -d xxx.edu.cn -f urls1.txt --option a

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.idea		.idea
__pycache__		__pycache__
README.md		README.md
aiStand_domainCrawl.py		aiStand_domainCrawl.py
requirement.txt		requirement.txt
spiderSE.py		spiderSE.py
webMaster_domainCrawl.py		webMaster_domainCrawl.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DomainSpiderSE

环境

用法

About

Releases

Packages

Languages

ltfafei/DomainSpiderSE

Folders and files

Latest commit

History

Repository files navigation

DomainSpiderSE

环境

用法

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages