Selenium Scraping Framework

Browsers

Allows crawling/scraping through any of the major browsers (dependencies and drivers are downloaded on-the-fly): Firefox, Chrome, Edge, Safari, Opera, Internet Explorer.

Crawling

Implements generic crawlers that can be extended to retrieve any kind of crawl frontier (e.g FolderCrawler, WebCrawler, PageRankCrawler and multi-threaded counterparts).

Scraping

Creates an executable graph that allows transforming data the same way ETL frameworks do. Control Flow tasks are used to chain execution of tasks, while Data Flow tasks handle data transformation and multi-pipelining.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.settings		.settings
src		src
.classpath		.classpath
.gitignore		.gitignore
.project		.project
ReadMe.md		ReadMe.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Selenium Scraping Framework

Browsers

Crawling

Scraping

About

Releases

Packages

Languages

54754N4/SSF

Folders and files

Latest commit

History

Repository files navigation

Selenium Scraping Framework

Browsers

Crawling

Scraping

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages