Skip to content

This is a "web" crawler. The main feature of this repo is the multi threading implementation for web crawling in python.

Notifications You must be signed in to change notification settings

gauravgpta93/web_crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

web_crawler

This is a "web" crawler. The main feature of this repo is the multi threading implementation for web crawling in python.

This only contains the code for the crawler and not the simulator for the web. TThe requirements.txt give the libraries required.

The crawler class contains the functions responsible for thread pools and the crawling function itself. The constants file under config gives us some parameters to setup the crawler.

About

This is a "web" crawler. The main feature of this repo is the multi threading implementation for web crawling in python.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages