web_crawler

This is a "web" crawler. The main feature of this repo is the multi threading implementation for web crawling in python.

This only contains the code for the crawler and not the simulator for the web. TThe requirements.txt give the libraries required.

The crawler class contains the functions responsible for thread pools and the crawling function itself. The constants file under config gives us some parameters to setup the crawler.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.idea		.idea
Helper		Helper
config		config
Crawler log		Crawler log
README.md		README.md
crawler.py		crawler.py
crawler_test.py		crawler_test.py
requirements.txt		requirements.txt
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

web_crawler

About

Uh oh!

Releases

Packages

Languages

gauravgpta93/web_crawler

Folders and files

Latest commit

History

Repository files navigation

web_crawler

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages