medusa-crawler 1.0.0.pre.2
Install from the command line:
Learn more about RubyGems packages
$ gem install medusa-crawler --version "1.0.0.pre.2"
Install via Gemfile:
source "https://rubygems.pkg.github.com/brutuscat" do
gem "medusa-crawler", "1.0.0.pre.2"
end
About this version
== Medusa: a ruby crawler framework {rdoc-image:https://badge.fury.io/rb/medusa-crawler.svg}[https://badge.fury.io/rb/medusa-crawler] rdoc-image:https://github.com/brutuscat/medusa-crawler/workflows/Ruby/badge.svg?event=push
Medusa is a framework for the ruby language to crawl and collect useful information about the pages it visits. It is versatile, allowing you to write your own specialized tasks quickly and easily.
=== Features
- Choose the links to follow on each page with +focus_crawl+
- Multi-threaded design for high performance
- Tracks +301+ HTTP redirects
- Allows exclusion of URLs based on regular expressions
- HTTPS support
- Records response time for each page
- Obey robots.txt directives (optional, but recommended)
- In-memory or persistent storage of pages during crawl, provided by Moneta[https://github.com/moneta-rb/moneta]
- Inherits OpenURI behavior (redirects, automatic charset and encoding detection, proxy configuration options).
Do you have an idea or a suggestion? {Open an issue and talk about it}[https://github.com/brutuscat/medusa-crawler/issues/new]
Details
- medusa-crawler
- brutuscat
- over 4 years ago
- 3 dependencies
Assets
- medusa-crawler-1.0.0.pre.2-rubygems.gem
Download activity
- Total downloads 0
- Last 30 days 0
- Last week 0
- Today 0