medusa-crawler 1.0.0.pre.2

Install from the command line:
$ gem install medusa-crawler --version "1.0.0.pre.2"
Install via Gemfile:
source "https://rubygems.pkg.github.com/brutuscat" do
  gem "medusa-crawler", "1.0.0.pre.2"
end

About this version

== Medusa: a Ruby crawler framework {rdoc-image:https://badge.fury.io/rb/medusa-crawler.svg}[https://badge.fury.io/rb/medusa-crawler] rdoc-image:https://github.com/brutuscat/medusa-crawler/workflows/Ruby/badge.svg?event=push

Medusa is a Ruby framework for crawling websites and collecting useful information about the pages it visits. It is versatile, letting you write your own specialized crawling tasks quickly and easily.

=== Features

  • Choose the links to follow on each page with +focus_crawl+
  • Multi-threaded design for high performance
  • Tracks +301+ HTTP redirects
  • Allows exclusion of URLs based on regular expressions
  • HTTPS support
  • Records response time for each page
  • Obeys robots.txt directives (optional, but recommended)
  • In-memory or persistent storage of pages during crawl, provided by Moneta[https://github.com/moneta-rb/moneta]
  • Inherits OpenURI behavior (redirects, automatic charset and encoding detection, proxy configuration options).
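
As a rough illustration of how per-page link selection and regex-based exclusion can combine, here is a plain-Ruby sketch that uses only the standard library. It does not call Medusa's API: the +links_to_follow+ helper, its +skip_patterns+ keyword, and the example URLs are hypothetical names chosen for this sketch, and the block only stands in for the kind of selector you would pass to +focus_crawl+.

```ruby
require 'uri'

# Hypothetical helper, NOT part of Medusa: decides which links found on a page
# should be followed. `skip_patterns` stands in for regex-based URL exclusion;
# the optional block stands in for a focus_crawl-style selector.
def links_to_follow(links, skip_patterns: [])
  candidates = links.map { |link| URI(link) }

  # Drop every link whose path matches one of the exclusion patterns.
  candidates = candidates.reject do |uri|
    skip_patterns.any? { |pattern| uri.path =~ pattern }
  end

  # Let the caller narrow the remaining links further.
  candidates = yield(candidates) if block_given?

  candidates.map(&:to_s)
end

# Example URLs are placeholders for links discovered on a crawled page.
links = %w[
  https://www.example.com/articles/1
  https://www.example.com/articles/2
  https://www.example.com/login
  https://www.example.com/assets/logo.png
]

selected = links_to_follow(links, skip_patterns: [/\.png\z/, %r{\A/login}]) do |uris|
  uris.select { |uri| uri.path.start_with?('/articles/') }
end

puts selected
```

In this sketch exclusion runs before the selector block, so the block never sees links that the patterns have already ruled out. Whether Medusa applies the two in the same order is not stated here and should be checked against the library itself.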

Do you have an idea or a suggestion? {Open an issue and talk about it}[https://github.com/brutuscat/medusa-crawler/issues/new]

Assets

  • medusa-crawler-1.0.0.pre.2-rubygems.gem

Download activity

  • Total downloads 0
  • Last 30 days 0
  • Last week 0
  • Today 0