cobweb 1.0.26
Cobweb is a web crawler that can use resque to cluster crawls to quickly crawl extremely large sites which is much more performant than multi-threaded crawlers. It is also a standalone crawler that has a sophisticated statistics monitoring interface to monitor the progress of the crawls.
Gemfile:
=
install:
=
Runtime Dependencies (8):
addressable
~> 2.3
haml
~> 4.0
json
~> 1.8
nokogiri
~> 1.6
redis
~> 3.0
redis-namespace
~> 1.3
sinatra
~> 1.4
slop
~> 3.4