Change the repository type filter
All
Repositories list
28 repositories
spruce
PublicEnrichment pipeline for CUR reports which adds energy and carbon data allowing to report and reduce the impact of the your cloud usage.carbonara
Public archiveEnrichment pipeline for CUR / FOCUS reports which adds energy and carbon data allowing to report and reduce the impact of the your cloud usage.benchmark
PublicStormCrawler topology to evaluate the performance of different backends and configurationsdigitalpebble.github.io
Publicstormcrawler-docker
PublicResources for running StormCrawler with Docker servicescrawlurlfrontier
Public archivestorm
Publictika
Publicdocs
Publicansible-storm
Publicnutch
Publicurlfrontier-client
PublicURLFrontier client written in Rust (mostly as a way of learning Rust)TextClassification
PublicA Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and can be used as a front end to various ML algorithms. libSVM and liblinear are currently embedded.stormcrawlerfight
Publicbehemoth
Public archivecrawler-commons
Publicsc-warc
Publictescobank
Public archivebehemoth-commoncrawl
Public archivetika-cc
PublicNutchFight
Publicbehemoth-elasticsearch
Public archivebehemoth-textclassification
Public archiveTextClassificationPlugin
Public archivengrams-api
Public archive