Skip to content
Brandon Holt edited this page Aug 1, 2014 · 94 revisions

Scalding is a Scala library that makes it easy to write MapReduce jobs in Hadoop. It's similar to other MapReduce platforms like Pig and Hive, but offers a higher level of abstraction by leveraging the full power of Scala and the JVM.

Scalding is built on top of Cascading, a Java library that abstracts away much of the complexity of Hadoop (such as the need to write raw map and reduce functions).

Need a suggestion for where to start? Try Scalding REPL: This tutorial teaches you how to interact with your data using the Scalding REPL.

Getting help

Documentation

Third Party Modules

Videos

How-tos

Tutorials

Articles and presentations from around the web

Other

Documentation Todo

Contents

Getting help

Documentation

Matrix API

Third Party Modules

Videos

How-tos

Tutorials

Articles

Other

Clone this wiki locally