##Hadoop Tutorials
This repo contains a set of tutorials for Hadoop designed to work inside or alongside the Hortonworks Sandbox. Sandbox is a single-node Hadoop cluster that runs in a Virtual Machine (Virtual Box, VMWare, Hyper-V).
Please feel free to recommend edits to existing tutorials, and contribute your own. New contributions should be placed in the 'Community' folder.
Current tutorials are as follows:
###Sandbox Tutorials (Embedded in Sandbox)
- 1 - Hello World - An Overview of Sandbox with HCatalog, Hive and Pig
- 2 - Data Processing with Pig - Processing Baseball Stats
- 3 - Data Processing with Hive - Processing Baseball Stats
- 4 - HCatalog, Basic Pig and Hive Commands
- 5 - Using Basic Pig Commands
- 6 - Loading Data into the Hadoop
- 7 - Installing the Hortonworks ODBC driver on Windows 7
- 8 - Using Excel 2013 to Access Hadoop Data
- 9 - Using Excel 2013 to Analyze Hadoop Data
- 10 - Visualizing Website Clickstream Data
- 11 - Installing the Hortonworks ODBC driver on Mac OS X
- 12 - Refining and Visualizing Server Log Data
###Community Tutorials
- Your turn!