Skip to content

Latest commit

 

History

History
36 lines (23 loc) · 1.53 KB

README.md

File metadata and controls

36 lines (23 loc) · 1.53 KB

Hadoop Essentials (Hortonworks Sandbox HDP 2.6.1)

This GitHub project stores content related to the Hadoop Essentials course offering from Hortonworks University which is available for cost in an instructor-led format and FREE for self-paced students.

Status: Complete

Approach & Setup

The Hadoop Essentials course uses demonstrations instead of hands-on labs due to the short duration of the offering. That said, the demos are closely aligned with the publicly available tutorials.

Additionally, to allow participants to recreate the demos performed during the course, the Hortonworks Sandbox is utilized. See Sandbox Setup for specific setup and configuration details regarding this course.

The target audience for this repo is the instructors themselves to provide them with guidance for presenting these demos to a live audience, but all are welcome to utilize and feedback (and fixes via pull requests) is surely appreciated.

The Demonstrations

Operational Overview with Ambari

Loading Data into HDFS

Data Manipulation with Hive

Risk Analysis with Pig

Risk Analysis with Spark

Securing Hive with Ranger