Data Exploration

The Jupyter Notebook is a web application that allows you to create and share documents that contain live code, equations, visualizations and explanatory text. In PNDA, it supports exploration and presentation of data from HDFS and HBase.

This tutorial demonstrates how to use Jupyter to access and interactively explore data using the Jupyter PySpark interpreter. The data samples used in this lab are networking telemetry data and NetFlow data persisted in the PNDA platform.

This section explains how to manage PySpark application dependencies.
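
As a rough illustration of one common way to handle dependencies in PySpark generally (not necessarily the approach PNDA prescribes), pure-Python helper modules can be bundled into a zip archive and shipped to the executors with `SparkContext.addPyFile`. The sketch below uses only the standard library to build the archive. The function name `build_deps_zip` and the paths in the trailing comments are hypothetical, and the commented-out lines assume the notebook kernel already provides a SparkContext as `sc`.

```python
import os
import zipfile


def build_deps_zip(src_dir, zip_path):
    """Zip every .py file under src_dir, keeping paths relative to src_dir.

    Returns the sorted list of archive member names.
    """
    archived = []
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as zf:
        for root, _dirs, files in os.walk(src_dir):
            for name in sorted(files):
                if name.endswith(".py"):
                    full = os.path.join(root, name)
                    rel = os.path.relpath(full, src_dir)
                    zf.write(full, rel)
                    # Zip member names always use forward slashes.
                    archived.append(rel.replace(os.sep, "/"))
    return sorted(archived)


# In a notebook cell (hypothetical paths; assumes `sc` is already defined):
# build_deps_zip("my_libs", "/tmp/my_libs.zip")
# sc.addPyFile("/tmp/my_libs.zip")
```

This only covers pure-Python code. Packages with compiled extensions usually need to be installed on the cluster nodes themselves.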