Skip to content

Commit

Permalink
Update 05c-python_pandas.md
Browse files Browse the repository at this point in the history
Updated
  • Loading branch information
igabr authored Dec 11, 2017
1 parent 807f76f commit 4807554
Showing 1 changed file with 6 additions and 14 deletions.
20 changes: 6 additions & 14 deletions 05c-python_pandas.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,6 @@
# Python: Introduction to Pandas

## Objective
This section of the Python pre-work is intended to be an introduction to Pandas. Explore the "Pandas Resources" links below for more advanced options.
This section should also be completed on HackerRank. The information below will help give you a surface level introduction to [Pandas](https://pandas.pydata.org/). A widely used Data Science library in Python.

## Introduction
### What is Pandas?
Expand All @@ -11,7 +10,7 @@ Pandas is a Python library for data manipulation and analysis. In particular, it
Pandas provides fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python.

### Pandas - the name
The name is derived from the term "Panel data", an econometrics term for multidimensional structured data sets.
The name is derived from the term "Panel data", an econometric term for multidimensional structured data sets.

### What is a Pandas data frame?
**DataFrame** is a 2-dimensional labeled data structure with columns of potentially different types. You can think of it like a spreadsheet or SQL table, or a dict of Series objects. It is generally the most commonly used pandas object.
Expand All @@ -26,29 +25,22 @@ The name is derived from the term "Panel data", an econometrics term for multidi

#### [Things in Pandas I Wish I Had Known Sooner](http://nbviewer.jupyter.org/github/rasbt/python_reference/blob/master/tutorials/things_in_pandas.ipynb?utm_source=Python+Weekly+Newsletter&utm_campaign=8416b188e6-Python_Weekly_Issue_176_January_29_2015&utm_medium=email&utm_term=0_9e26887fc5-8416b188e6-312716773#Selecting-NaN-Rows)


---

## :boom: Exercises :boom:
## :boom: Review :boom:

>### Part 1: Review Pandas Example
Review the Jupyter Notebook [`1_pandas_jeopardy_example.ipynb`](python/pandas/1_pandas_jeopardy_example.ipynb) which uses the `jeopardy.csv` data.

>### Part 2: Try Pandas
* Open the Jupyter Notebook [`2_pandas_olive_questions.ipynb`](python/pandas/2_pandas_olive_questions.ipynb) which uses the `olive.csv` data.
* Make a copy of the notebook and name it `3_pandas_olive_answers_myname.ipynb`. (Example: my notebook would be called `3_pandas_olive_answers_reshama.ipynb`).
* Update the header at the top by adding in your name and date.
* Edit this notebook and complete the exercises.

---

## Optional Pandas Topics: To Explore Further
## Useful Pandas Topics: To Explore Further

* `groupby` objects
* applying functions
* indexing
* conditional selecting; filtering
* selecting rows and columns: `.loc`, `.iloc`, `.ix`
* working with missing data: `Null`, `NaN`, `None`
* selecting rows and columns: `.loc`, `.iloc`
* working with missing data: `NaN`, `None`
* sorting
* merge, join

0 comments on commit 4807554

Please sign in to comment.