Nicholas Wolf
ORCID 0000-0001-5512-6151
This lesson is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Overview
This class is an overview of Pandas, the Python data analysis library, for those who have never used it or who have made some progress but could use further guidance. Some background in working with tabular data is helpful, but not required. We'll focus on the following goals:
- Understand the building blocks of a Pandas dataframe
- Know how to make a dataframe and how to load it with data
- Filter, select, and perform other common operations needed to focus on a subset of a dataframe
- Update values
- Join and merge tables
- Export a dataframe to a saved file
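As a rough preview of these goals, the sketch below walks through each one on a tiny dataset. The data, column names, and variable names (`items`, `categories`, `cheap`, `merged`) are invented for illustration and do not come from the course notebook. Treat it as one possible way to express each step, not the approach the class will necessarily follow.

```python
import io

import pandas as pd

# Hypothetical sample data, invented purely for illustration.
items = pd.DataFrame({
    "item": ["pen", "notebook", "stapler"],
    "category_id": [1, 1, 2],
    "price": [1.5, 4.0, 12.0],
})
categories = pd.DataFrame({
    "category_id": [1, 2],
    "category": ["writing", "office"],
})

# Building blocks: a dataframe has an index, named columns, and a dtype per column.
print(items.shape)
print(items.dtypes)

# Filtering and selecting: rows with price under 5, keeping two columns.
cheap = items.loc[items["price"] < 5, ["item", "price"]]

# Updating values: change the price of a single matching row.
items.loc[items["item"] == "pen", "price"] = 2.0

# Joins and merges: attach the category name via the shared key column.
merged = items.merge(categories, on="category_id", how="left")

# Exporting: write CSV to an in-memory buffer here so the sketch creates no
# files; passing a file path to to_csv() would save to disk instead.
buffer = io.StringIO()
merged.to_csv(buffer, index=False)
csv_text = buffer.getvalue()
print(csv_text)
```

Using `.loc` for both selection and assignment keeps the row condition and the target column in a single expression, which is one common way to avoid chained-indexing pitfalls.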
Materials
We will work from this Jupyter Notebook, which is available on our course JupyterHub instance at https://tutorials-1.rcnyu.org.
You are welcome to download and deploy this same Notebook in your own local Jupyter environment. To copy the entire set of teaching materials, clone its repository:
git clone https://github.com/NYU-DataServices/startingpandas.git
You can also find this notebook (in non-executable form) at https://nyu-dataservices.github.io/startingpandas.