This repository will contain all of our python prep content. Those individuals wishing to complete our Data Science Boot Camp will need to understand all of the content covered in this repository.
These lectures will present python coding through a series of jupyter notebooks. Lectures have corresponding videos which can be found here, https://www.erdosinstitute.org/programs/asynchronous/python-prep. Each lecture notebook will have two versions:
- An empty version that you can fill in and play around with as desired,
- A "Complete" version that was filled in while recording the lecture video.
You should complete the lectures in the following order:
- Introduction
- My First jupyter notebook
- Basic Data Types
- Strings
- More Complicated Data Types
- Shallow and Deep Copies
- Conditionals and Loops
- Writing Functions
- Reading and Writing to File
- Importing a Module or Package
- Basic numpy
- Basic pandas
- Basic Plotting
- Data File Types
- Troubleshooting Errors
- Next Steps
- Classes and Objects in Python (Optional)
- Parquet Files (Optional)
- ydata-profiling (Optional)
This folder contains jupyter notebooks full of practice problems. Unless otherwise stated, each lecture notebook has a corresponding practice problem notebook for you to test your skills after completing the lecture. You should strive to reach a competency with the content so that you are able to complete these notebooks relatively quickly.
You should complete the practice problem notebooks in the following order:
- Start Here!
- jupyter notebooks
- Data Structures Conditionals and Loops
- Shallow and Deep Copies
- Functions
- Writing to File and Importing a Package
- numpy
- pandas
- Plotting
- Data File Types
- Troubleshooting Errors
- Classes and Objects in Python (Optional)
- Parquet Files (Optional)
- ydata-profiling (Optional)
This folder contains a few notebooks that can serve as skill assessments. You should treat these notebooks like quizzes, meaning you should try to not refer back to the lectures or practice problems when completing them. Each notebook will have a set of solutions posted with them as well. These notebooks can be finished in any order.
A starting point explaining the skill assessment notebooks.
Focuses more on base python like loops, list comprehensions, writing functions, etc.
Focuses more on data analysis packages like pandas, numpy, and matplotlib.
A blend of base python and data analysis packages.
The data folder contains the data used in the various jupyter notebooks in the repository.
Copyright Info
This repository was written for the Erdős Institute Cőde Data Science Boot Camp by Matthew Osborne, Ph. D., 2023.
Any potential redistributors must seek and receive permission from Matthew Tyler Osborne, Ph.D. prior to redistribution. Redistribution of the material contained in this repository is conditional on acknowledgement of Matthew Tyler Osborne, Ph.D.'s original authorship and sponsorship of the Erdős Institute. (see License.md)