generated from carpentries/workbench-template-md
-
-
Notifications
You must be signed in to change notification settings - Fork 2
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
2560e27
commit ee38a30
Showing
1 changed file
with
17 additions
and
4 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -4,7 +4,18 @@ title: Setup | |
|
||
## Overview | ||
|
||
In this workshop, we’ll explore machine learning procedures using **AWS SageMaker**. We assume you have a GitHub account and an AWS account. If not, please follow the instructions below to set these up. Once accounts are ready, you’ll download the required data and create a fork of this lesson's GitHub repository. | ||
This workshop introduces you to foundational workflows in **AWS SageMaker**, covering data setup, model training, hyperparameter tuning, and model deployment within AWS's managed environment. You’ll learn how to use SageMaker notebooks to control data pipelines, manage training jobs, and evaluate model performance effectively. We’ll also cover strategies to help you scale training and tuning efficiently, with guidance on choosing between CPUs and GPUs, as well as when to consider parallelized training. | ||
|
||
To keep costs manageable, this workshop provides tips for tracking and monitoring AWS expenses, so your experiments remain affordable. While AWS isn’t entirely free, it’s very cost-effective for typical ML workflows—training roughly 100 models on a small dataset (under 10GB) can cost under $20, making it accessible for many research projects. | ||
|
||
### What This Workshop Does Not Cover | ||
|
||
Currently, this workshop does not include: | ||
- **AWS Lambda** for serverless function deployment, | ||
- **MLFlow** or other MLOps tools for experiment tracking, | ||
- Additional AWS services beyond the core SageMaker ML workflows. | ||
|
||
If there’s a specific ML workflow or AWS service you’d like to see included in this curriculum, we’re open to developing more content to meet the needs of researchers and ML practitioners at UW–Madison. Please contact [[email protected]](mailto:[email protected]) with suggestions or requests. | ||
|
||
## Accounts and Initial Setup | ||
|
||
|
@@ -16,8 +27,10 @@ If you don't already have a GitHub account, [sign up for GitHub](https://github. | |
|
||
If you don't have an AWS account, please follow these steps: | ||
|
||
> **Note**: Hackathon attendees can skip this step since we are providing you with the account. | ||
1. Go to the [AWS Free Tier page](https://aws.amazon.com/free/) and click **Create a Free Account**. | ||
2. Complete the sign-up process. AWS offers a free tier with limited monthly usage. Some services, including SageMaker, may incur charges beyond free-tier limits, so be mindful of usage during the workshop. | ||
2. Complete the sign-up process. AWS offers a free tier with limited monthly usage. Some services, including SageMaker, may incur charges beyond free-tier limits, so be mindful of usage during the workshop. If you follow along with the materials, you can expect to incur around $10 in compute fees (e.g., from training and tuning several different models with GPU enabled at times). | ||
|
||
Once your AWS account is set up, log in to the **AWS Management Console** to get started with SageMaker. | ||
|
||
|
@@ -36,10 +49,10 @@ You will need a copy of the lesson repository on GitHub to explore how to manage | |
|
||
To do this: | ||
|
||
1. Go to the workshop's [GitHub repository page](https://github.com/YOUR_REPO_LINK). | ||
1. Go to the workshop's [GitHub repository page](https://github.com/UW-Madison-DataScience/ml-with-aws-sagemaker). | ||
2. Click **Fork** (top right) to create your own copy of the repository under your GitHub account. | ||
3. Once forked, you don't need to do anything else. We'll clone this fork once we start working in the AWS Jupyter environment using... | ||
|
||
```bash | ||
!git clone https://github.com/YOUR_USERNAME/YOUR_FORKED_REPO.git | ||
!git clone https://github.com/YOUR_USERNAME/ml-with-aws-sagemaker.git | ||
``` |