Skip to content

Commit

Permalink
Update setup.md
Browse files Browse the repository at this point in the history
  • Loading branch information
qualiaMachine authored Nov 4, 2024
1 parent 2560e27 commit ee38a30
Showing 1 changed file with 17 additions and 4 deletions.
21 changes: 17 additions & 4 deletions learners/setup.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,18 @@ title: Setup

## Overview

In this workshop, we’ll explore machine learning procedures using **AWS SageMaker**. We assume you have a GitHub account and an AWS account. If not, please follow the instructions below to set these up. Once accounts are ready, you’ll download the required data and create a fork of this lesson's GitHub repository.
This workshop introduces you to foundational workflows in **AWS SageMaker**, covering data setup, model training, hyperparameter tuning, and model deployment within AWS's managed environment. You’ll learn how to use SageMaker notebooks to control data pipelines, manage training jobs, and evaluate model performance effectively. We’ll also cover strategies to help you scale training and tuning efficiently, with guidance on choosing between CPUs and GPUs, as well as when to consider parallelized training.

To keep costs manageable, this workshop provides tips for tracking and monitoring AWS expenses, so your experiments remain affordable. While AWS isn’t entirely free, it’s very cost-effective for typical ML workflows—training roughly 100 models on a small dataset (under 10GB) can cost under $20, making it accessible for many research projects.

### What This Workshop Does Not Cover

Currently, this workshop does not include:
- **AWS Lambda** for serverless function deployment,
- **MLFlow** or other MLOps tools for experiment tracking,
- Additional AWS services beyond the core SageMaker ML workflows.

If there’s a specific ML workflow or AWS service you’d like to see included in this curriculum, we’re open to developing more content to meet the needs of researchers and ML practitioners at UW–Madison. Please contact [[email protected]](mailto:[email protected]) with suggestions or requests.

## Accounts and Initial Setup

Expand All @@ -16,8 +27,10 @@ If you don't already have a GitHub account, [sign up for GitHub](https://github.

If you don't have an AWS account, please follow these steps:

> **Note**: Hackathon attendees can skip this step since we are providing you with the account.
1. Go to the [AWS Free Tier page](https://aws.amazon.com/free/) and click **Create a Free Account**.
2. Complete the sign-up process. AWS offers a free tier with limited monthly usage. Some services, including SageMaker, may incur charges beyond free-tier limits, so be mindful of usage during the workshop.
2. Complete the sign-up process. AWS offers a free tier with limited monthly usage. Some services, including SageMaker, may incur charges beyond free-tier limits, so be mindful of usage during the workshop. If you follow along with the materials, you can expect to incur around $10 in compute fees (e.g., from training and tuning several different models with GPU enabled at times).

Once your AWS account is set up, log in to the **AWS Management Console** to get started with SageMaker.

Expand All @@ -36,10 +49,10 @@ You will need a copy of the lesson repository on GitHub to explore how to manage

To do this:

1. Go to the workshop's [GitHub repository page](https://github.com/YOUR_REPO_LINK).
1. Go to the workshop's [GitHub repository page](https://github.com/UW-Madison-DataScience/ml-with-aws-sagemaker).
2. Click **Fork** (top right) to create your own copy of the repository under your GitHub account.
3. Once forked, you don't need to do anything else. We'll clone this fork once we start working in the AWS Jupyter environment using...

```bash
!git clone https://github.com/YOUR_USERNAME/YOUR_FORKED_REPO.git
!git clone https://github.com/YOUR_USERNAME/ml-with-aws-sagemaker.git
```

0 comments on commit ee38a30

Please sign in to comment.