Skip to content

Latest commit

 

History

History
 
 

onboarding

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 

DSI ONBOARDING TASKS

Note: The following is a Markdown formatted readme file of this Google Document that is sent to students by production teams upon signing up for the course.

DATA SCIENCE IMMERSIVE

Congratulations on joining the Data Science Immersive program at General Assembly! We can’t wait to get started, so we’ve outlined a few assignments to get everyone on the same page. These onboarding tasks will prepare you to hit the ground running, and are due by the first day of class. As always, let us know if you have any questions. Have fun and we’ll see you soon!

-- GA + the DSI Instructional Team


WHAT CAN I EXPECT FROM THE ONBOARDING?

Your onboarding tasks include a number of required and optional tutorials to help you review the fundamentals of stats, python, SQL, git, and the command line. These are accompanied by a set of multiple-choice challenges that assess whether you’ve fully grasped the concepts that you’ve learned.

There are time estimations under each module but depending on your background and familiarity with these topics, you may spend more or less time on each part.

If you find yourself thinking…

     “This stuff is way too easy, is this course actually going to challenge me?”

The purpose of these tasks is to make sure that everyone starts the course on the same page. If you are speeding through this material, you’re in a great place to start the course, and we’ve included bonus material for you to work through. Don’t worry, you’ll be challenged in this course - we like to take it slow in the beginning but will ramp up fast!

     “This stuff is way too hard, am I going to be able to keep up?”

That’s why you’re doing these assignments in the first place - so that you can get up to speed! We’ll be reviewing these concepts in the first week as well so if you don’t get it now, you’ll have a chance to fully catch up by the end of the first week of class.

A NOTE ABOUT TECHNOLOGY

You’ll need an account with Khan Academy and Code Academy to complete these assignments. Go ahead and take 5 minutes to create accounts if you don’t have them already.

Additionally, before the course starts, we’ll provide you with directions on how to install the following technologies:

  • Github - We’ll be using Github on a daily basis to store and share our code
  • Python 2.7 - We will be using Python & its packages as our primary language
  • Anaconda - We will be using Anaconda as our primary development environment
  • Postgres - We’ll be using Postgres for local SQL-based data storage
  • Slack - We’ll be using Slack on a daily basis to communicate with each other

STEP 1: COMPLETE THESE REQUIRED MODULES

Est: 35 - 40 hrs

Complete the following two modules. If you’ve already done these in the past, then try reviewing the "Bonus" activities instead:

I. Statistics

II. Python

III. Git

IV. SQL

V. Command Line

STEP 2: SUBMIT THESE REQUIRED EXERCISES

1 -2 hrs

Once you've completed Step 1, please complete our Data Science Onboarding Exercises. Follow the directions to submit your answers.

STEP 3: TRY SOME OPTIONAL CHALLENGES

1 hr - however long you want!

These advanced exercises will help you sharpen your wits in statistics and programming :

Bonus Materials

STEP 4: DO SOME OPTIONAL READING

1 hr - however long you want!

Time to get excited about data! Check out the following resources for a brief look at how data science is changing the world:

Articles

Blogs

  • Check out Randy Olson’s blog where he shares his data tinkering problems and discoveries like how he computed the optimal way to find Waldo.
  • FiveThirtyEight is Nate Silver’s brain child and a data nerd’s playground. Browse around for some interesting and inspiring uses of statistics and data modeling in the real world.

Videos & Podcasts

Textbooks