Open-ended assignments

In Homework 1 and the Final Project, you will pick your own dataset(s).

Open data portals

There are countless places to get data, notably:

{% if id == "columbia" -%}

Inspiration

For starters, see the Final Project examples from past semesters.

It is probably not realistic to make visualizations as fancy as these, which were made by professionals, but they may give you ideas. Some also include links to, or downloads of, the source data.

Storing data

{% if id == "columbia" -%} To work with uploaded files in {{coding_env_name}}, you have two options.

Direct upload

Fewer steps, but your file(s) will disappear when your session ends.

Steps to get data into {{coding_env_name}} directly

  1. In the {{coding_env_name}} sidebar, click the Files icon (A).
  2. Click the upload button (B).
  3. Select your file.
  4. You'll use read_csv("MY_FILENAME.csv") in your code.
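As a minimal sketch of the last step, assuming `read_csv` here is the pandas function and that the uploaded file sits in the session's working directory, loading the file might look like the following. The helper name `load_uploaded_csv` is hypothetical, and `MY_FILENAME.csv` is the placeholder from the steps above.

```python
import pandas as pd


def load_uploaded_csv(filename="MY_FILENAME.csv"):
    """Read an uploaded CSV from the working directory into a DataFrame.

    `filename` is a placeholder; replace it with the name of the file you uploaded.
    """
    df = pd.read_csv(filename)
    # Report the number of rows and columns as a quick sanity check.
    print(f"Loaded {df.shape[0]} rows and {df.shape[1]} columns")
    return df
```

In a notebook cell you would then call `df = load_uploaded_csv("MY_FILENAME.csv")`. If the session has ended since you uploaded the file, expect a `FileNotFoundError` until you upload it again.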

Google Drive

More steps, but your file(s) are preserved between sessions.

Steps to get data into {{coding_env_name}} via Drive

  1. Upload the file(s) somewhere in Drive.
  2. In the {{coding_env_name}} sidebar, click the Files icon (A).
  3. Click the Mount Drive icon (B).
    • You may need to run the code it injects to authorize it (C).
    • Think of this as attaching your Drive to your {{coding_env_name}} instance, as if you were plugging in a USB flash drive.
  4. Navigate to the file (D).
    • You may need to click into content, then drive.
  5. Next to the filename, click the three dots.
  6. Click Copy path (E).
    • The value should be something like /content/drive/My Drive/....
  7. Use this path with read_csv() (F).
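The Drive steps above can also be sketched in code. This assumes {{coding_env_name}} is Google Colab (the `/content/drive` path suggests so) and that Drive is mounted at `/content/drive`. The helper names and the example location `data/my_file.csv` are hypothetical.

```python
from pathlib import PurePosixPath

# Assumed mount point, matching the path shown by "Copy path" above.
DRIVE_MOUNT_POINT = "/content/drive"


def mount_drive():
    """Attach Google Drive to the session. Only works inside Google Colab."""
    # Imported here because the google.colab package only exists in Colab.
    from google.colab import drive

    drive.mount(DRIVE_MOUNT_POINT)


def drive_file_path(path_in_drive):
    """Build the full path for a file stored under "My Drive".

    `path_in_drive` is the file's location inside Drive, e.g. "data/my_file.csv".
    """
    return str(PurePosixPath(DRIVE_MOUNT_POINT) / "My Drive" / path_in_drive)
```

In a notebook, you would call `mount_drive()` once per session, then pass `drive_file_path("data/my_file.csv")` to `read_csv()`. Copying the path from the file browser, as in the steps above, gives the same result without any typing mistakes.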

{% else -%}

  1. Open the {{coding_env_name}} file browser.
  2. Navigate to the folder your notebook is in.
  3. Upload the data.
  4. From Python, use read_csv("./<filename>.csv").

Note that the file path should be relative to the notebook within {{coding_env_name}}; ./ means "in the same directory". {% endif %}{{coding_env_name}} cannot access files on your local machine; in other words, the path shouldn't start with C:\\ or anything like that. More info about file paths.
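If `read_csv()` cannot find your file, a quick path check can narrow down why. The sketch below uses only Python's standard library. The function name `check_data_path` and its messages are hypothetical, and it assumes the notebook runs on a hosted (non-Windows) machine.

```python
from pathlib import Path, PureWindowsPath


def check_data_path(path_str):
    """Return a short message saying whether `path_str` looks usable from the notebook."""
    # A drive letter such as "C:" means the path points at a local Windows machine,
    # which the hosted environment cannot see.
    if PureWindowsPath(path_str).drive:
        return "local Windows path: upload the file instead"

    # Relative paths are resolved against the current working directory.
    if not Path(path_str).exists():
        return f"not found: looked relative to {Path.cwd()}"

    return "ok"
```

For example, `check_data_path("C:\\Users\\me\\data.csv")` reports a local Windows path, while a misspelled filename reports the directory that was searched.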

{% if id == "nyu" -%}

Limits

{{coding_env_name}} has a disk storage limit of 1GB (a.k.a. 1,024 MB or 1,048,576 KB) across all your files, and a memory limit of 3GB. {%- endif %}
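Where a disk limit applies, it can help to check how much space your files take up in total. The following is a minimal sketch using only the standard library. The function names are hypothetical, and the 1 GB default is an assumption you should adjust to your environment's actual limit.

```python
from pathlib import Path

# Assumed limit: 1 GB = 1,024 MB = 1,048,576 KB.
DISK_LIMIT_BYTES = 1024 ** 3


def folder_size_bytes(folder="."):
    """Add up the sizes of all files in `folder`, including subfolders."""
    return sum(p.stat().st_size for p in Path(folder).rglob("*") if p.is_file())


def folder_size_mb(folder="."):
    """Same total, expressed in megabytes (1 MB = 1,024 KB)."""
    return folder_size_bytes(folder) / 1024 ** 2


def fits_within_limit(folder=".", limit_bytes=DISK_LIMIT_BYTES):
    """Return True if the folder's total size is at or under the limit."""
    return folder_size_bytes(folder) <= limit_bytes
```

Running `folder_size_mb(".")` in a notebook gives the total for the notebook's own directory. Note that this measures disk usage only; a file that fits on disk can still exceed the memory limit once loaded.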

Reducing data size

You can make data smaller before uploading by filtering it through: