Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: update Chicago Taxi Dataset URL to Zenodo #3357

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 3 additions & 3 deletions docs/getting-started/10-minutes-to-awkward-array.md
Original file line number Diff line number Diff line change
Expand Up @@ -24,7 +24,7 @@ In this guide, we'll look at how to manipulate a jagged dataset to plot taxi rou

## Loading the dataset

Our dataset is formatted as a 611 MB [Apache Parquet](https://parquet.apache.org/) file, provided [here](https://pivarski-princeton.s3.amazonaws.com/chicago-taxi.parquet). Alongside JSON, and raw buffers, Awkward can also read Parquet files and Arrow tables.
Our dataset is formatted as a 611 MB [Apache Parquet](https://parquet.apache.org/) file, provided [here](https://zenodo.org/records/14537442/files/chicago-taxi.parquet). Alongside JSON, and raw buffers, Awkward can also read Parquet files and Arrow tables.

Given that this file is so large, let's first look at the *metadata* with `ak.metadata_from_parquet` to see what we're working with:

Expand All @@ -43,7 +43,7 @@ import numpy as np
import awkward as ak

metadata = ak.metadata_from_parquet(
"https://pivarski-princeton.s3.amazonaws.com/chicago-taxi.parquet"
"https://zenodo.org/records/14537442/files/chicago-taxi.parquet"
)
```

Expand All @@ -59,7 +59,7 @@ There are a lot of different columns here (`trip.sec`, `trip.begin.lon`, `trip.p

```{code-cell} ipython3
taxi = ak.from_parquet(
"https://pivarski-princeton.s3.amazonaws.com/chicago-taxi.parquet",
"https://zenodo.org/records/14537442/files/chicago-taxi.parquet",
row_groups=[0],
columns=["trip.km", "trip.begin.l*", "trip.end.l*", "trip.path.*"],
)
Expand Down
2 changes: 1 addition & 1 deletion docs/user-guide/how-to-examine-single-item.md
Original file line number Diff line number Diff line change
Expand Up @@ -27,7 +27,7 @@ First, let's load the dataset using the {func}`ak.from_parquet` function. We wil
```{code-cell} ipython3
import awkward as ak

url = "https://pivarski-princeton.s3.amazonaws.com/chicago-taxi.parquet"
url = "https://zenodo.org/records/14537442/files/chicago-taxi.parquet"
taxi = ak.from_parquet(
url,
row_groups=[0],
Expand Down
Loading