This repo contains examples and tutorials for various Banyan Julia libraries, such as BanyanDataFrames and BanyanArrays. If you're new to Banyan, this is a great place to start!
To run the notebooks, follow these steps:
- Set up a Banyan account by following the steps here.
- Install Git and clone (download) this repository by running the following in a terminal (command prompt):
git clone git@github.com:banyan-team/banyan-julia-examples.git
- Download the Julia installer for your operating system here, and follow the instructions to finish the installation. Open the Julia REPL, enter package mode by typing `]`, and run `add IJulia` to install the Julia package required for running a Jupyter Notebook with Julia:
julia> ]
(@v1.6) pkg> add IJulia
- Install Jupyter Notebook by running
pip install notebook
or by following the directions here.
- Change (`cd`) into the directory of the notebook you would like to run. Then, run
BANYAN_API_KEY=<YOUR_BANYAN_API_KEY> BANYAN_USER_ID=<YOUR_BANYAN_USER_ID> julia --project=.
to open Julia.
- To run the notebook, run
using IJulia; notebook()
in the Julia REPL. These steps ensure that the notebook opens in the correct environment.
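Because the launch command above passes `BANYAN_API_KEY` and `BANYAN_USER_ID` as environment variables, it can help to confirm from inside Julia that both were actually set before opening a notebook. The following is a minimal sketch of such a check; `missing_banyan_vars` is a hypothetical helper written for this illustration and is not part of Banyan or IJulia.

```julia
# Hypothetical helper (an assumption, not a Banyan API): return the names of
# the required environment variables that are unset or empty.
function missing_banyan_vars(env = ENV)
    required = ["BANYAN_API_KEY", "BANYAN_USER_ID"]
    return [name for name in required if isempty(get(env, name, ""))]
end

# Check the environment of the current Julia session.
missing_vars = missing_banyan_vars()
if isempty(missing_vars)
    println("Banyan credentials found in the environment.")
else
    println("Missing environment variables: ", join(missing_vars, ", "))
end
```

If any variable is reported as missing, exit Julia and restart it with both variables set on the command line, as in the step above.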
iris.ipynb
- This notebook is a good starting place if you are new to Banyan. It provides a simple example of how to spin up a cluster and a session and how to run some data analytics on a CSV file on the cluster.
nyc_taxi.ipynb
- Large-scale data analysis of a Parquet dataset stored in the cloud (S3)
parameter_tuning.ipynb
- Massively parallel parameter tuning in the cloud
shallow_water_modeling.ipynb
- MPI-powered fluid dynamics simulation
satellite_image_encoding
- PyTorch-based encoding of thousands of NASA satellite images
Interactive usage in VSCode
If you are running the Jupyter notebooks from VSCode, you may encounter the following error on calls to `readline()` that read user input: `UndefVarError: execute_msg not defined`.
This is a known bug in the Julia VSCode extension: see here.
To resolve this, either replace the call to `readline()` with the actual value you wish to input, or run the Jupyter notebooks in another editor that supports interactive input.
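One way to apply the first workaround without deleting the prompt entirely is to wrap it, so that a preset value is used when one is given and `readline()` is only reached otherwise. The sketch below assumes this pattern; `prompt_or_default` and the placeholder value `"my-value"` are hypothetical and do not come from the notebooks.

```julia
# Hypothetical helper (not part of Banyan or IJulia): return the preset value
# when one is supplied; otherwise fall back to interactive input.
function prompt_or_default(prompt::AbstractString;
                           default::Union{Nothing,AbstractString} = nothing)
    if default !== nothing
        return String(default)
    end
    print(prompt)
    return readline()
end

# In VSCode, pass the value directly so that readline() is never called.
# "my-value" is a placeholder for whatever the notebook would have asked for.
user_value = prompt_or_default("Enter a value: "; default = "my-value")
println(user_value)
```

Omitting the `default` keyword restores the interactive prompt, so the same cell works unchanged in editors where `readline()` is supported.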