Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,6 @@ | ||
``` | ||
bin etc lib64 proc sbin sys var | ||
boot {{ site.remote.homedir | replace: "/", "" }} mnt root scratch tmp working | ||
dev lib opt run srv usr | ||
``` | ||
{: .output} |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,4 @@ | ||
``` | ||
No Modulefiles Currently Loaded. | ||
``` | ||
{: .output} |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,6 @@ | ||
* **Hostname**: Where did your job run? | ||
* **MaxRSS**: What was the maximum amount of memory used? | ||
* **Elapsed**: How long did the job take? | ||
* **State**: What is the job currently doing/what happened to it? | ||
* **MaxDiskRead**: Amount of data read from disk. | ||
* **MaxDiskWrite**: Amount of data written to disk. |
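
The fields above correspond to columns that `sacct` can report. As a minimal sketch, the script below assembles the `sacct` call from the `hist_filter` value in the Cirrus configuration in this commit, where the node a job ran on is requested as `NodeList`. The job ID is a hypothetical placeholder, and the command is printed rather than executed so the sketch also runs on a machine without Slurm.

```bash
#!/bin/bash
# Field list copied from `hist_filter` in the Cirrus configuration.
fields="JobID,JobName,State,Elapsed,NodeList,MaxRSS,MaxDiskRead,MaxDiskWrite"

# Hypothetical job ID, used purely for illustration.
jobid="1234567"

# Assemble the command; printing it keeps the sketch runnable off-cluster.
cmd="sacct --format=${fields} -j ${jobid}"
echo "${cmd}"
```

On a login node, running the printed command with a real job ID would be expected to show one column per requested field.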
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,19 @@ | ||
> Jobs on an HPC system might run for days or even weeks. We probably have | ||
> better things to do than constantly check on the status of our job with | ||
> `{{ site.sched.status }}`. Looking at the manual page for | ||
> `{{ site.sched.submit.name }}`, can you set up our test job to send you an email | ||
> when it finishes? | ||
> | ||
> > ## Hint | ||
> > | ||
> > You can use the *manual pages* for {{ site.sched.name }} utilities to find | |
> > out more about their capabilities. On the command line, these are accessed | |
> > through the `man` utility: run `man <program-name>`. You can find the same | |
> > information online by searching for "man <program-name>". | |
> > | ||
> > ``` | ||
> > {{ site.remote.prompt }} man {{ site.sched.submit.name }} | ||
> > ``` | ||
> > {: .language-bash} | ||
> {: .solution} | ||
{: .challenge} |
This file was deleted.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,74 @@ | ||
#------------------------------------------------------------ | ||
# EPCC, The University of Edinburgh: Cirrus + Slurm | |
#------------------------------------------------------------ | ||
|
||
# Cluster host and scheduler options: the defaults come from | ||
# Graham at Compute Canada, running Slurm. Other options can | ||
# be found in the library of snippets, | ||
# `_includes/snippets_library`. To use one, replace options | ||
# below with those in `_config_options.yml` from the | ||
# library. E.g., to customize for Cirrus at EPCC, running | |
# Slurm, we could replace the options below with those from | ||
# | ||
# _includes/snippets_library/EPCC_Cirrus_slurm/_config_options.yml | ||
# | ||
# If your cluster is not represented in the library, please | ||
# copy an existing folder, rename it, and customize for your | ||
# installation. Remember to keep the leading slash on the | ||
# `snippets` variable below! | ||
|
||
snippets: "/snippets_library/EPCC_Cirrus_slurm" | ||
|
||
local: | |
  prompt: "[auser@laptop ~]$" | |
  bash_shebang: "#!/bin/bash" | |
|
||
remote: | |
  name: "Cirrus" | |
  login: "login.cirrus.ac.uk" | |
  host: "cirrus-login1" | |
  node: "r1i0n32" | |
  location: "EPCC, The University of Edinburgh" | |
  homedir: "/lustre/home/tc001" | |
  user: "auser" | |
  group: "tc001" | |
  prompt: "[auser@cirrus-login1 ~]$" | |
  bash_shebang: "#!/bin/bash" | |
  module_python3: "anaconda/python3-2021.11" | |
|
||
sched: | |
  name: "Slurm" | |
  submit: | |
    name: "sbatch" | |
    options: "--partition=standard --qos=standard --time=00:02:00" | |
  queue: | |
    debug: "debug" | |
    testing: "testing" | |
  status: "squeue" | |
  flag: | |
    user: "-u auser" | |
    interactive: "--time=00:20:00 --partition=standard --qos=standard --pty /usr/bin/bash --login" | |
    histdetail: "-l -j" | |
    name: "-J" | |
    partition: "-p standard" | |
    qos: "-q standard" | |
    time: "-t" | |
    queue: "-p" | |
    nodes: "-N" | |
    tasks: "-n" | |
  del: "scancel" | |
  interactive: "srun" | |
  info: "sinfo" | |
  comment: "#SBATCH" | |
  hist: "sacct" | |
  hist_filter: "--format=JobID,JobName,State,Elapsed,NodeList,MaxRSS,MaxDiskRead,MaxDiskWrite" | |
|
||
episode_order: | ||
- 11-hpc-intro | ||
- 12-cluster | ||
- 13-scheduler | ||
- 14-modules | ||
- 15-transferring-files | ||
- 16-parallel | ||
- 17-resources | ||
- 18-responsibility |