`number_of_workers` can be a way to support embarrassingly parallel jobs on multi-node compute environments.

How can a process identify which replicate it is? It is necessary to know so the workload can be divided, e.g. in plugin code.
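A minimal sketch of how plugin code might divide its workload if each replica could learn its own index. The environment variable names `WORKER_INDEX` and `NUMBER_OF_WORKERS` are hypothetical (pman does not set them today), and the 0-based index and round-robin split are assumptions made only for illustration:

```python
import os
from typing import List, Mapping, Sequence, Tuple, TypeVar

T = TypeVar("T")


def replica_from_env(environ: Mapping[str, str] = os.environ) -> Tuple[int, int]:
    """Read (index, total) for this replica.

    HYPOTHETICAL: WORKER_INDEX and NUMBER_OF_WORKERS are illustrative names,
    not variables that pman currently provides. When they are absent, the
    process behaves as the only worker.
    """
    index = int(environ.get("WORKER_INDEX", "0"))
    total = int(environ.get("NUMBER_OF_WORKERS", "1"))
    return index, total


def my_share(items: Sequence[T], index: int, total: int) -> List[T]:
    """Return the items assigned to replica `index` (0-based) out of `total`.

    Items are dealt round-robin, so every item goes to exactly one replica.
    """
    if total < 1 or not 0 <= index < total:
        raise ValueError(f"invalid replica {index} of {total}")
    return list(items[index::total])


if __name__ == "__main__":
    index, total = replica_from_env()
    # Stand-in for the plugin's real inputs, e.g. files in its input directory.
    inputs = [f"subject-{i:02d}" for i in range(10)]
    for item in my_share(inputs, index, total):
        print(f"replica {index}/{total} processing {item}")
```

Sorting the inputs identically in every replica before splitting matters here: each replica computes its share independently, so they must all see the same ordering.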
The equivalent concept in SLURM is a job array.
https://slurm.schedmd.com/job_array.html
e.g. for a job array with indices 1 to 4, four instances of `job.sh` will be executed, possibly on different compute nodes, and each instance will have the environment variable `SLURM_ARRAY_TASK_ID` set to `1`, `2`, `3`, or `4`.

`pman` should do something similar.