Skip to content
This repository has been archived by the owner on Sep 12, 2024. It is now read-only.

Managing overflow sites like NERSC

Sharad Agarwal edited this page May 15, 2020 · 10 revisions

Unified extends the whitelist to NERSC in real time when conditions are met which are the following:

  • FNAL agent
  • Idle jobs / Running jobs > 0.5
  • Campaign is setup with overflow OR the workflow requires no primary input (like wmLHEGS)
    "overflow": {
     ...
}

overflow empty means no overflow

for workflow will no input like GS or wmLHEGS it will automatically enable overflow to HPC

for workflow with input (like SIM etc) it will only overflow if we enable overflow PRIM. sometimes we need it, like UL campaigns where the GEN input is really light so reading over xrootd has no problem

Module used in unified for managing overflow sites - mappor.py

side note from Alan

Unified adds NERSC to the DESIRED_Sites job classad! If we manage to add NERSC to the SiteWhitelist, then we would be able to use more NERSC resources (and get more work on the FNAL agents)

side note from Thong

The reason NERSC is not in the original whitelist is that NERSC doesn’t have an entry for ncores and memory in gwmsmon, which Unified uses to filter out appropriate sites according to wf requirement

Clone this wiki locally