Skip to content

Feature exp run: Dryer resume within the CI #6823

Closed as not planned
Closed as not planned
@DavidGOrtega

Description

@DavidGOrtega

Issue

In the CI, to be able to resume training with preexisting checkpoints we have to make something like:

EXP_NAME=cml-run-${GITHUB_SHA}
EXP_AVAIL=$(dvc exp pull --run-cache origin $EXP_NAME || echo '')
if [[ -z "$EXP_AVAIL" ]]; then
    echo "############\nFirst Time\n############"
    dvc exp run -n $EXP_NAME --pull -v
else    
    echo "############\nResuming\n############"
    dvc exp apply $EXP_NAME
    dvc exp run -v
fi

Would be nice if we had:

  • a flag with dvc exp run -n $EXP_NAME to be able to pull and apply

So it would become:

EXP_NAME=cml-run-${GITHUB_SHA}
dvc exp run -n $EXP_NAME --pull-apply -v

Additional issue

Please note:

EXP_AVAIL=$(dvc exp pull --run-cache origin $EXP_NAME || echo '')

This is because dvc exp pull --run-cache origin $EXP_NAME will throw an error in no prev experiments are present

Metadata

Metadata

Assignees

No one assigned

    Labels

    A: experimentsRelated to dvc expenhancementEnhances DVCp2-mediumMedium priority, should be done, but less important

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions