~NGC release testing #19

Manually triggered April 5, 2024 17:10
Status Cancelled
Total duration 33m 26s
Artifacts 1

ngc-release-testing.yaml

on: workflow_dispatch
Matrix: test-maxtext / maxtext-multinode
Waiting for pending jobs
Matrix: test-maxtext / single-process-multi-device
Waiting for pending jobs
Matrix: test-jax / run-unit-test
Matrix: test-levanter / run-unit-test
Waiting for pending jobs
Matrix: test-rosetta-pax / rosetta-pax-multi-node-te
Matrix: test-rosetta-pax / rosetta-pax-multi-node
Matrix: test-rosetta-pax / rosetta-pax-single-node-dropout-te
Matrix: test-rosetta-pax / single-process-evaluation-te
Matrix: test-rosetta-pax / single-process-multi-device-te
test-jax / runner / launch-slurm-runner (33m 8s)
test-levanter / runner / launch-slurm-runner
test-maxtext / summary
test-maxtext / metrics
test-rosetta-pax / summary (0s)
test-rosetta-pax / metrics (0s)
test-maxtext / sitrep / sitrep
test-rosetta-pax / sitrep / sitrep
test-maxtext / outcome
test-rosetta-pax / outcome (0s)
finalize / workflow-badge
finalize / report
finalize / upload-badge
finalize / publish-badge

Annotations

36 errors
test-jax / jax-V100-unit-test
Process completed with exit code 1.
test-jax / jax-A100-unit-test
The run was canceled by @DwarKapex.
test-rosetta-pax / single-process-multi-device-te (1, 1, 2, 4)
The operation was canceled.
test-jax / runner / launch-slurm-runner
The run was canceled by @DwarKapex.
test-jax / runner / launch-slurm-runner
The operation was canceled.
test-rosetta-pax / single-process-multi-device-te (1, 8, 1, 1)
The operation was canceled.
test-rosetta-pax / rosetta-pax-single-node-dropout-te (1, 8, 1, 1)
The operation was canceled.
test-rosetta-pax / single-process-evaluation-te (1, 8, 1, 1)
The operation was canceled.
test-rosetta-pax / rosetta-pax-multi-node (4, 2, 1, 2)
The run was canceled by @DwarKapex.
test-rosetta-pax / rosetta-pax-multi-node (4, 2, 1, 2)
The operation was canceled.
test-rosetta-pax / rosetta-pax-multi-node (1, 4, 1, 2)
The run was canceled by @DwarKapex.
test-rosetta-pax / rosetta-pax-multi-node (1, 4, 1, 2)
The operation was canceled.
test-rosetta-pax / rosetta-pax-multi-node (4, 2, 1, 1)
The run was canceled by @DwarKapex.
test-rosetta-pax / rosetta-pax-multi-node (4, 2, 1, 1)
The operation was canceled.
test-rosetta-pax / rosetta-pax-multi-node (1, 8, 1, 1)
The run was canceled by @DwarKapex.
test-rosetta-pax / rosetta-pax-multi-node (1, 8, 1, 1)
The operation was canceled.

Artifacts

Produced during runtime
Name                            Size
jax-unit-test-V100 (Expired)    26.4 KB