Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Detect hang by tensor util #1448

Merged

add update_node_xpu_info api

63cf412
Select commit
Loading
Failed to load commit list.
Merged

Detect hang by tensor util #1448

add update_node_xpu_info api
63cf412
Select commit
Loading
Failed to load commit list.
Codecov / codecov/patch succeeded Jan 21, 2025 in 0s

87.04% of diff hit (target 80.00%)

View this Pull Request on Codecov

87.04% of diff hit (target 80.00%)

Annotations

Check warning on line 68 in dlrover/python/common/metric/context.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/context.py#L68

Added line #L68 was not covered by tests

Check warning on line 72 in dlrover/python/common/metric/context.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/context.py#L70-L72

Added lines #L70 - L72 were not covered by tests

Check warning on line 95 in dlrover/python/common/metric/context.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/context.py#L95

Added line #L95 was not covered by tests

Check warning on line 99 in dlrover/python/common/metric/context.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/context.py#L97-L99

Added lines #L97 - L99 were not covered by tests

Check warning on line 122 in dlrover/python/common/metric/context.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/context.py#L122

Added line #L122 was not covered by tests

Check warning on line 126 in dlrover/python/common/metric/context.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/context.py#L124-L126

Added lines #L124 - L126 were not covered by tests

Check warning on line 153 in dlrover/python/common/metric/context.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/context.py#L152-L153

Added lines #L152 - L153 were not covered by tests

Check warning on line 148 in dlrover/python/common/metric/metric.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/metric.py#L148

Added line #L148 was not covered by tests

Check warning on line 152 in dlrover/python/common/metric/metric.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/metric.py#L152

Added line #L152 was not covered by tests

Check warning on line 316 in dlrover/python/common/metric/monitor.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/monitor.py#L316

Added line #L316 was not covered by tests

Check warning on line 321 in dlrover/python/common/metric/monitor.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/monitor.py#L320-L321

Added lines #L320 - L321 were not covered by tests

Check warning on line 324 in dlrover/python/common/metric/monitor.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/monitor.py#L324

Added line #L324 was not covered by tests

Check warning on line 330 in dlrover/python/common/metric/monitor.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/monitor.py#L327-L330

Added lines #L327 - L330 were not covered by tests

Check warning on line 342 in dlrover/python/common/metric/monitor.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/monitor.py#L341-L342

Added lines #L341 - L342 were not covered by tests

Check warning on line 351 in dlrover/python/common/metric/monitor.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/monitor.py#L348-L351

Added lines #L348 - L351 were not covered by tests

Check warning on line 353 in dlrover/python/common/metric/monitor.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/monitor.py#L353

Added line #L353 was not covered by tests

Check warning on line 418 in dlrover/python/common/metric/monitor.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/monitor.py#L418

Added line #L418 was not covered by tests

Check warning on line 422 in dlrover/python/common/metric/monitor.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/monitor.py#L422

Added line #L422 was not covered by tests

Check warning on line 426 in dlrover/python/common/metric/monitor.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/monitor.py#L426

Added line #L426 was not covered by tests

Check warning on line 498 in dlrover/python/common/metric/monitor.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/monitor.py#L498

Added line #L498 was not covered by tests

Check warning on line 502 in dlrover/python/common/metric/monitor.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/monitor.py#L502

Added line #L502 was not covered by tests

Check warning on line 506 in dlrover/python/common/metric/monitor.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/common/metric/monitor.py#L506

Added line #L506 was not covered by tests

Check warning on line 1251 in dlrover/python/elastic_agent/torch/training.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/elastic_agent/torch/training.py#L1250-L1251

Added lines #L1250 - L1251 were not covered by tests

Check warning on line 71 in dlrover/python/master/diagnosis/diagnosis_manager.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/master/diagnosis/diagnosis_manager.py#L71

Added line #L71 was not covered by tests

Check warning on line 99 in dlrover/python/master/diagnosis/diagnosis_manager.py

See this annotation in the file changed.

@codecov codecov / codecov/patch

dlrover/python/master/diagnosis/diagnosis_manager.py#L99

Added line #L99 was not covered by tests