Trial_timeout not working properly for hyperparameter sweep in pipeline #39549
Labels: bug, Client, customer-reported, Machine Learning, needs-team-attention, Service Attention
Describe the bug
I am trying to submit a hyperparameter sweep job in a pipeline with a trial_timeout, but the trial_timeout is not applied: the job continues to run past the configured time. It appears that when the job is submitted from the Python SDK, maxRunDurationSeconds does not get set properly for the individual trial (see screenshot below).
To Reproduce
Steps to reproduce the behavior:
Run the code below:
Expected behavior
A trial should get cancelled if it goes past the trial_timeout time.
Screenshots
I have checked the raw JSON for the individual trials, and it appears that maxRunDurationSeconds is not set.
Job running for 2+ hrs when trial_timeout is set to 1800 seconds (30 min)
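For anyone who wants to repeat this check without relying on the screenshot, here is a minimal sketch of how the raw trial JSON could be inspected. It assumes the JSON has already been saved and loaded into a Python dict. Because the exact nesting of maxRunDurationSeconds in the payload is not shown in this report, the helper searches the whole structure for that key; the function names and the sample payload are hypothetical and are not part of the SDK.

```python
from typing import Any, Iterator, Optional


def find_key(obj: Any, key: str) -> Iterator[Any]:
    """Yield every value stored under `key` anywhere in a nested JSON structure."""
    if isinstance(obj, dict):
        for k, v in obj.items():
            if k == key:
                yield v
            # Keep descending: the key may appear at several nesting levels.
            yield from find_key(v, key)
    elif isinstance(obj, list):
        for item in obj:
            yield from find_key(item, key)


def trial_timeout_seconds(raw_job: dict) -> Optional[int]:
    """Return the first non-null maxRunDurationSeconds, or None if it is unset."""
    for value in find_key(raw_job, "maxRunDurationSeconds"):
        if value is not None:
            return value
    return None


def ran_past_timeout(elapsed_seconds: float, limit_seconds: Optional[int]) -> bool:
    """True if a limit is set and the trial ran longer than that limit."""
    return limit_seconds is not None and elapsed_seconds > limit_seconds


# Hypothetical payloads; the real trial JSON layout may differ.
trial_without_limit = {"properties": {"limits": {"maxRunDurationSeconds": None}}}
trial_with_limit = {"properties": {"limits": {"maxRunDurationSeconds": 1800}}}

print(trial_timeout_seconds(trial_without_limit))  # None
print(trial_timeout_seconds(trial_with_limit))     # 1800
print(ran_past_timeout(2 * 3600, 1800))            # True
```

A result of None for a trial submitted with trial_timeout=1800 would match the behavior described here: with no limit recorded on the trial, nothing cancels it after 30 minutes.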