Skip to content

Releases: pytorch/test-infra

Runner lambdas v20230420-103318

20 Apr 10:35
8b591dc
Compare
Choose a tag to compare
Increase the retry timeout for scaleUp by using a stochastic overshoo…

Runner lambdas v20230328-092043

28 Mar 09:22
8c7dcaa
Compare
Choose a tag to compare
scaleUp performance tunning to take advantage of the retry mechanism …

Runner lambdas v20230327-164347

27 Mar 16:45
058d167
Compare
Choose a tag to compare
Recent changes surfaced a bug with sending metrics (#3930)

Await for async functions completion.

Runner lambdas v20230327-093037

27 Mar 09:32
59d4b4f
Compare
Choose a tag to compare
Retry mechanism, improved code design and resiliency, add request cac…

Runner lambdas v20230207-031411

07 Feb 03:15
9e8ad98
Compare
Choose a tag to compare
Revert "fail scale-up batch when a single instance of the batch fails…

Runner lambdas v20230207-004225

07 Feb 00:43
1b8c6c3
Compare
Choose a tag to compare
gha-infra: Revert retry related commits (#2442)

Runner lambdas v20230201-184248

01 Feb 18:44
489a39d
Compare
Choose a tag to compare
clean messages from sqs when there is a failure (#2019)

Runner lambdas v20230201-143825

01 Feb 14:39
9d5460f
Compare
Choose a tag to compare
improve retry mechanism, avoiding rely on AWS SQS retry and instead o…

Runner lambdas v20230131-205135

31 Jan 20:53
698132e
Compare
Choose a tag to compare
limit messages delay to  900s on retry (#1966)

Runner lambdas v20230131-135103

31 Jan 13:52
9afcf35
Compare
Choose a tag to compare
creates a sqs queue for retry failed scale-up requests that are retry…