-
Notifications
You must be signed in to change notification settings - Fork 106
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[bugfix] Evaluate Slurm job nodelist lazily #3022
[bugfix] Evaluate Slurm job nodelist lazily #3022
Conversation
Codecov ReportAttention:
Additional details and impacted files@@ Coverage Diff @@
## master #3022 +/- ##
=======================================
Coverage 86.71% 86.71%
=======================================
Files 61 61
Lines 12002 12002
=======================================
Hits 10407 10407
Misses 1595 1595
☔ View full report in Codecov by Sentry. |
Thank you so much @vkarak. I will try this as soon as our system is back from maintenance. |
@stefaniereuter did you manage to try this out? |
I am so sorry I haven't had a chance to test yet as our system will still be down for probably another week.
26 Oct 2023 14:49:16 Vasileios Karakasis ***@***.***>:
…
@stefaniereuter[https://github.com/stefaniereuter] did you manage to try this out?
—
Reply to this email directly, view it on GitHub[#3022 (comment)], or unsubscribe[https://github.com/notifications/unsubscribe-auth/AXOHEN4EFAFGECZSKKOVV73YBJL4XAVCNFSM6AAAAAA6H3HMQCVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTOOBRGA2TONJUHE].
You are receiving this because you were mentioned.
[Tracking image][https://github.com/notifications/beacon/AXOHEN5FTVADCSMKNLZQOOLYBJL4XA5CNFSM6AAAAAA6H3HMQCWGG33NNVSW45C7OR4XAZNMJFZXG5LFINXW23LFNZ2KUY3PNVWWK3TUL5UWJTTKFDEA2.gif]
|
@stefaniereuter Did you have a chance to test this? We will merge this PR so as to make it to the 4.4.1 release. If the problem is not fixed, feel free to reopen the issue. |
sorry for the very late response. Yes we were finally able to test this last week and it seem to fix our issue, thank you so much |
The
nodelist
evaluation was being triggered early by logging, when we update the log record with all the test attributes:reframe/reframe/core/logging.py
Lines 832 to 846 in b1c8970
The problem is that in some cases Slurm does not populate the nodelist of the job immediately, so we might end with an incomplete node list in the test job. We fix this by evaluating lazily the job's nodelist after the job has finished.
Fixes #3018.
@stefaniereuter could you please check if this fixes your problem?