-
Notifications
You must be signed in to change notification settings - Fork 15
feat: parametrize GPUS_PER_NODE and CPUS_PER_WORKER in ray.sub #410
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Signed-off-by: Terry Kong <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We're getting to a lot of 'knobs' in ray.sub. Maybe it's time to add a doc for it? It didn't seem obvious to me that envvars like HF_HOME and WANDB_API_KEY would get plumbed through ray.sub and now we're adding GPUS_PER_NODE and CPUS_PER_WORKER too.
I'll address the UV_CACHE_DIR in a follow up PR #426 |
Signed-off-by: Terry Kong <[email protected]>
Signed-off-by: Terry Kong <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Completed the Tech Pubs review of docs/cluster.md and provided some copyedits and suggested text revisions. Comments added inline with the "add a suggestion" tool as well as line-by-line for read-only text.
Co-authored-by: jgerh <[email protected]> Signed-off-by: Terry Kong <[email protected]>
Signed-off-by: Terry Kong <[email protected]>
Signed-off-by: Terry Kong <[email protected]>
Closes #309