[Q&A] Jobs not being submitted to the correct server when hosting two servers on the same machine #2657

Closed Answered by YuanTingHsieh
dima1997 asked this question in Q&A
@dima1997 thanks for the question!

By default, jobs are stored in "/tmp/nvflare/jobs-storage".

This is configured here: https://github.com/NVIDIA/NVFlare/blob/2.3.8/nvflare/lighter/impl/master_template.yml#L200-L220

In your case, where you want to "run two NVIDIA FLARE servers on the same physical machine with the same FQDN name but with different associated ports", you need to modify the "project.yml" so that it reads a different version of "master_template.yml" before you provision the second server.
Otherwise, the two servers will use the same directory for job management, which is why you see the failure.

So for example, copy and modify the "master_template.yml" …
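
As a rough illustration of the copy-and-modify step, here is a minimal Python sketch. It assumes the only change needed is swapping the default job storage path for a server-specific one. The function names, the output file name and the replacement directory are hypothetical, and pointing "project.yml" at the new template is left out.

```python
from pathlib import Path

# Default job storage location mentioned above.
DEFAULT_JOB_STORAGE = "/tmp/nvflare/jobs-storage"


def retarget_job_storage(template_text: str, new_root: str,
                         old_root: str = DEFAULT_JOB_STORAGE) -> str:
    """Return the template text with the job storage path replaced.

    Hypothetical helper: it does plain text substitution and does not
    parse the YAML, so it only works if the path appears literally.
    """
    if old_root not in template_text:
        raise ValueError(f"{old_root!r} not found in template text")
    return template_text.replace(old_root, new_root)


def write_custom_template(src: Path, dst: Path, new_root: str) -> None:
    """Copy the template at src to dst with a different job storage path."""
    dst.write_text(retarget_job_storage(src.read_text(), new_root))


if __name__ == "__main__":
    # Hypothetical file names and directory; adjust to your own setup.
    source = Path("master_template.yml")
    if source.exists():
        write_custom_template(
            source,
            Path("master_template_server2.yml"),
            "/tmp/nvflare/server2/jobs-storage",
        )
```

Once the second server is provisioned from the modified copy, the two servers should no longer share a job directory. If the template defines other shared locations, they may need the same treatment.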

Answer selected by dima1997