Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Role of ps and worker #2

Open
Abdus-Samee opened this issue Sep 19, 2023 · 2 comments
Open

Role of ps and worker #2

Abdus-Samee opened this issue Sep 19, 2023 · 2 comments

Comments

@Abdus-Samee
Copy link

Hey there!
Just wanted to clarify something regarding the code for ps and worker. I've recently started working with such distributed training, so pardon my silly queries.

As much as I've come to know, ps serve parameters to the workers while the later one fetches them. Aside from the difference in the tf_config, I've noticed no code for fetching/serving of parameters particularly dedicated to only ps or only workers. Both share the same code.

I wanted to know how are they coordinating with one another?

@Abdus-Samee
Copy link
Author

@18520339 I hope you look into the issue.

Thanks.

@18520339
Copy link
Owner

18520339 commented Sep 24, 2023

Hi, as far as I know, the ParameterServerStrategy employs an underlying communication protocol (like gRPC) to coordinate the variable updates and synchronization. So, when using that strategy, the coordination between ps and workers is handled behind the scenes by TF's runtime, and you don't need to write explicit code to fetch or serve parameters.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants