We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
We've implemented the weight update API for RLHF pipeline:
api:
sglang/python/sglang/srt/server.py
Lines 214 to 239 in 21e9e63
test case / usage:
https://github.com/sgl-project/sglang/blob/main/test/srt/test_update_weights_from_distributed.py
We will integrated SGLang into OpenRLHF this week. Here is the data for our accuracy and speed test.
https://huggingface.co/datasets/OpenRLHF/prompt-collection-v0.1-dev-rand5k
https://huggingface.co/datasets/OpenRLHF/prompt-collection-v0.1-dev-100k
Typically, 50K data requires several hours.
See above.
The text was updated successfully, but these errors were encountered:
zhaochenyang20
No branches or pull requests
Checklist
Motivation
We've implemented the weight update API for RLHF pipeline:
api:
sglang/python/sglang/srt/server.py
Lines 214 to 239 in 21e9e63
test case / usage:
https://github.com/sgl-project/sglang/blob/main/test/srt/test_update_weights_from_distributed.py
We will integrated SGLang into OpenRLHF this week. Here is the data for our accuracy and speed test.
https://huggingface.co/datasets/OpenRLHF/prompt-collection-v0.1-dev-rand5k
https://huggingface.co/datasets/OpenRLHF/prompt-collection-v0.1-dev-100k
Typically, 50K data requires several hours.
Related resources
See above.
The text was updated successfully, but these errors were encountered: