[BUG] I can't run DeepSeek V3 using SGlang #596
Comments
Did you run the FP8 model on the 4x4 A100 cluster? Maybe try changing your model name to "default".
I also tried "default" as the model name. Same issue. One thing I noticed is that a few minutes later, all nodes shut down with a quantization error. But even with this model:
@vabatista I hit the same issue on A100 machines, that
@vabatista Did you use the official model repo files rather than the bf16 format? I failed to run the model on A100. My guess is that the A100 is not yet supported for running the official model.
My first try was to use the official model. Then I also tried unsloth/DeepSeek-V3-bf16.
Please ask the SGLang community for more help, thanks.
Describe the bug
When I run this code
I get 404 - Not Found. The API call is hitting the server:
To Reproduce
I run DeepSeek V3 in SGLang using this recipe (Docker version): https://github.com/sgl-project/sglang/blob/main/docs/backend/server_arguments.md
I'm using 4 cluster nodes with 4 NVIDIA A100 GPUs each. Here is the command:
On the other 3 hosts I change only the --node-rank parameter.
Expected behavior
Get the response using the API
Additional context
One strange behavior is that the server came up on the 3rd node, not on the master.
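Since the HTTP server appeared on a different node than expected, one way to narrow down a 404 is to check which host and path actually answer. The following is a minimal diagnostic sketch, not the command or client code from this report. The host names `node0` to `node3` are hypothetical, and the port `30000` and the paths `/health` and `/v1/models` are assumptions about a typical SGLang OpenAI-compatible deployment; adjust them to match your launch command.

```python
import urllib.error
import urllib.request


def build_probe_urls(hosts, port=30000, paths=("/health", "/v1/models")):
    """Return every host/path combination as a full URL.

    The default port and paths are assumptions, not values from the report.
    """
    return [f"http://{host}:{port}{path}" for host in hosts for path in paths]


def probe(url, timeout=5.0):
    """Return the HTTP status code for url, or a short description on failure."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status such as 404.
        return err.code
    except (urllib.error.URLError, OSError) as err:
        # Nothing is listening, the host is unknown, or the request timed out.
        return f"unreachable: {err}"


if __name__ == "__main__":
    # Hypothetical host names for the 4 nodes; replace with real addresses.
    for url in build_probe_urls(["node0", "node1", "node2", "node3"]):
        print(url, "->", probe(url))
```

A node that returns 200 on one path but 404 on the path your client uses suggests a wrong route or base URL in the client, while "unreachable" on the master would match the observation that the server started on another node.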