Will "Prequant BitsAndBytes models with TP" be supported? #10117
Answered by chenqianfzh
congwupiece asked this question in Q&A
When serving a BitsAndBytes model with --tensor-parallel-size N
Answered by
chenqianfzh
Jan 20, 2025
Replies: 2 comments 1 reply
Getting the same error. Btw, what is PP?
@chenqianfzh What difficulties are there with this support?
1 reply
Michael and I discussed the lack of TP support for prequantized bnb models in PR #8434. We agreed that PP (pipeline parallelism) is a more reasonable choice than TP (tensor parallelism).
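
For anyone wanting to try PP in place of TP, here is a minimal sketch of how the serve command might be assembled. It assumes the thread is about the `vllm serve` CLI and that `--quantization bitsandbytes`, `--load-format bitsandbytes`, and `--pipeline-parallel-size` are accepted by the installed version. None of that is confirmed in this thread, and the model name is a hypothetical placeholder.

```python
import shlex


def build_serve_command(model: str, pipeline_parallel_size: int) -> list[str]:
    """Build an argv list that uses pipeline parallelism instead of tensor parallelism.

    Assumption: the flags below match the installed `vllm serve` CLI.
    """
    if pipeline_parallel_size < 1:
        raise ValueError("pipeline_parallel_size must be >= 1")
    return [
        "vllm", "serve", model,
        "--quantization", "bitsandbytes",   # assumed flag for bnb models
        "--load-format", "bitsandbytes",    # assumed flag; may be optional on newer versions
        "--pipeline-parallel-size", str(pipeline_parallel_size),
    ]


if __name__ == "__main__":
    # "my-org/my-bnb-4bit-model" is a hypothetical prequantized checkpoint name.
    cmd = build_serve_command("my-org/my-bnb-4bit-model", 2)
    print(shlex.join(cmd))
```

The helper only builds the command; it does not launch anything. Whether PP actually works with a given prequantized bnb checkpoint depends on the version and model, so treat this as a starting point rather than a confirmed recipe.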