Replies: 1 comment
-
Try one of the quantized deployment options (llama.cpp, transformers, etc.).
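For a rough sense of why quantization helps on an 8 GB card, here is a back-of-the-envelope sketch of the memory needed for the weights alone. It assumes "13B" means about 13 billion parameters and ignores the KV cache, activations, and framework overhead, so treat the numbers as a lower bound. The function name `weight_memory_gib` is made up for this illustration.

```python
def weight_memory_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in GiB."""
    return n_params * bits_per_weight / 8 / 2**30


# Assumption: "13B" is taken as 13 billion parameters.
N_PARAMS = 13e9

# Nominal bit widths; real quantized formats also store scales,
# so their effective bits per weight are somewhat higher.
for label, bits in [("fp16", 16), ("int8", 8), ("4-bit", 4)]:
    print(f"{label}: ~{weight_memory_gib(N_PARAMS, bits):.1f} GiB")
```

Under these assumptions, fp16 weights come to roughly 24 GiB and 8-bit to roughly 12 GiB, while 4-bit is around 6 GiB. That is why a 4-bit quantized 13B model is about the only variant that can fit on an 8 GB GPU, and even then with little headroom for context.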
-
How can I deploy this on Google Colab? My GPU only has 8 GB of memory, so running the 13B model is a struggle.