How to deploy InternLM-20B-4bit as a service with LMDeploy #451
vansin started this conversation in Show and tell
Replies: 4 comments 7 replies
-
Follow the steps below to quickly deploy InternLM-20B-Chat as a service and chat with the model online. (A quick console sanity check you can run between steps 3 and 4 follows the list.)

Step 1: install lmdeploy

```shell
pip install 'lmdeploy>=0.0.9'
```

Step 2: download the InternLM-20B-4bit model

```shell
git-lfs install
git clone https://huggingface.co/internlm/internlm-chat-20b
```

Step 3: convert the model weight format

```shell
python3 -m lmdeploy.serve.turbomind.deploy internlm-chat \
  --model-path ./internlm-chat-20b
```

Step 4: start the gradio service

```shell
python3 -m lmdeploy.serve.gradio.app ./workspace --server_name {ip_addr} --server_port {port}
```
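Before exposing a web UI, it can help to confirm that the converted weights load and respond. The command below is a sketch assuming the `lmdeploy.turbomind.chat` console entry point from the lmdeploy 0.0.x series; the module path may differ in other releases.

```shell
# Sanity check (assumes the lmdeploy 0.0.x module layout):
# start an interactive console chat against the converted
# weights in ./workspace.
python3 -m lmdeploy.turbomind.chat ./workspace
```

If it responds sensibly, start the gradio app (step 4) and open http://{ip_addr}:{port} in a browser.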
0 replies
-
```shell
cd internlm-chat-20b
```
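If the clone from step 2 left only LFS pointer stubs (a common failure when git-lfs is installed after cloning), the weight files show up as tiny text files. The check below uses standard git-lfs commands; the pytorch_model-*.bin shard naming is an assumption about this repo, not something confirmed in the thread.

```shell
# Fetch any LFS objects that were skipped during the clone.
git lfs pull
# Real weight shards are multi-gigabyte; ~130-byte files are pointer stubs.
# (Shard naming pytorch_model-*.bin is assumed, not confirmed.)
ls -lh pytorch_model-*.bin
```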
0 replies
-
I hit this error during quantized deployment — can anyone tell me why?
6 replies
-
With 24 GB of VRAM it seems I can't run the 4-bit quantized model? Or is there a parameter somewhere that needs to be set?
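Most of the memory beyond the weights goes to the KV cache, which TurboMind sizes from its config. The sketch below is an assumption about the lmdeploy 0.0.x workspace layout — the config.ini path and the cache_max_entry_count / max_batch_size keys are not confirmed by this thread; shrinking those values is one way to try fitting a 24 GB card.

```shell
# Assumed lmdeploy 0.0.x workspace layout; adjust if yours differs.
CFG=./workspace/triton_models/weights/config.ini
# Fewer cached sessions and a smaller batch shrink the KV cache.
sed -i 's/^cache_max_entry_count = .*/cache_max_entry_count = 1/' "$CFG"
sed -i 's/^max_batch_size = .*/max_batch_size = 1/' "$CFG"
```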
1 reply