-
Notifications
You must be signed in to change notification settings - Fork 3
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
format markdown and remove redundant files
- Loading branch information
1 parent
c03f655
commit 47d0f4a
Showing
12 changed files
with
142 additions
and
177 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,10 +1,12 @@ | ||
## Run Mistral on K3s with vLLM from prem-operator source code | ||
# Run Mistral on K3s with vLLM from prem-operator source code | ||
|
||
## Overview | ||
|
||
### Overview | ||
This guide will help you to run Mistral on K3s with vLLM from prem-operator source code. | ||
Ubuntu 22.04.3 LTS was used as a host OS. | ||
|
||
### Prerequisites | ||
## Prerequisites | ||
|
||
- K3s cluster | ||
- Helm | ||
- Git | ||
|
@@ -14,61 +16,52 @@ Ubuntu 22.04.3 LTS was used as a host OS. | |
- K9s(optional) | ||
- Nvidia GPU Operator | ||
|
||
### Steps | ||
1. Install K3s cluster | ||
## Steps | ||
|
||
```bash | ||
# 1. Install K3s cluster | ||
./../scripts/install_k3s.sh | ||
``` | ||
2. Install Helm | ||
```bash | ||
|
||
# 2. Install Helm | ||
./.../scripts/install_helm.sh | ||
``` | ||
3. Install Nvidia GPU Operator | ||
```bash | ||
|
||
# 3. Install Nvidia GPU Operator | ||
./../scripts/install_gpu_operator_k3s.sh | ||
``` | ||
4. Install tools: make, curl, jq | ||
```bash | ||
|
||
# 4. Install tools: make, curl, jq | ||
./../scripts/install_make_curl_jq.sh | ||
``` | ||
5. Install Go | ||
```bash | ||
|
||
# 5. Install Go | ||
./../scripts/install_go.sh | ||
``` | ||
6. Install Docker | ||
```bash | ||
|
||
# 6. Install Docker | ||
./../scripts/install_docker.sh | ||
``` | ||
7. Install K9s(optional) | ||
```bash | ||
|
||
# 7. Install K9s(optional) | ||
./../scripts/install_k9s.sh | ||
``` | ||
8. Clone prem-operator repository | ||
```bash | ||
|
||
# 8. Clone prem-operator repository | ||
git clone [email protected]:premAI-io/prem-operator.git | ||
``` | ||
9. Deploy AIDeployment CRD | ||
```bash | ||
|
||
# 9. Deploy AIDeployment CRD | ||
sudo make install | ||
``` | ||
10. Build prem-operator Docker image | ||
```bash | ||
|
||
# 10. Build prem-operator Docker image | ||
sudo make docker-build | ||
``` | ||
11. Load Docker image to K3s cluster | ||
```bash | ||
|
||
# 11. Load Docker image to K3s cluster | ||
sudo docker save -o ./controller controller:latest | ||
sudo k3s ctr images import controller | ||
``` | ||
12. Deploy prem-operator | ||
```bash | ||
|
||
# 12. Deploy prem-operator | ||
sudo make deploy | ||
``` | ||
13. Deploy vLLM | ||
```bash | ||
|
||
# 13. Deploy vLLM | ||
sudo kubectl apply -f ./../examples/vllm.yaml | ||
``` | ||
14. Send request to vLLM | ||
|
||
### Send request to vLLM using curl and process the response with jq | ||
|
||
```bash | ||
curl -X 'POST' http://vllm.127.0.0.1.nip.io/v1/completions \ | ||
-H 'accept: application/json' \ | ||
|
Oops, something went wrong.