From 93f94baff80a671af5e66bb9cfa0a060b3d821f4 Mon Sep 17 00:00:00 2001 From: Trevor Royer Date: Mon, 6 Jan 2025 17:13:43 -0700 Subject: [PATCH] add replicas --- content/modules/ROOT/pages/02-vllm.adoc | 1 + 1 file changed, 1 insertion(+) diff --git a/content/modules/ROOT/pages/02-vllm.adoc b/content/modules/ROOT/pages/02-vllm.adoc index fd68a97..cbf4bd5 100644 --- a/content/modules/ROOT/pages/02-vllm.adoc +++ b/content/modules/ROOT/pages/02-vllm.adoc @@ -38,6 +38,7 @@ When you first create a Model Server in a Project you must select a `single-mode ---- Model deployment name: vllm Serving runtime: vLLM ServingRuntime for KServe +Number of model server replicas to deploy: 1 Model server size: Custom CPUs requested: 2 Cores CPUs limit: 4 Cores