FastChat Helm Chart

This Helm chart allows for a straightforward deployment of the FastChat platform on your Kubernetes Cluster. Designed for those aiming to train, serve, and evaluate large language model-based chatbots with ease, this Helm chart is your one-stop solution.

Prerequisites: Your Kubernetes Cluster should be compatible with GPU nodes.

Quick Links:

Quickstart

Clone the Repository

Start by cloning the repository to your local machine:
```
git clone https://github.com/gmougeolle/fastchat-helm
cd <repository-dir>
```
Replace <repository-dir> with the name of the directory.
Configure Your Deployment

Modify the values.yaml file to suit your deployment. This file contains configuration parameters like enabling the webserver, API, controller, and model worker, among others.
- Set up the web server, API, and controller:
  - Enable or disable with the enabled field.
  - Adjust replicas to specify the number of pod replicas.
  - Use extraParams to pass any additional parameters.
- For the model worker:
  - Specify the GPU brand with gpuBrand.
  - Define GPU limits using gpuLimit.
  - Determine the model path with modelPath.
- Set up the persistent volume claim (PVC) for models with storageRequest.
Deploy FastChat

With the Helm chart and values configured, deploy FastChat:
```
helm install fastchat-helm ./ -f values.yaml
```
Access the Deployment

If ingress is enabled in the values.yaml file, you can access the deployment using the specified hostname.

Values Overview:

Webserver: Configure the FastChat webserver settings.
API: Set up the FastChat API parameters.
Controller: Define the FastChat controller parameters.
ModelWorker: Specify the FastChat model worker settings, including GPU configurations.
PVC: Adjust the storage request for the models PVC.

Versioning:

Chart Version: 0.1.0
App Version: 0.1.0

Make sure to refer to the official FastChat GitHub Repository for any additional information or updates.

Note: Ensure that you have the necessary privileges to deploy on your Kubernetes Cluster and always test in a staging environment before deploying to production.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
templates		templates
.helmignore		.helmignore
Chart.yaml		Chart.yaml
README.md		README.md
values.yaml		values.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FastChat Helm Chart

Quick Links:

Quickstart

Values Overview:

Versioning:

About

Releases

Packages

Languages

gmougeolle/fastchat-helm

Folders and files

Latest commit

History

Repository files navigation

FastChat Helm Chart

Quick Links:

Quickstart

Values Overview:

Versioning:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages