Breeze-LLM-server

This project uses FastAPI and the Breeze LLM model to provide an LLM endpoint, so that users can quickly deploy a Breeze LLM model on a server.

Model information

Hugging Face: https://huggingface.co/YC-Chen/Breeze-7B-Instruct-v1_0-GGUF

How to install

Install Anaconda (if Anaconda is already installed, skip to the next step):

# 1. Download Anaconda
wget -P /tmp https://repo.anaconda.com/archive/Anaconda3-2020.02-Linux-x86_64.sh
# 2. Install Anaconda
bash /tmp/Anaconda3-2020.02-Linux-x86_64.sh

Run the following commands to set up the environment:

# 1. Clone the repository
git clone https://github.com/OscarWei61/Breeze-LLM-server.git
cd Breeze-LLM-server

Note: Python 3.7 may produce errors, whereas Python 3.9 works smoothly. The setup installs Python 3.9 by default.
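If you want to confirm which Python version an environment provides, the following is a minimal sketch. The helper name `version_ok` is hypothetical, and treating "3.9 or newer" as acceptable is an assumption: the note above only confirms that 3.9 works and 3.7 may not.

```shell
# Hypothetical helper: succeed if a version string such as "3.9.18"
# is at least 3.9. Assumption: anything >= 3.9 is acceptable.
version_ok() {
  major="${1%%.*}"      # text before the first dot, e.g. "3"
  rest="${1#*.}"        # text after the first dot, e.g. "9.18"
  minor="${rest%%.*}"   # text before the next dot, e.g. "9"
  [ "$major" -gt 3 ] || { [ "$major" -eq 3 ] && [ "$minor" -ge 9 ]; }
}

# Example (not run automatically):
#   version_ok "$(python -c 'import platform; print(platform.python_version())')" \
#     && echo "Python version looks fine" \
#     || echo "Python version is older than 3.9"
```

Run the check after activating the Conda environment so that `python` refers to the interpreter inside it.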

# 2. Create the Conda environment from requirements.txt
# By default, ctransformers is installed without GPU acceleration.
conda create --name Breeze --file ./requirements.txt

If you want ctransformers with CUDA GPU acceleration, uninstall ctransformers and reinstall it with:

pip install ctransformers[cuda]
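The choice between the two variants can be sketched as a small shell helper. This is only an illustration: the function name `ctransformers_requirement` is hypothetical, and using the presence of `nvidia-smi` on the PATH as a sign of a usable CUDA GPU is a heuristic assumption, not a guarantee.

```shell
# Hypothetical helper: print the pip requirement to install.
# Assumption: if nvidia-smi is on PATH, an NVIDIA driver is present.
ctransformers_requirement() {
  if command -v nvidia-smi >/dev/null 2>&1; then
    echo "ctransformers[cuda]"
  else
    echo "ctransformers"
  fi
}

# Example (not run automatically):
#   pip uninstall -y ctransformers
#   pip install "$(ctransformers_requirement)"
```

Quoting the requirement, as in the example, also keeps shells such as zsh from interpreting the square brackets.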

How to run the program

# 1. Activate the Conda environment
conda activate Breeze
# 2. Start the FastAPI server
python main.py

Reminder: Before closing your connection to the server machine, stop the FastAPI server by pressing Ctrl+C. If the server is still running afterwards, use the "kill" command to stop it.
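If Ctrl+C is no longer possible (for example, the terminal session was lost), the "kill" step might look like the sketch below. The helper name `stop_process` and the five-second grace period are assumptions for illustration; the pattern passed to `pgrep` assumes the server was started with `python main.py` as shown above.

```shell
# Hypothetical helper: ask a process to exit (SIGTERM), wait up to
# about 5 seconds, then force it to stop (SIGKILL) if it is still alive.
stop_process() {
  pid="$1"
  kill "$pid" 2>/dev/null || { echo "no process with PID $pid"; return 1; }
  i=0
  while kill -0 "$pid" 2>/dev/null && [ "$i" -lt 5 ]; do
    sleep 1
    i=$((i + 1))
  done
  if kill -0 "$pid" 2>/dev/null; then
    kill -9 "$pid" 2>/dev/null
  fi
  return 0
}

# Example (not run automatically):
#   pgrep -f "python main.py"     # list matching PIDs
#   stop_process <PID>            # replace <PID> with one of them
```

Trying SIGTERM first gives the server a chance to shut down cleanly; SIGKILL is only a fallback.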
