This repository contains the code and instructions to build a Dockerized Inference API for an LLM using Cog. For a detailed tutorial on building the Docker image and deploying it to AWS EC2, please refer to our blog. The LLM is mistral-7b fine-tuned on the style instruct dataset and named mistral-7b-style-instruct. Training code and instructions for the model can be found in the instruct-finetune-mistral repository; a detailed tutorial is available in our blog post.
- NVIDIA GPU with CUDA support.
- Docker installed.
- Cog installed.
- NVIDIA Container Toolkit installed (a quick sanity check for these is sketched below).
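If you want to verify the prerequisites before building, the following commands should all succeed. This is a minimal sketch; the CUDA base image tag is only an example, so substitute any tag you have available:

# Check tool versions
docker --version
cog --version
# Check that the host GPU and driver are visible
nvidia-smi
# Check that the NVIDIA Container Toolkit exposes the GPU to containers
docker run --rm --gpus all nvidia/cuda:12.2.0-base-ubuntu22.04 nvidia-smi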
To build the Docker image, run the following in the cloned directory:
cog build -t mistral-7b-style-instruct
This will build the Docker image with the name mistral-7b-style-instruct.
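You can confirm the image was created by listing it (the output format may vary with your Docker version):

docker images mistral-7b-style-instruct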
To run the Docker image with GPU access, run the following in the cloned directory:
docker run --gpus all -p 5000:5000 mistral-7b-style-instruct
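Once the container is up, you can check that the server is ready before sending predictions. Recent Cog versions serve a health-check route and an OpenAPI schema; the exact endpoints may vary with your Cog version:

# Returns the server status, e.g. {"status": "READY"} once the model is loaded
curl http://localhost:5000/health-check
# The full API schema (Cog's server is FastAPI-based)
curl http://localhost:5000/openapi.json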
To test the Inference API, you can use the following curl command:
curl http://localhost:5000/predictions -X POST -H "Content-Type: application/json" -d '{"input": {"prompt":"I am an athletic and 180cm tall man in my mid twenties, I have a rectangle shaped body with slightly broad shoulders and have a sleek, casual style. I usually prefer darker colors.", "event": "I am going to a wedding."}}'
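The exact fields depend on the Cog version, but a successful prediction returns a JSON object with the generated text under the "output" key, roughly of the form:

{"status": "succeeded", "output": "<generated outfit recommendation>", ...}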
Or you can use the following Python code:
import requests

# Cog's prediction endpoint expects model inputs under the "input" key
url = "http://localhost:5000/predictions"
data = {"input": {"prompt": "I am an athletic and 180cm tall man in my mid twenties, I have a rectangle shaped body with slightly broad shoulders and have a sleek, casual style. I usually prefer darker colors.", "event": "I am going to a wedding."}}

response = requests.post(url, json=data)
print(response.json())
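For scripts, you may want a slightly more defensive variant. The helper below is a hypothetical sketch (get_outfit_recommendation is not part of this repository) that adds a timeout, raises on HTTP errors, and extracts the "output" field from Cog's prediction object:

import requests

def get_outfit_recommendation(prompt, event, url="http://localhost:5000/predictions", timeout=300):
    # Model inputs go under the "input" key, per Cog's prediction format
    payload = {"input": {"prompt": prompt, "event": event}}
    response = requests.post(url, json=payload, timeout=timeout)
    response.raise_for_status()  # raise on non-2xx responses
    # Cog wraps the result in a prediction object; the generation is under "output"
    return response.json().get("output")

print(get_outfit_recommendation(
    "I have a rectangle shaped body and prefer darker colors.",
    "I am going to a wedding.",
))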
From neuralwork with ❤️