Example of distilling LLM knowledge using LoRA
We use the juancavallotti/multilingual-gec dataset from the Hugging Face Hub. It is a synthetic grammar error correction dataset.
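A quick way to pull the dataset and inspect its schema is a minimal sketch like the one below, using the `datasets` library; the `"train"` split name is an assumption, so check the dataset card on the Hub for the exact splits and fields.

```python
# Minimal sketch: load the dataset and inspect one example.
from datasets import load_dataset

ds = load_dataset("juancavallotti/multilingual-gec")

print(ds)              # available splits and sizes
print(ds["train"][0])  # field names of one example ("train" split assumed)
```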
Install PyTorch:
conda install pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia
Install the other dependencies:
pip install -r requirements.txt
All the steps to run the experiments are listed, in order, in scripts/run.sh. You can run them all at once with:
bash scripts/run.sh
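For reference, the core LoRA setup inside the fine-tuning step might look roughly like the sketch below, using `transformers` and `peft`. The base checkpoint name, target modules, and hyperparameters here are illustrative assumptions, not the repository's actual configuration; the real settings live in scripts/run.sh and the scripts it calls.

```python
# Illustrative LoRA fine-tuning setup (assumed names and hyperparameters).
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base_model = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"  # assumed base checkpoint
tokenizer = AutoTokenizer.from_pretrained(base_model)
model = AutoModelForCausalLM.from_pretrained(base_model)

# LoRA adapters on the attention projections; rank/alpha are common defaults.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```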
Exact-match rate between ground truth and prediction:
- Llama 2 70B: 42%
- Base TinyLlama: 11%
- Distilled TinyLlama: 31%
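The exact-match rate is simply the fraction of model outputs that are string-identical to the reference correction. A minimal sketch (stripping surrounding whitespace before comparing is an assumption about how near-ties are handled, not necessarily what the repo does):

```python
# Exact-match rate: share of predictions identical to the reference.
def exact_match_rate(predictions: list[str], references: list[str]) -> float:
    assert len(predictions) == len(references)
    hits = sum(p.strip() == r.strip() for p, r in zip(predictions, references))
    return hits / len(references)

# Usage:
# exact_match_rate(["The cat sat."], ["The cat sat."])  # -> 1.0
```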