LLM-Distillery

LLM-Distillery is a pipeline for distilling one or more teacher models into a single student model.

Main features:

  • Single- and multi-teacher distillation
  • Distillation on both instruct and completion text
  • Offline distillation: the dataset is collected first, and only then does training start (yes, you can share the collected datasets); a minimal sketch of the idea follows this list
  • Windows and Linux support
  • Automatic HDF5 dataset synchronization, with collection resuming after a force-exit
  • Lots of knobs to tweak, from the distillation temperature to the device-mapping strategy
  • And a lot more!
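
The offline workflow splits distillation into two passes: a collection pass that caches the teacher's logits in an HDF5 file, and a training pass that never touches the teacher again. The sketch below only illustrates that pattern and is not this project's actual code; the collect/train helpers, file layout, loss, and temperature value are all assumed for the example.

```python
# Minimal sketch of offline distillation, assuming PyTorch + h5py.
# Function names, file layout, and hyperparameters are illustrative;
# this is NOT LLM-Distillery's actual API.
import h5py
import torch
import torch.nn.functional as F

TEMPERATURE = 2.0  # assumed softening temperature

def collect(teacher, loader, path="distill_data.h5"):
    """Pass 1: run the teacher once, cache inputs and logits on disk."""
    teacher.eval()
    with h5py.File(path, "w") as f, torch.no_grad():
        for i, (x, _) in enumerate(loader):  # loader yields (inputs, labels)
            logits = teacher(x)
            f.create_dataset(f"inputs/{i}", data=x.cpu().numpy())
            f.create_dataset(f"logits/{i}", data=logits.cpu().numpy())

def train(student, optimizer, path="distill_data.h5"):
    """Pass 2: train the student against the cached logits only."""
    with h5py.File(path, "r") as f:
        for i in range(len(f["inputs"])):
            x = torch.from_numpy(f[f"inputs/{i}"][:])
            t_logits = torch.from_numpy(f[f"logits/{i}"][:])
            s_logits = student(x)
            # Standard temperature-scaled KL distillation loss.
            loss = F.kl_div(
                F.log_softmax(s_logits / TEMPERATURE, dim=-1),
                F.softmax(t_logits / TEMPERATURE, dim=-1),
                reduction="batchmean",
            ) * TEMPERATURE**2
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
```

Because the teacher's outputs live on disk, the collected file can be shared and collection can resume after an interruption; the real pipeline handles that synchronization, which this sketch omits.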

Installation

See the Wiki for installation instructions.

Console UI

Demo video (1.mp4): a full run of TinyLlama 1.1B self-distillation, from the full fp16 model down to its 4-bit quantized version.
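
For context on what that demo sets up, the teacher/student pairing could be expressed with Hugging Face transformers and bitsandbytes roughly as below. The model id and config are illustrative assumptions rather than anything pinned by this repo, and training a 4-bit student in practice usually goes through adapters (e.g. QLoRA); the pipeline itself handles its own model loading.

```python
# Illustrative fp16 -> 4-bit self-distillation setup; the model id and
# quantization config are assumptions for the example, not this repo's code.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

MODEL_ID = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"  # assumed checkpoint

# Teacher: the model at full fp16 precision.
teacher = AutoModelForCausalLM.from_pretrained(
    MODEL_ID, torch_dtype=torch.float16
)

# Student: the same weights loaded 4-bit quantized, to be nudged back
# toward the fp16 teacher's output distribution.
student = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    quantization_config=BitsAndBytesConfig(load_in_4bit=True),
)
```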

Contributions

Big thanks to kalomaze for the help, and for keeping me sane while I was building this project!
Also, thanks to AlpinDale for providing access to compute during development!

If you want to contribute to this project, feel free!
Open issues when you encounter problems, and submit PRs whenever you like.
