2.1.2

@jondurbin released this 23 Aug 14:01
· 43 commits to main since this release

LMoE: HF inference (better quality, slower) and vLLM inference (much faster, but much lower quality for some adapters)