Releases: guinmoon/LLMFarm
v0.1.6
Changelog
- Add gpt2 inference
- Add replit inference
- Add support for k_quants
- Add chat reload button
- Add updated gpt_neox
- Fixed custom prompt format
  For example, for ORCA it is: ### User:\n{{prompt}}\n\n### Response:\n
- Fixed RedPajama
- Fixed context params on load
- Fixed memory leak on reload model
- Fixed autoscroll in message view
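
To illustrate the custom prompt format entry above, here is a minimal sketch of how a template with a `{{prompt}}` placeholder might be expanded. It is not LLMFarm's actual code: the function name `apply_prompt_format` is hypothetical, and it assumes the `\n` sequences in the ORCA example stand for newlines.

```python
# Illustrative only: the helper name is hypothetical and this is not
# LLMFarm's implementation. The template is the ORCA example from the
# changelog, with "\n" assumed to mean a newline character.
ORCA_TEMPLATE = "### User:\n{{prompt}}\n\n### Response:\n"


def apply_prompt_format(template: str, user_input: str) -> str:
    """Replace the {{prompt}} placeholder with the user's message."""
    return template.replace("{{prompt}}", user_input)


if __name__ == "__main__":
    # Show the final text that would be passed to the model.
    print(apply_prompt_format(ORCA_TEMPLATE, "Hello"))
```

The model then continues generating after the trailing `### Response:` line, which is why the template ends with a newline rather than with any answer text.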