Releases: guinmoon/LLMFarm
v0.9.5
v0.9.2
Changes:
- Added the ability to specify a system prompt, which is added to the text of the first message in the session. See the FAQ.
- Added ability to clone chat (without message history)
- Chats are sorted by last modification date
- The clear chat history button is now located on the toolbar.
- Both {prompt} and {{prompt}} placeholders can now be used in templates
- Fixed a bug with displaying already deleted chats and models
- Fixed some other bugs
- Templates have been updated
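The dual placeholder syntax mentioned above can be illustrated with a minimal sketch (Python used purely for illustration; `apply_template` is a hypothetical helper, not LLMFarm's actual Swift code):

```python
# Illustrative sketch: filling a chat template that may use either the
# {prompt} or the {{prompt}} placeholder form.
def apply_template(template: str, prompt: str) -> str:
    # Replace the double-brace form first so "{{prompt}}" is never
    # partially matched by the single-brace replacement.
    return template.replace("{{prompt}}", prompt).replace("{prompt}", prompt)

print(apply_template("### Instruction:\n{{prompt}}\n### Response:\n", "Hello"))
print(apply_template("### Instruction:\n{prompt}\n### Response:\n", "Hello"))
```

Both calls produce the same filled-in template, so either placeholder spelling works interchangeably.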
v0.9.0
v0.8.1
Changes:
- fixed autoscroll
** Metal support temporarily disabled for GPT-2 models
** More about LoRA here https://github.com/guinmoon/LLMFarm/blob/main/lora.md
v0.8.0
Changes:
- llama.cpp updated to b1601
- added support for StableLM-3b-4e1t models
- added support for Qwen models
- added the possibility to merge LoRA with the model
- added merge and train LoRA progress bar
- added the possibility to save user templates
- added multiline input
- fixed many other errors
** Metal support temporarily disabled for GPT-2 models
** More about LoRA here https://github.com/guinmoon/LLMFarm/blob/main/lora.md
v0.7.5
Changes:
- added LoRA training support (experimental)
- added options to add BOS/EOS tokens to the beginning/end of the prompt
- added an option to handle special tokens
- added a model loading indicator
- added a tokens/sec indicator on messages
- fixed some errors
** Due to high RAM consumption, LoRA training on iPhone is possible only on Pro models. It is recommended to use q8_0 quantization for LoRA training.
v0.7.0.1
v0.7.0
Changes:
- llama.cpp updated to b1396
- added support for MPT models
- added support for Bloom models
- added Metal support for q5_0 and q5_1 quantization
- GPT-2 now has Metal support
- LoRA adapter support (more about LoRA here: https://github.com/guinmoon/LLMFarm/blob/main/lora.md)
- fixed Mirostat for non-llama models
- fixed premature completion of predictions
- fixed many other errors