04 Sep 01:29

github-actions

eb2fcda

v0.7.0

Requires at least b3606 llama.cpp release.

Breaking Changes

Adjusted to handle breaking changes in llama.cpp /health endpoint: ggml-org/llama.cpp#9056

Instead of using the /health endpoint to monitor slot statuses, starting from this version, Paddler uses the /slots endpoint to monitor llama.cpp instances.
Paddler's /health endpoint remains unchanged.

Assets 6

13 Aug 09:54

github-actions

v0.6.0

84784c8

v0.6.0

Latest supported llama.cpp release: b3604

Features

Assign names to Paddler agents (https://github.com/distantmagic/paddler/discussions/12)

Fixes

Agent host formatting in dashboard

Assets 6

12 Aug 12:18

github-actions

v0.6.0-rc1

4282c51

v0.6.0-rc1 Pre-release

Pre-release

Features

Assign names to Paddler agents (#15)

Assets 6

17 Jul 21:31

github-actions

v0.5.0

6874008

v0.5.0

Fixes

Management server crashed in some scenarios due to concurrency issues

Assets 6

16 Jul 12:33

github-actions

v0.4.0

da8943a

v0.4.0

Thank you, @ScottMcNaught, for the help with debugging the issues! :)

Fixes

OpenAI compatible endpoint is now properly balanced (/v1/chat/completions)
Balancer's reverse proxy panicked in some scenarios when the underlying llama.cpp instance was abruptly closed during the generation of completion tokens
Added mutex in the targets collection for better internal slots data integrity

Contributors

ScottMcNaught

Assets 6

27 Jun 21:08

github-actions

v0.3.0

4ebbfcd

v0.3.0

Features

Requests can queue when all llama.cpp instances are busy
AWS Metadata support for agent local IP address
StatsD metrics support

Assets 7

01 Jun 23:20

github-actions

v0.1.0

b61191f

v0.1.0

Aggregated Health Status Responses

Paddler aggregates all the underlying llama.cpp health statuses. When you check the /health endpoint, it reports aggregated results, making it a drop-in replacement for the llama.cpp server itself (in a sense that you can start making requests to Paddler instead of llama.cpp and things will work the same way).

Assets 3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Breaking Changes

Features

Fixes

Features

Fixes

Fixes

Contributors

Features

Aggregated Health Status Responses

Releases: distantmagic/paddler

v0.7.0

Breaking Changes

v0.6.0

Features

Fixes

v0.6.0-rc1

Features

v0.5.0

Fixes

v0.4.0

Fixes

Contributors

v0.3.0

Features

v0.1.0

Aggregated Health Status Responses