-
Would highly recommend supporting MLX. It runs natively on Apple Silicon, is fast, and is likely to become the de facto inference engine for Apple M-series chips. It's come a long, long way. There's an entire community of models with great quantization options: mlx-community (MLX Community). It now even supports a native distributed mode, so you can run inference across multiple Mac devices: https://github.com/ml-explore/mlx-examples/blob/main/llms/mlx_lm/examples/pipeline_generate.py I'd be happy to help work on this if there's interest in something beyond Ollama.
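
To give a feel for the API, here's a rough sketch of single-machine generation with mlx-lm. The model name is just an example mlx-community checkpoint, and the exact call signatures may differ slightly between versions:

```python
# Rough sketch of single-machine generation with mlx-lm (pip install mlx-lm).
# The model name below is just an example mlx-community checkpoint.
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/Mistral-7B-Instruct-v0.3-4bit")

messages = [{"role": "user", "content": "Explain MLX in one sentence."}]
prompt = tokenizer.apply_chat_template(messages, add_generation_prompt=True)

# verbose=True also prints generation speed and memory usage
text = generate(model, tokenizer, prompt=prompt, max_tokens=256, verbose=True)
print(text)
```

The distributed pipeline example linked above builds on the same load/generate flow, just sharded across machines.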
-
Since Goose does not support LM Studio as an LLM provider, I built an Ollama proxy to convert your queries. It's working with MLX models. Check it out, hope it helps!
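
For anyone wondering what the proxy boils down to: it essentially translates Ollama-style /api/chat requests into the OpenAI-compatible /v1/chat/completions endpoint that LM Studio serves locally. This is not the actual proxy, just a rough, non-streaming sketch of the idea; the ports, paths, and field mapping are assumptions to adapt to your setup:

```python
# Rough sketch: accept Ollama-style /api/chat requests and forward them to
# LM Studio's OpenAI-compatible server (default http://localhost:1234).
# Non-streaming only; ports, paths, and field names are assumptions to adapt.
import requests
from flask import Flask, jsonify, request

app = Flask(__name__)
LMSTUDIO_URL = "http://localhost:1234/v1/chat/completions"

@app.post("/api/chat")
def chat():
    body = request.get_json(force=True)
    upstream = requests.post(
        LMSTUDIO_URL,
        json={
            "model": body.get("model", "local-model"),
            "messages": body.get("messages", []),
            "stream": False,
        },
        timeout=300,
    )
    upstream.raise_for_status()
    content = upstream.json()["choices"][0]["message"]["content"]
    # Reply in roughly the shape Ollama clients expect from /api/chat.
    return jsonify({
        "model": body.get("model", "local-model"),
        "message": {"role": "assistant", "content": content},
        "done": True,
    })

if __name__ == "__main__":
    app.run(host="127.0.0.1", port=11434)  # Ollama's default port
```

Point your Ollama client at this server and it should see LM Studio (and whatever MLX model it has loaded) as if it were Ollama.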
-
MLX is like Ollama, but only for ARM Macs, and it's faster than Ollama. Here are links for the project:
- Site
- GitHub
- Docs from Hugging Face
- Official examples