Is it planned, or would it be possible, to use a local LLM for processing?
I see this as a way to significantly increase generation speed (given suitable hardware) and also to use the model offline.
There is no "plan" as such for this. However, the use of local LLMs has been in "thoughts" lately.
Regarding the speed, token generation with Mistral Nemo appears to take longer, yes. I have been contemplating to switch back to Mistral or at least provide it as an alternative.
Let me create some tasks toward this general direction.
Just added support for offline LLMs via Ollama. An environment variable needs to be set to access this mode. Detailed steps are available in the project description.
Let me know if you get a chance to try it out, and whether it works out.
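For anyone trying this out, here is a minimal sketch of what the offline path looks like once Ollama is running locally. The environment variable name below is a placeholder (the actual name is in the project description), and the example assumes `ollama serve` is running with a model already pulled (e.g. `ollama pull mistral`); it calls Ollama's default local HTTP endpoint directly.

```python
# Minimal sketch of generating text against a locally running Ollama server.
# Assumptions: `ollama serve` is running and "mistral" has been pulled.
# The environment variable name is a hypothetical placeholder; check the
# project README for the real one.
import json
import os
import urllib.request

os.environ.setdefault("USE_OLLAMA", "1")  # hypothetical flag to enable offline mode

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint

payload = json.dumps({
    "model": "mistral",          # any locally pulled model tag
    "prompt": "Say hello in one short sentence.",
    "stream": False,             # return the full response as a single JSON object
}).encode("utf-8")

request = urllib.request.Request(
    OLLAMA_URL,
    data=payload,
    headers={"Content-Type": "application/json"},
)

with urllib.request.urlopen(request) as response:
    body = json.loads(response.read().decode("utf-8"))

print(body["response"])  # the generated text
```

The main practical difference from the hosted setup is that latency and throughput now depend entirely on your local hardware, which is what makes the speed gains the original question mentions possible.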