
Enable the use of self-hosted LLMs #36

Merged
merged 12 commits into fujitatomoya:rolling on Aug 8, 2024

Conversation

@artivis (Contributor) commented Aug 1, 2024

This PR introduces several fixes and features that enable the use of other, OpenAI-API-compatible LLMs, such as those served by the Ollama framework. A configuration sketch follows the list below.

Most notably:

  • Make use of the API endpoint
  • Add a --dry-run option to exec that prints the answer without executing it
  • Enable setting the system prompts through an env var & a CLI arg
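
For illustration, here is a minimal sketch of how an OpenAI-API-compatible client can be pointed at a locally served Ollama model. The endpoint URL, model name, and environment variable names are assumptions for this example, not necessarily what ros2ai reads internally; the openai Python package (>= 1.0) is assumed as the client library.

```python
# Minimal sketch: talk to a local Ollama server through its OpenAI-compatible API.
# The environment variable names and defaults below are illustrative assumptions.
import os

from openai import OpenAI

client = OpenAI(
    # Ollama exposes an OpenAI-compatible endpoint at this port by default;
    # any non-empty API key is accepted by local backends.
    base_url=os.environ.get("OPENAI_API_ENDPOINT", "http://localhost:11434/v1"),
    api_key=os.environ.get("OPENAI_API_KEY", "ollama"),
)

response = client.chat.completions.create(
    model=os.environ.get("OPENAI_MODEL_NAME", "llama3"),  # model name is an assumption
    messages=[
        {"role": "system", "content": "You are a ROS 2 assistant."},
        {"role": "user", "content": "How do I list active topics?"},
    ],
)
print(response.choices[0].message.content)
```

With a setup like this, the same client code can talk either to the hosted OpenAI API or to a self-hosted backend, depending only on the configured endpoint and model.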

@fujitatomoya (Owner) commented

@artivis thanks 👍 you are the 1st contributor 🥇 I will definitely look at the fix.

@fujitatomoya (Owner) previously approved these changes Aug 8, 2024 and left a comment

So this is actually part of #21.

What this is trying to do is to be agnostic of the OpenAI defaults and required settings, so that https://ollama.com/ can be configured and used as the backend AI system.

To do this, IMO we can take this fix, but there are things that need to be done:

  • Documentation update. All of the docs are currently related to OpenAI; the documentation needs to be more generic and cover the supported AI backend systems.
  • Refactor the code base to be agnostic of OpenAI-only assumptions. https://ollama.com/ can be a good example for this (see the sketch after this list).
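
As a rough illustration of what such a backend-agnostic configuration could look like, here is a hypothetical sketch. It is not the actual content of ros2ai/api/config.py; the environment variable names, defaults, and the BackendConfig structure are assumptions made for this example.

```python
# Hypothetical sketch of backend-agnostic configuration, not the actual
# ros2ai/api/config.py. Variable names and defaults are assumptions.
import os
from dataclasses import dataclass


@dataclass
class BackendConfig:
    api_endpoint: str   # base URL of any OpenAI-API-compatible server
    api_key: str        # dummy value is fine for local backends such as Ollama
    model: str
    system_prompt: str


def load_config() -> BackendConfig:
    """Read backend settings from the environment, falling back to OpenAI-style defaults."""
    return BackendConfig(
        api_endpoint=os.environ.get("OPENAI_API_ENDPOINT", "https://api.openai.com/v1"),
        api_key=os.environ.get("OPENAI_API_KEY", "none"),
        model=os.environ.get("OPENAI_MODEL_NAME", "gpt-4o"),
        system_prompt=os.environ.get(
            "ROS2AI_SYSTEM_PROMPT", "You are a ROS 2 expert assistant."
        ),
    )
```

The point is that nothing outside the default values is OpenAI-specific, so any server that speaks the OpenAI API (such as Ollama) can be dropped in by overriding the endpoint and model.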

Review threads:
  • ros2ai/api/config.py (resolved)
  • ros2ai/api/config.py (resolved)
  • ros2ai/verb/status.py (outdated, resolved)
@fujitatomoya (Owner) commented

@artivis I added a minor patch to fix the problem with the verification.sh script. After building and verifying all container images, I will take this in.

@fujitatomoya merged commit ad04c11 into fujitatomoya:rolling on Aug 8, 2024
4 checks passed
@artivis deleted the feat/ollama branch on August 9, 2024