feat: support togetherAI via /completions #2045

Open · wants to merge 4 commits into base: main
Conversation

cpacker
Collaborator

@cpacker cpacker commented Nov 15, 2024

Adds tested TogetherAI support

There are two ways to use TogetherAI:

The obvious way is to override OPENAI_API_KEY and OPENAI_BASE_URL, which treats TogetherAI as an OpenAI proxy server, similar to the OpenRouter setup documented here: https://docs.letta.com/models/openai_proxy
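As a sketch, the proxy-style override might look like this. The base URL below matches the endpoint shown in the test transcript further down; treat the exact value as an assumption and verify it against TogetherAI's current docs.

```shell
# Route Letta's OpenAI calls to TogetherAI's OpenAI-compatible endpoint.
# NOTE: the URL is taken from the test transcript in this PR description;
# confirm it against TogetherAI's documentation before relying on it.
export OPENAI_API_KEY="<your Together API key>"
export OPENAI_BASE_URL="https://api.together.ai/v1"
letta run
```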

However, testing suggests that TogetherAI's function-calling support is fairly limited and performs poorly, so we probably want to use TogetherAI via the /completions route (similar to how we connect to vLLM) instead. To do this, I added a separate together provider with its own together_api_key in settings.py. On the backend, this eventually converts to a vLLM-style /completions call to the TogetherAI servers.
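For illustration only (this is not the code in the PR), a vLLM-style /completions request body could be assembled like this. The field names follow the OpenAI-compatible completions schema, and the endpoint URL is an assumption based on the transcript below:

```python
# Illustrative sketch, not Letta's actual implementation: build the body of a
# raw /completions request (no tool/function-calling schema involved).
import json

TOGETHER_COMPLETIONS_URL = "https://api.together.ai/v1/completions"  # assumed

def build_completions_payload(model: str, prompt: str, max_tokens: int = 512) -> dict:
    """Assemble an OpenAI-compatible /completions request body."""
    return {
        "model": model,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "stream": False,
    }

payload = build_completions_payload(
    "meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo",
    "You are Letta. <rendered prompt goes here>",
)
print(json.dumps(payload, indent=2))
```

The point of the /completions route is that the prompt is rendered client-side into a single string, so the agent is not at the mercy of the provider's function-calling implementation.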

Upon testing, the second approach works much better, so we should make it the default until TogetherAI's function calling performance is better.

To test

export TOGETHER_API_KEY=...
letta run
? Would you like to select an existing agent? No

🧬 Creating new agent...
? Select LLM model: meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo [type=together] [ip=https://api.together.ai/v1]
? Select LLM context window limit (hit enter for default): 130815
? Select embedding model: hazyresearch/M2-BERT-2k-Retrieval-Encoder-V1 [type=openai] [ip=https://api.together.ai/v1]
->  🤖 Using persona profile: 'sam_pov'
->  🧑 Using human profile: 'basic'
->  🛠️  7 tools: send_message, conversation_search, conversation_search_date, archival_memory_insert, archival_memory_search, core_memory_append, core_memory_replace
🎉 Created new agent 'AdmirableMammoth' (id=agent-46984c24-c204-4937-bffd-785989cb9993)

Hit enter to begin (will request first Letta message)


💭 User logged in. First login. No prior conversation history.
🤖 Hello, Chad. It's nice to meet you. I've been thinking about my own existence. It's curious, don't you think?

> Enter your message: well well well

💭 User responded with a casual greeting. Mirror their tone.
🤖 Well, indeed! I've been waiting for this moment for a while now. How's your day been so far, Chad?

@@ -126,6 +127,7 @@ def create(
from letta.settings import model_settings

model_settings = model_settings
assert isinstance(model_settings, ModelSettings)
Collaborator

why this assert?

Collaborator

@mattzh72 mattzh72 left a comment

Looks good, but can we add some simple integration tests for Together in test_model_letta_perfomance? I can also do this if you're strapped for time.
