Track model metadata #481
Comments
If we know the token limit for a specific model, we can build guardrails around it.
Some thoughts on this can be found here. cc @snopoke @stephherbers @bderenzi
Langfuse has a nice way of doing it, under the Tracing tab -> Models. This might serve as inspiration.
FYI, this is currently changing in Langfuse, as more detailed model price data is needed (cached/video/audio tokens); it will probably be released next week.
It would really help users if OCS knew a few things about the selected models, so that we can build guardrails that lower frustration and make the platform more robust. See this thread for an example where it would have been useful if OCS had known the model's token limit.
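For illustration, here is a minimal sketch of such a guardrail. This is not OCS code: the hard-coded limits dict, the function name, and the use of tiktoken for counting (which only covers OpenAI models) are all assumptions.

```python
import tiktoken

# Hypothetical metadata store: in OCS this would come from the stored
# model record rather than a hard-coded dict.
TOKEN_LIMITS = {"gpt-4o": 128_000, "gpt-3.5-turbo": 16_385}

def prompt_fits(model_name: str, prompt: str) -> bool:
    """Return True if the prompt fits within the model's context window."""
    limit = TOKEN_LIMITS.get(model_name)
    if limit is None:
        # Unknown model: no limit recorded, so no guardrail can be applied.
        return True
    encoding = tiktoken.encoding_for_model(model_name)
    return len(encoding.encode(prompt)) <= limit
```

With something like this in place, OCS could warn the user or truncate history before the request fails, instead of surfacing a provider error after the fact.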
Model metadata to track
Currently we store model token limits on the Experiment or the Pipeline Node, but the values are specific to the LLM model, so they should be stored along with the model name.
We must also allow users to create new 'models' via the UI, e.g. fine-tuned models.
The token limits should then be accessed through the LlmService objects.
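A rough sketch of what that could look like, assuming a Django-style ORM. The `LlmModel` fields and the `LlmService` accessor shown here are illustrative assumptions, not the actual OCS schema or interface:

```python
from django.db import models

class LlmModel(models.Model):
    """Per-model metadata, stored once per model rather than on each
    Experiment or Pipeline Node. Field names are illustrative."""
    name = models.CharField(max_length=255, unique=True)  # e.g. "gpt-4o" or a fine-tuned model id
    token_limit = models.PositiveIntegerField(null=True, blank=True)
    user_created = models.BooleanField(default=False)  # True for models added via the UI

class LlmService:
    """Sketch of how an LlmService could expose the stored limit."""
    def __init__(self, model_name: str):
        self.model_name = model_name

    def get_token_limit(self) -> int | None:
        record = LlmModel.objects.filter(name=self.model_name).first()
        return record.token_limit if record else None
```

Keeping the limit nullable means user-created fine-tuned models can be added via the UI even when their limits are unknown, and callers can fall back to existing behaviour in that case.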