
Use LLM APIs responses in token counting #5604

Merged: 11 commits into main from enyst/usage on Feb 23, 2025

Conversation

@enyst (Collaborator) commented Dec 14, 2024

End-user friendly description of the problem this fixes or functionality that this introduces

  • Include this change in the Release Notes. If checked, you must provide an end-user friendly description for your change below

Give a summary of what the PR does, explaining any non-trivial design decisions

This PR proposes to use the liteLLM token usage data explicitly.

  • track it in Metrics
  • add two utility methods to link it to events

Please note: technically we also have it in the tool_call_metadata optional field in Event, because we have the entire litellm ModelResponse there. This PR proposes to take it out of a ModelResponse, for a few reasons:

  • it's not always there: MessageActions, some FinishActions, and user actions don't have a tool. We can fill the data for those from elsewhere (e.g. tokenizer on the corresponding message)
  • a dedicated format, shaped the way we need it, seems easier to work with than the raw format litellm returns, though I could be wrong
  • we may want to remove the full ModelResponse from events sometime, it contains a lot we don't really need. Events are supposed to be sort of simple and higher level, while the technical details of an API response in litellm are quite complex and of a different nature.
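
For orientation, here is a minimal sketch of the shape this could take. The names below (TokenUsage, Metrics.add_token_usage, the cache_* fields) are illustrative assumptions, not necessarily the exact ones in the diff:

    # Illustrative sketch: record the token counts reported by the LLM API
    # (litellm's Usage object) per response_id, so they can later be linked
    # back to the events that produced them.
    from pydantic import BaseModel


    class TokenUsage(BaseModel):
        model: str = ''
        prompt_tokens: int = 0
        completion_tokens: int = 0
        cache_read_tokens: int = 0   # assumed field for prompt-cache hits
        cache_write_tokens: int = 0  # assumed field for prompt-cache writes
        response_id: str = ''


    class Metrics:
        def __init__(self) -> None:
            self.token_usages: list[TokenUsage] = []

        def add_token_usage(self, usage: TokenUsage) -> None:
            # Called once per completion, with values taken from the API
            # response's usage block rather than re-tokenizing locally.
            self.token_usages.append(usage)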

Part of #6707

Follow-up on #5550

Link of any specific issues this addresses
Fix #2947


To run this PR locally, use the following command:

docker run -it --rm \
  -p 3000:3000 \
  -v /var/run/docker.sock:/var/run/docker.sock \
  --add-host host.docker.internal:host-gateway \
  -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:9d3f77f-nikolaik \
  --name openhands-app-9d3f77f \
  docker.all-hands.dev/all-hands-ai/openhands:9d3f77f

@enyst enyst marked this pull request as draft December 14, 2024 20:25
@enyst (Collaborator, Author) commented Dec 15, 2024

@openhands-agent Read the diff of this PR, PR 5604. And add unit tests for the functionality we change or introduce.

Please make sure to explore the existing unit tests a bit to find out whether we already have appropriate test files for these tests; if not, create a new one.

@All-Hands-AI All-Hands-AI deleted a comment from openhands-agent Dec 15, 2024
@enyst enyst added the lint-fix label Dec 15, 2024
enyst pushed a commit to enyst/playground that referenced this pull request Dec 15, 2024
@enyst enyst added lint-fix and removed lint-fix labels Dec 15, 2024
@enyst enyst self-assigned this Dec 15, 2024
@enyst enyst added lint-fix and removed lint-fix labels Dec 15, 2024
@enyst enyst changed the title Make use of litellm 'Usage' data for token counting Use LLM APIs responses in token counting Dec 17, 2024
Several follow-up comments from @enyst and @openhands-agent were marked as outdated.

@All-Hands-AI All-Hands-AI deleted a comment from openhands-agent Jan 21, 2025
@All-Hands-AI All-Hands-AI deleted a comment from openhands-agent Jan 21, 2025
@All-Hands-AI All-Hands-AI deleted a comment from openhands-agent Jan 21, 2025
@All-Hands-AI All-Hands-AI deleted a comment from openhands-agent Jan 21, 2025
@mamoodi (Collaborator) commented Feb 10, 2025

@enyst is this still in flux?

Review thread on a diff excerpt:

    (
        usage
        for usage in metrics.tokens_usages
        if usage.response_id == response_id

@enyst (Collaborator, Author) commented:

This implementation ignores that model_response might have the token data and takes it from TokenUsage instead. We will likely always need response_id, but maybe not all the rest in model_response.
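
For context, a rough sketch of the kind of lookup under discussion; the helper name and the attribute names are assumptions for illustration, not the exact API added by this PR:

    def get_token_usage_for_response(metrics, response_id):
        # Return the usage record matching this LLM response id, or None if
        # the event (e.g. a plain user message) never produced an API response.
        return next(
            (
                usage
                for usage in metrics.token_usages
                if usage.response_id == response_id
            ),
            None,
        )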

@enyst enyst marked this pull request as ready for review February 22, 2025 20:20
@enyst enyst requested a review from csmith49 February 22, 2025 20:20
Review thread on a diff excerpt:

    @@ -17,18 +17,31 @@ class ResponseLatency(BaseModel):
        response_id: str


    class TokensUsage(BaseModel):
Reviewer (Collaborator):

Minor nit, but I'd expect this to be called TokenUsage instead of the pluralized form.

@enyst (Collaborator, Author) replied:

Hah, I felt the same, it reads strangely 😅; o3-mini wanted it for some mysterious reason. Fixed!

They are going to align humans to their preferences, aren't they? 😂

@csmith49 (Collaborator) left a review comment:

LGTM! Super useful to be able to grab token usage info from events after the fact. One question: we link token usage metrics to events by going through the response ID in the tool-call metadata. Do all events have that metadata, even if they're just a user message?

@enyst (Collaborator, Author) commented Feb 23, 2025

(replying to @csmith49's question above)

No, unfortunately we don't have it for MessageActions, FinishActions (at least those started by the user), and any user actions (like bash commands the user types in the UI terminal). One of the utility methods tries to find some info by looking back through history to the last event that had it. It would perhaps be better to fill it in ourselves from other sources (e.g. a tokenizer, or, for Anthropic, its API endpoint that returns token counts). 🤔
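
A rough sketch of the look-back fallback described here, assuming events may carry a tool_call_metadata field holding a litellm model_response; the names and attribute paths are illustrative:

    def find_last_token_usage(events, metrics):
        # Walk the event history backwards and return the most recent usage
        # record that can be linked through tool_call_metadata, or None.
        for event in reversed(events):
            metadata = getattr(event, 'tool_call_metadata', None)
            if metadata is None or metadata.model_response is None:
                continue
            # Assumption: the litellm ModelResponse exposes the provider's response id.
            response_id = metadata.model_response.id
            for usage in metrics.token_usages:
                if usage.response_id == response_id:
                    return usage
        return None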

@enyst enyst merged commit 2d2dbf1 into main Feb 23, 2025
15 checks passed
@enyst enyst deleted the enyst/usage branch February 23, 2025 16:58
Development

Successfully merging this pull request may close these issues:

Feat: make use of litellm's response "usage" data