Summary handler token limit retry #56

jasnoo · 2024-07-26T19:31:34Z

Description

Currently, feedback that is received via the Feedback Widget is stored in a Google Sheet. There are specific tabs that only contain feedback comments related to one conceptual webpage tab. For example, there is a maternity-timeline tab corresponding to all feedback received on the maternity timeline tool.

Previously, the summary handler would receive a http request that includes some URL. The handler determines what sheet to grab info from, pulls a specific hardcoded amount of rows from a tab google sheet, and attempt to summarize the comments from that data via chatGPT. The limitation here is that there is a max token for the model, so if the comment content exceeded the model's maximum context length, it would throw an error.

Approach

This has been improved in 3 ways:

Estimating the tokens in the comments via reduceCommentToNotExceedTokenLimit. A token can be estimated as ~4 characters, so there is a check to see if the comments exceed this. If so, the comments are reduced to be under the estimated limit. Note that the max tokens for the entire transaction is 16384, with 2000 being dedicated to the summary output (so 14384 for the input).
Individual comments are capped at 500 characters before being passed to the getSummary function. A very small percentage of comments exceed this - for instance in claim-status tab, it was about ~8/990 that exceeded this. Doing this helps allow for more comments to be considered and summarized
A retry mechanism has been added in getSummary so that an error is thrown due to exceeding the max context length/token count, the comment array will slice 50 comments on and retry again - up to 5 times.

Note: The summary handler is only utilized in an internal tool currently which I plan to update as well.

Steps to Test

This handler does not have complete tests, just some tests for the libs. I have added/updated them for now. npm test for tests.

I have also run example HTTP calls intentionally using comment content that would exceed the character limits, context limits, and also max out the retries and can see that it works as expected

…y mechanism

jasnoo added 3 commits July 26, 2024 13:40

updated to estimate tokens used for summarizing input to preduce retr…

adad11d

…y mechanism

updated chat.gpt tests

91f7341

made slice comment value into var

e6fbeeb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Summary handler token limit retry #56

Summary handler token limit retry #56

jasnoo commented Jul 26, 2024

Summary handler token limit retry #56

Are you sure you want to change the base?

Summary handler token limit retry #56

Conversation

jasnoo commented Jul 26, 2024

Description

Approach

Steps to Test