Condenser for Browser Output Observations #6578
base: main
Conversation
Modulo the discussion on combining observation condensers (which we can take care of in a future PR when another bespoke observation masking strategy is needed), this looks good. Can you extend the unit tests in …?
Done!
Interesting to see condensers extended for particular use cases. @adityasoni9998 could you perhaps share a bit about how this PR was tested?
I'd love to know if @csmith49 is also okay with it.
I'm good to approve this pretty quickly, but it would be good to hear any data (anecdotal or otherwise) that this condenser is helpful in the stated context. @adityasoni9998 Have you managed to run the agent with this condenser and see "better" browsing behavior?
Hi @csmith49 and @enyst. I have not yet evaluated this condenser on downstream benchmarks, so I don't have quantitative metrics comparing runs with and without it. However, we have prior results showing that browsing agents work reasonably well without the accessibility trees and screenshots from previous steps (for example, the performance of VisualBrowsingAgent on VisualWebArena). browser-use takes a similar approach in its browsing agent, providing only the observations from the most recent action. Also, while evaluating the default CodeAct agent with full history on The Agent Company and GAIA, the agent struggles with large context sizes on longer trajectories that involve browser interactions, which leads to hallucinations/forgetting. In any case, using this condenser is an optional choice made by the user, and the default behaviour of CodeAct's browsing remains unchanged. Alternatively, once I have evaluated this condenser on more recent benchmarks, I can comment on this PR if that helps.
End-user friendly description of the problem this fixes or functionality that this introduces

Developed a condenser that allows the user to keep only the most recent attention_window browser outputs in the LLM's context.

Give a summary of what the PR does, explaining any non-trivial design decisions
Designed the BrowserOutputCondenser class for this functionality. This is helpful for long trajectories involving (possibly screenshot-based) web navigation, to avoid context-window-exceeded errors and to control inference cost. Previously implemented condensers do not allow masking a specific type of observation; since browser observations are generally very large, this might be helpful.

Link of any specific issues this addresses
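
As a rough illustration of the masking behaviour described above, here is a minimal, self-contained sketch. It is not the PR's actual code: the event types, the placeholder text, and the exact handling of attention_window are simplifying assumptions made for illustration.

```python
# Illustrative sketch only -- simplified stand-ins, not OpenHands' real event
# or condenser classes.
from dataclasses import dataclass


@dataclass
class Event:
    content: str


@dataclass
class BrowserOutput(Event):
    """Stand-in for a browser observation (accessibility tree, screenshot, ...)."""


class BrowserOutputCondenserSketch:
    """Keep only the most recent `attention_window` browser outputs.

    Older browser outputs are replaced by a short placeholder so the order
    and content of all other events are preserved.
    """

    def __init__(self, attention_window: int = 1) -> None:
        self.attention_window = attention_window

    def condense(self, events: list[Event]) -> list[Event]:
        total = sum(isinstance(e, BrowserOutput) for e in events)
        condensed: list[Event] = []
        seen = 0
        for event in events:
            if isinstance(event, BrowserOutput):
                seen += 1
                # Mask everything except the last `attention_window` browser outputs.
                if seen <= total - self.attention_window:
                    condensed.append(Event(content='<browser output masked>'))
                    continue
            condensed.append(event)
        return condensed


# Example: with attention_window=1 only the newest browser output is kept.
history = [
    Event('user asks a question'),
    BrowserOutput('page 1 accessibility tree'),
    Event('agent clicks a link'),
    BrowserOutput('page 2 accessibility tree'),
]
condensed = BrowserOutputCondenserSketch(attention_window=1).condense(history)
# -> the first BrowserOutput is replaced by '<browser output masked>',
#    everything else is passed through unchanged.
```

With attention_window=1 this mirrors the behaviour discussed in the review thread: only the newest browser output stays verbatim, which keeps the context small on long, browser-heavy trajectories.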