fix: allow to continue when the agent is stuck in interactive mode #5597

enyst · 2024-12-14T16:58:13Z

End-user friendly description of the problem this fixes or functionality that this introduces

Include this change in the Release Notes. If checked, you must provide an end-user friendly description for your change below
Let user continue when the agent gets stuck in a loop (UI-only).

Changes:

Clean up:
- Remove unused almost_stuck field and related code
- Simplify stuck detection logic
Improve stuck detection:
- Use headless_mode to determine behavior
- In interactive mode: only consider history after last user message
- In headless mode: keep existing behavior (full history)
Optimizations:
- Use reversed() to find last user message
- Elegant filtering that works in both modes:
  - In headless: actively filters user messages
  - In interactive: no-op (already sliced after last user message)
Tests:
- Add tests for both modes
- Verify behavior before/after user messages
- Maintain backward compatibility

To run this PR locally, use the following command:

docker run -it --rm   -p 3000:3000   -v /var/run/docker.sock:/var/run/docker.sock   --add-host host.docker.internal:host-gateway   -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:63cb544-nikolaik   --name openhands-app-63cb544   docker.all-hands.dev/all-hands-ai/openhands:63cb544

- Remove almost_stuck field from State class - Remove almost_stuck counter from StuckDetector - Simplify stuck detection logic to focus on actual loop detection - Update tests to remove almost_stuck assertions

- Add UI mode awareness to stuck detection - Only consider history after last user message in UI mode - Keep existing behavior in headless mode - Add comprehensive tests for both modes Fix: #5480

- Use headless_mode flag to determine stuck detection behavior - In interactive mode (not headless), only consider history after last user message - Keep existing behavior in headless mode - Add comprehensive tests for both modes Fix: #5480

openhands/controller/stuck.py

- Use not_headless parameter to match AgentController's headless_mode - Remove unnecessary interactive_mode concept - Update tests to use consistent terminology - Keep behavior the same, just clearer naming

openhands/controller/stuck.py

- Use headless_mode parameter to match AgentController - Remove confusing double negative (not_headless) - Keep behavior the same, just clearer naming

openhands/controller/agent_controller.py

tests/unit/test_is_stuck.py

openhands/controller/stuck.py

- Use reversed() to find last user message - Stop searching once found (break) - Same behavior, just more efficient

The same filter works perfectly in both modes: - In headless: actively filters user messages - In non-headless: no-op (already sliced after last user message)

tests/unit/test_is_stuck.py

xingyaoww

LGTM! Thanks for the fix!

github-actions · 2024-12-14T19:40:28Z

Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly.

github-actions · 2024-12-14T19:46:35Z

Trigger by: Pull Request (integration-test label on PR #5597)
Commit: 1e3eaa8
Integration Tests Report (Haiku)
Haiku LLM Test Results:
Success rate: 100.00% (6/6)

Total cost: USD 0.00

instance_id	success	reason	cost
t04_git_staging	True		0
t05_simple_browsing	True		0
t01_fix_simple_typo	True		0
t03_jupyter_write_file	True		0
t06_github_pr_browsing	True		0
t02_add_bash_hello	True		0

Integration Tests Report (DeepSeek)
DeepSeek LLM Test Results:
Success rate: 83.33% (5/6)

Total cost: USD 0.00

instance_id	success	reason
t04_git_staging	True
t05_simple_browsing	False	The answer is not found in any message. Total messages: 2.
t02_add_bash_hello	True
t01_fix_simple_typo	True
t03_jupyter_write_file	True
t06_github_pr_browsing	True

Download testing outputs (includes both Haiku and DeepSeek results): Download

openhands-agent added 3 commits December 14, 2024 16:01

refactor: remove unused almost_stuck functionality

212787c

- Remove almost_stuck field from State class - Remove almost_stuck counter from StuckDetector - Simplify stuck detection logic to focus on actual loop detection - Update tests to remove almost_stuck assertions

fix: improve stuck detection in UI mode

1e739bd

- Add UI mode awareness to stuck detection - Only consider history after last user message in UI mode - Keep existing behavior in headless mode - Add comprehensive tests for both modes Fix: #5480

enyst commented Dec 14, 2024

View reviewed changes

openhands/controller/stuck.py Show resolved Hide resolved

refactor: simplify stuck detection to use headless_mode directly

8589b0a

- Use not_headless parameter to match AgentController's headless_mode - Remove unnecessary interactive_mode concept - Update tests to use consistent terminology - Keep behavior the same, just clearer naming

enyst force-pushed the fix-stuck-loop-recovery-simple branch from 56167cc to 8589b0a Compare December 14, 2024 17:11

enyst commented Dec 14, 2024

View reviewed changes

openhands/controller/stuck.py Show resolved Hide resolved

enyst and others added 2 commits December 14, 2024 18:15

Update openhands/controller/stuck.py

9f17d10

refactor: use headless_mode consistently

0f6c731

- Use headless_mode parameter to match AgentController - Remove confusing double negative (not_headless) - Keep behavior the same, just clearer naming

enyst commented Dec 14, 2024

View reviewed changes

openhands/controller/agent_controller.py Outdated Show resolved Hide resolved

Update openhands/controller/agent_controller.py

860d3e2

enyst commented Dec 14, 2024

View reviewed changes

tests/unit/test_is_stuck.py Outdated Show resolved Hide resolved

Update tests/unit/test_is_stuck.py

b0616e3

enyst commented Dec 14, 2024

View reviewed changes

tests/unit/test_is_stuck.py Outdated Show resolved Hide resolved

Update tests/unit/test_is_stuck.py

ab4d504

enyst commented Dec 14, 2024

View reviewed changes

tests/unit/test_is_stuck.py Outdated Show resolved Hide resolved

Update tests/unit/test_is_stuck.py

1f952c8

enyst commented Dec 14, 2024

View reviewed changes

openhands/controller/stuck.py Show resolved Hide resolved

enyst and others added 3 commits December 14, 2024 18:35

Update openhands/controller/stuck.py

16c82b3

perf: optimize last user message search

e8636e1

- Use reversed() to find last user message - Stop searching once found (break) - Same behavior, just more efficient

docs: explain elegant user message filtering

c180037

The same filter works perfectly in both modes: - In headless: actively filters user messages - In non-headless: no-op (already sliced after last user message)

enyst added the lint-fix Attempts to fix lint issues on the PR label Dec 14, 2024

🤖 Auto-fix Python linting issues

282e59e

enyst commented Dec 14, 2024

View reviewed changes

tests/unit/test_is_stuck.py Show resolved Hide resolved

Update tests/unit/test_is_stuck.py

3291318

enyst commented Dec 14, 2024

View reviewed changes

tests/unit/test_is_stuck.py Show resolved Hide resolved

Update tests/unit/test_is_stuck.py

63cb544

enyst changed the title ~~fix: improve stuck detection in interactive mode~~ fix: allow to continue when the agent is stuck in interactive mode Dec 14, 2024

enyst requested review from xingyaoww and rbren December 14, 2024 18:03

enyst requested a review from neubig December 14, 2024 18:03

xingyaoww approved these changes Dec 14, 2024

View reviewed changes

enyst added the integration-test Runs integration tests on the PR label Dec 14, 2024

enyst merged commit f0257c7 into main Dec 14, 2024
19 checks passed

enyst deleted the fix-stuck-loop-recovery-simple branch December 14, 2024 19:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: allow to continue when the agent is stuck in interactive mode #5597

fix: allow to continue when the agent is stuck in interactive mode #5597

enyst commented Dec 14, 2024 •

edited

Loading

xingyaoww left a comment

github-actions bot commented Dec 14, 2024

github-actions bot commented Dec 14, 2024

fix: allow to continue when the agent is stuck in interactive mode #5597

fix: allow to continue when the agent is stuck in interactive mode #5597

Conversation

enyst commented Dec 14, 2024 • edited Loading

xingyaoww left a comment

Choose a reason for hiding this comment

github-actions bot commented Dec 14, 2024

github-actions bot commented Dec 14, 2024

enyst commented Dec 14, 2024 •

edited

Loading