[Feature]: Aider-inspired RepoMap #2185

neubig · 2024-06-01T14:43:47Z

What problem or use case are you trying to solve?

Aider has a functionality to create a RepoMap, which is a concise description of the repo in text format, with the most relevant/important parts highlighted.

Describe the UX of the solution you'd like

It would be nice to have a RepoMap class within OpenDevin that can be used by any agent to pull in a description of the repo.

Do you have thoughts on the technical implementation?

copy the aider repomap.py into OpenDevin. This could probably be put in either a new indexing folder here, or in the memory folder.
make it possible to import the repomap into one of our competitive agents, such as CodeAct. This functionality should be optional, like the github message (details).
run experiments on SWE-Bench to see if this functionality improves accuracy

Describe alternatives you've considered

We could also implement this from scratch, or create improved code search functionality.

Additional context

Parent issue Create Aider Agent #120

The text was updated successfully, but these errors were encountered:

neubig · 2024-06-01T14:48:16Z

@ryanhoangt will probably take a look at this

tobitege · 2024-06-22T12:51:31Z

Reference: #2248 Add Aider-inspired RepoMap

0xdevalias · 2024-07-01T01:54:46Z

Describe alternatives you've considered

We could also implement this from scratch, or create improved code search functionality.

See also:

Explore using stack graphs for better code search / navigation / context / repo map / etc #742

github-actions · 2024-08-01T01:55:30Z

This issue is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.

0xdevalias · 2024-08-01T03:04:59Z

IMO this shouldn't be closed as stale

FellowTraveler · 2024-08-09T17:05:38Z

repomap and graphrag are critical features IMO for coding agents.

github-actions · 2024-10-05T01:58:09Z

This issue is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.

TomLucidor · 2024-11-20T05:39:48Z

Is this the reason why when the GitHub repo is too large, OpenHands will hang? That there are no systematic way of understanding/ignoring files and folders (including extensions)?

neubig · 2024-11-20T12:42:25Z

Hey @TomLucidor, sorry about the trouble!

If you have a large repo OpenHands should not hang, although it might become confused. The hanging behavior is likely due to something else. We'd be happy to help diagnose this, but in order to do so we'll need to be able to reproduce the problem. The easiest way for us to do this is if you can press the "feedback" button when you're encountering this behavior, or share logs. If you can do that it'd be great if you can open a new issue. I'd also be happy to discuss on slack.

xingyaoww · 2024-11-22T19:18:43Z

Hey @ryanhoangt can you self-assign this one?

ryanhoangt · 2024-11-23T04:52:17Z

Sure!

BradKML · 2025-01-05T03:41:57Z

As a little heads up, has anyone checked these repos out? (something along the lines of "codebase prompt")

enyst · 2025-01-05T03:54:59Z

I found this kind of tool helpful, but not for feeding it as such to openhands, more for local work in or alongside vscode/cursor. It works for large codebases, to dump into Gemini, and go.

It's not great (Gemini is not great, at least pre-2.0), but I get something out of it. For now, I doubt it has the precision to be useful for the agent, considering the tradeoffs here. Have you tried to give it like that, in a message?

BradKML · 2025-01-24T04:20:48Z

@enyst what do you have in mind compared to the established tools made specifically for chatbots? Cus I am tempted but not even sure whether to put/generate it in a text file for the repo, OR to throw it directly in the dialogue box. Asking second opinions

ryanpeach · 2025-01-24T19:17:43Z

Just wanted to point out, you can import aider.repomap and get this feature for free.

ryanpeach · 2025-01-31T19:17:10Z

@rbren where is the PR that completed this?

raymyers · 2025-01-31T19:26:20Z

Just noting here that @xingyaoww discussed the reason for closing this issue in today's community meetup (Jan 31, 2025), so until more info is added here, it should be in the recording about 15 min in.

xingyaoww · 2025-01-31T19:50:53Z

I believe @ryanhoangt can explain more about this, but in summary, we've been trying something like this: All-Hands-AI/openhands-aci#41 for integrating Aider-inspired Repo map -- and it seems we didn't get it working on claude after a few attempts (e.g., performance on SWE-Bench didn't improve when adding those file skeleton) -- hoang feel free to correct me if I'm wrong, and it will be nice if you can share some numbers too!

I found this kind of tool helpful, but not for feeding it as such to openhands, more for local work in or alongside vscode/cursor. It works for large codebases, to dump into Gemini, and go.

As Engel said, there's a significant trade-off here:

For single-turn systems where the LLM typically doesn't have the autonomy to look around and execute actions (e.g., I need to paste context to ChatGPT/Claude for it to generate another block of code), such type of RepoMap is essential -- an example of this would be Aider that primarily focuses on "file-editing" action, as opposed to executing general bash command like "grep -n".
However, for a flexible multi-turn agentic system like OpenHands (CodeAct) -- sometimes adding more of this info actually did more harm than good. Basically, if you don't tell the agent anything, it would think, "I need to poke around and figure out right context,"; but if you tell it something like RepoMap (but it could be incomplete!) -- the agent would trust your RepoMap and think that's everything they need to know to complete the task. Here's another bitter lesson that happened recently (fix: revert #5506 for SWE-Bench performance regression #6491): we think providing less context of the file tree structure to the agent would help, but it actually hurt things a lot - so definitely, "what humans think is useful might not hold true for autonomous multi-turn agents" - and that's why we depends on evals for different design decisions!

I would think things like RepoMap were initially created to build context for a "single-turn LLM-based system."
However, they may not be useful for multi-turn systems like OpenHands, because the agent has the autonomy to look for stuff they need -- and a lot of times, it did better than "our human-curated repo map based on our intuition of what the model needs" -- another vivid reminder of the bitter lesson 😢 .

Here are some baseline code localization performance numbers on a paper I've been working on with some folks (haven't released yet! i can share the preprint here when we get it online):

You can see even if we don't have any system like "RepoMap" to manually construct context within a large repo - whereas other systems like "agentless" and "moatless" all have RepoMap-style systems - they didn't really outperform OpenHands on everything (if not worse!).

Based on these existing results, I've personally feel it is probably less productive to keep pushing towards this direction compared to other things after spending a lot of effort here -- but I could be totally wrong, and would be open to re-open this PR if people from the community is interested in taking over and actually making it work ❤

xingyaoww · 2025-01-31T19:56:26Z

cc @jimwhite -- thanks for the great discussion in today's community meeting! 👆 hopefully this response above would better answer your question :)

aymannadeem · 2025-02-04T22:42:33Z

Hey @xingyaoww! You mentioned that agents often do better when left to explore versus being given a repo map. Could you share some specific examples where the agent's autonomous exploration led to better solutions than when provided with structured context?

BradKML · 2025-02-05T00:56:26Z

I am thinking something like eza can make dynamic folder maps https://github.com/eza-community/eza
Right now when I am testing OH often times the agent would forget the pwd or context of where the terminal is, after cding in or out, which is annoying and warrant some kind of tool that shows both ls and hints at pwd.

enyst · 2025-02-05T01:50:55Z

often times the agent would forget the pwd or context of where the terminal is, after cding in or out, which is annoying and warrant some kind of tool that shows both ls and hints at pwd.

The LLM is already supposed to get in the prompt, after each command, the current directory:

OpenHands/openhands/events/observation/commands.py

Line 159 in fe8b927

ret += f'\n[Current working directory: {self.metadata.working_dir}]'

If that is not happening, maybe you could post a new issue, with some log with it missing or wrong? But I think it's also possible that we send it and sometimes the LLM ignores it anyway (depending on LLM).

BradKML · 2025-02-05T01:55:23Z

It might be an LLM issue then @enyst since I am heavily leaning on DeepSeek v3 atm, but it can't invoke the observation whenever it moves around working directories it seems?

enyst · 2025-02-05T02:30:57Z

It's automatic. The LLM asks for a command to execute ('cd ...'), the environment performs it and responds with an observation which includes cwd as above.

ryanpeach · 2025-02-05T15:19:06Z

Very interesting writeup @xingyaoww

neubig added the enhancement New feature or request label Jun 1, 2024

0xdevalias mentioned this issue Jul 1, 2024

Explore using stack graphs for better code search / navigation / context / repo map / etc #742

Closed

neubig mentioned this issue Jul 3, 2024

Create Aider Agent #120

Closed

github-actions bot added the Stale Inactive for 30 days label Aug 1, 2024

enyst removed the Stale Inactive for 30 days label Aug 1, 2024

neubig added this to the 2024-09 milestone Sep 4, 2024

neubig added this to OpenHands Roadmap Sep 4, 2024

neubig moved this to In Progress in OpenHands Roadmap Sep 4, 2024

github-actions bot added the Stale Inactive for 30 days label Oct 5, 2024

neubig removed the Stale Inactive for 30 days label Oct 5, 2024

neubig removed this from the 2024-09 milestone Oct 8, 2024

neubig added this to the 2024-11 milestone Oct 25, 2024

ryanhoangt mentioned this issue Oct 26, 2024

[Experimental] Integrate repomap #4578

Closed

3 tasks

rbren modified the milestones: 2024-11, 2024-12 Nov 22, 2024

neubig assigned ryanhoangt Nov 23, 2024

ryanhoangt mentioned this issue Dec 5, 2024

Add navigation suggestion and file skeleton for view command All-Hands-AI/openhands-aci#19

Closed

2 tasks

ryanhoangt mentioned this issue Jan 3, 2025

Add navigation suggestion and file skeleton for view command All-Hands-AI/openhands-aci#41

Draft

2 tasks

neubig modified the milestones: 2024-12, 2025-01 Jan 13, 2025

BradKML mentioned this issue Jan 24, 2025

Q: Would GitIngest be good for coder bots? cyclotruc/gitingest#154

Open

BradKML mentioned this issue Jan 25, 2025

Comparison between Code2Prompt and others mufeedvh/code2prompt#65

Closed

rbren moved this from In Progress to Done in OpenHands Roadmap Jan 31, 2025

rbren closed this as completed by moving to Done in OpenHands Roadmap Jan 31, 2025

BradKML mentioned this issue Feb 5, 2025

[Bug]: Agent not recognizing working directory #6612

Open

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature]: Aider-inspired RepoMap #2185

[Feature]: Aider-inspired RepoMap #2185

neubig commented Jun 1, 2024

neubig commented Jun 1, 2024

tobitege commented Jun 22, 2024

0xdevalias commented Jul 1, 2024 •

edited

Loading

github-actions bot commented Aug 1, 2024

0xdevalias commented Aug 1, 2024

FellowTraveler commented Aug 9, 2024

github-actions bot commented Oct 5, 2024

TomLucidor commented Nov 20, 2024

neubig commented Nov 20, 2024

xingyaoww commented Nov 22, 2024

ryanhoangt commented Nov 23, 2024

BradKML commented Jan 5, 2025

enyst commented Jan 5, 2025

BradKML commented Jan 24, 2025

ryanpeach commented Jan 24, 2025

ryanpeach commented Jan 31, 2025

raymyers commented Jan 31, 2025

xingyaoww commented Jan 31, 2025 •

edited

Loading

xingyaoww commented Jan 31, 2025

aymannadeem commented Feb 4, 2025

BradKML commented Feb 5, 2025

enyst commented Feb 5, 2025

BradKML commented Feb 5, 2025

enyst commented Feb 5, 2025

ryanpeach commented Feb 5, 2025

[Feature]: Aider-inspired RepoMap #2185

[Feature]: Aider-inspired RepoMap #2185

Comments

neubig commented Jun 1, 2024

neubig commented Jun 1, 2024

tobitege commented Jun 22, 2024

0xdevalias commented Jul 1, 2024 • edited Loading

github-actions bot commented Aug 1, 2024

0xdevalias commented Aug 1, 2024

FellowTraveler commented Aug 9, 2024

github-actions bot commented Oct 5, 2024

TomLucidor commented Nov 20, 2024

neubig commented Nov 20, 2024

xingyaoww commented Nov 22, 2024

ryanhoangt commented Nov 23, 2024

BradKML commented Jan 5, 2025

enyst commented Jan 5, 2025

BradKML commented Jan 24, 2025

ryanpeach commented Jan 24, 2025

ryanpeach commented Jan 31, 2025

raymyers commented Jan 31, 2025

xingyaoww commented Jan 31, 2025 • edited Loading

xingyaoww commented Jan 31, 2025

aymannadeem commented Feb 4, 2025

BradKML commented Feb 5, 2025

enyst commented Feb 5, 2025

BradKML commented Feb 5, 2025

enyst commented Feb 5, 2025

ryanpeach commented Feb 5, 2025

0xdevalias commented Jul 1, 2024 •

edited

Loading

xingyaoww commented Jan 31, 2025 •

edited

Loading