Using autopilot to open a large git repository #195
Comments
Potential solution
To solve the issue of token overflow when dealing with large code repositories, we need to implement a mechanism to split the repository into smaller modules and process each module separately. This approach ensures that we stay within the token limits imposed by the language model and can handle large repositories efficiently.
What is causing this bug?
The bug is caused by the current implementation's inability to handle large repositories efficiently. The existing code processes the entire repository in one go, leading to token overflow issues when interacting with language models. Specifically, the following issues contribute to the bug:
Code
To address these issues, we need to make changes to several files:
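As a rough illustration of the splitting idea in the potential solution (not the actual patch), the sketch below groups files by top-level directory and packs each group into batches that stay under a token budget. The `SourceFile` shape, the `estimateTokens` heuristic of roughly 4 characters per token, and the function names are all assumptions made for this example; they are not taken from AutoPilot's code.

```typescript
// Hypothetical shape for a source file; not AutoPilot's actual type.
interface SourceFile {
  path: string;
  content: string;
}

// Rough heuristic (assumption): about 4 characters per token.
// A real tokenizer would give a more accurate count.
function estimateTokens(text: string): number {
  return Math.ceil(text.length / 4);
}

// Group files by top-level directory ("module"), then pack each module's
// files into batches whose estimated token count stays within maxTokens.
// A single file larger than maxTokens still gets a batch of its own.
function splitIntoBatches(files: SourceFile[], maxTokens: number): SourceFile[][] {
  const modules: Record<string, SourceFile[]> = Object.create(null);
  for (const file of files) {
    const slash = file.path.indexOf("/");
    const key = slash === -1 ? "." : file.path.slice(0, slash);
    (modules[key] = modules[key] || []).push(file);
  }

  const batches: SourceFile[][] = [];
  for (const key of Object.keys(modules)) {
    let current: SourceFile[] = [];
    let used = 0;
    for (const file of modules[key]) {
      const cost = estimateTokens(file.content);
      // Start a new batch when adding this file would exceed the budget.
      if (current.length > 0 && used + cost > maxTokens) {
        batches.push(current);
        current = [];
        used = 0;
      }
      current.push(file);
      used += cost;
    }
    if (current.length > 0) batches.push(current);
  }
  return batches;
}
```

Each batch can then be sent to the model as a separate request. Grouping by directory keeps related files together, which limits how much context is lost by splitting.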
Describe the issue:
I'm running into a problem with large code repositories. With small repositories, autopilot works great, but with large ones I hit a token overflow when interacting with the LLM.
So how should we deal with large code repositories? Sending only part of the source code will affect the context.
Version details:
AutoPilot version tested: a6af6713400b4ba64dd17f0324b6ebf5c65dfad9
Node version: v18.16.1
Expected behaviors:
I expected autopilot to read the entire large code repository. If it cannot read the entire repository at once, it should split it into multiple modules and conduct multiple rounds of dialogue.
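One way the multi-round dialogue could work is sketched below: each round sends one part of the repository together with a running summary of the earlier rounds, so later parts still see some context from the earlier ones. This is only a sketch under assumptions. The `AskLLM` type and `reviewInRounds` function are hypothetical names, and the prompt wording is illustrative, not AutoPilot's.

```typescript
// Hypothetical LLM call signature (assumption); substitute the real client.
type AskLLM = (prompt: string) => Promise<string>;

// Process the repository in several rounds. Each round sees one chunk plus
// the summary produced by the previous round, and returns an updated summary.
async function reviewInRounds(
  chunks: string[],
  task: string,
  ask: AskLLM
): Promise<string> {
  let summary = "";
  for (let i = 0; i < chunks.length; i++) {
    const prompt = [
      "Task: " + task,
      "Summary of previous rounds: " + (summary || "(none yet)"),
      "Part " + (i + 1) + " of " + chunks.length + ":",
      chunks[i],
      "Update the summary with anything in this part that is relevant to the task.",
    ].join("\n");
    // The model's reply becomes the context carried into the next round.
    summary = await ask(prompt);
  }
  return summary;
}
```

The carried summary is lossy, so this trades some accuracy for staying within the token limit. The size of the summary also counts against the budget of each round.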