
fix(apply-patch): skip initial new-file hunk header emitted by some AI models #695


Open · msmhrt wants to merge 13 commits into main from fix/apply-patch-skip-gemini-hunk
Conversation

@msmhrt commented Apr 27, 2025

What & Why

Some AI-generated patches (e.g. from Gemini 2.0 Flash) prepend a full diff hunk header like:

@@ -0,0 +1,2 @@ SectionName

before any + lines. Our parser treats that as invalid, causing parse failures or stray blank lines when adding new files.
This PR makes both the TypeScript and Rust parsers ignore only the first such "new-file" hunk header, then proceed normally.


How

TypeScript

  • Introduce NEW_FILE_HUNK regex + isFirst flag in parse_add_file()
  • Skip the very first header matching that pattern
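A minimal sketch of that approach (the `NEW_FILE_HUNK` pattern follows the regex given later in this thread; the helper function and its signature are illustrative, not the actual `parse_add_file()` code):

```typescript
// Matches a stray new-file hunk header such as "@@ -0,0 +1,2 @@ SectionName".
// Pattern taken from the PR discussion; only "-0,0 +1[,n]" headers qualify.
const NEW_FILE_HUNK = /^@@\s+-0,0\s+\+1(?:,[1-9]\d*)?\s+@@/;

// Illustrative helper: drop at most one leading new-file hunk header from the
// body of an Add File block, leaving every other line untouched.
function skipNewFileHunkHeader(lines: string[]): string[] {
  let isFirst = true;
  const result: string[] = [];
  for (const line of lines) {
    if (isFirst && NEW_FILE_HUNK.test(line)) {
      isFirst = false;
      continue; // ignore only the very first matching header
    }
    isFirst = false;
    result.push(line);
  }
  return result;
}
```

Because `isFirst` is cleared after the first line either way, a matching header appearing later in the block is kept, so ordinary patch content is never silently dropped.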

Rust

  • Import regex::Regex
  • Apply the same new_file_re + "first" logic in parse_one_hunk()

Tests

  • TS

    • process_patch – add file skips full hunk headers
    • Regex self-test

  • Rust

    • test_add_file_skip_full_hunk_header_with_section
    • test_new_file_hunk_regex


github-actions bot commented Apr 27, 2025

All contributors have signed the CLA ✍️ ✅
Posted by the CLA Assistant Lite bot.

@msmhrt (Author) commented Apr 27, 2025

I have read the CLA Document and I hereby sign the CLA

Gemini 2.0 Flash emits hunk headers in Add File directives (e.g. "@@ -0,0 +1 @@" or "@@ -0,0 +1,5 @@"). Use "/^@@\s+-0,0\s+\+1(?:,[1-9]\d*)?\s+@@/" to detect and skip these headers.
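That check can be sketched as a small self-test over representative headers (the pattern is taken verbatim from the comment above; the sample lines are illustrative):

```typescript
// Pattern from the comment above: matches Gemini-style new-file hunk headers
// such as "@@ -0,0 +1 @@" or "@@ -0,0 +1,5 @@", with an optional trailing section name.
const newFileHunk = /^@@\s+-0,0\s+\+1(?:,[1-9]\d*)?\s+@@/;

// Each sample pairs a line with whether the regex should match it.
const samples: Array<[string, boolean]> = [
  ["@@ -0,0 +1 @@", true],
  ["@@ -0,0 +1,5 @@ SectionName", true],
  ["@@ -1,3 +1,4 @@", false], // ordinary update hunk, must not be skipped
  ["+just an added line", false],
];

for (const [line, expected] of samples) {
  if (newFileHunk.test(line) !== expected) {
    throw new Error(`unexpected match result for: ${line}`);
  }
}
```

Note that the `[1-9]\d*` part deliberately rejects a zero line count like `+1,0`, so only plausible new-file headers are skipped.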

Signed-off-by: Masami HIRATA <[email protected]>
@msmhrt force-pushed the fix/apply-patch-skip-gemini-hunk branch from 77bdda0 to 389d877 on April 28, 2025 01:13
msmhrt added 3 commits April 28, 2025 15:01
…I models and add unit tests

    - TypeScript parser: introduce NEW_FILE_HUNK regex + isFirst flag
    - Rust parser: same regex + "first" logic in parse_one_hunk
    - Tests (TS & Rust) covering skip-header behavior and regex self-test

Signed-off-by: Masami HIRATA <[email protected]>
@msmhrt msmhrt changed the title apply-patch: skip gemini-2.0-flash hunk headers in Add File blocks fix(apply-patch): skip initial new-file hunk header emitted by some AI models Apr 28, 2025
@bolinfest (Collaborator) commented
I know we are on the hook for putting up an actual proposal, but we are planning to introduce a concept of "plugins." Accepting a patch format that OpenAI models consider invalid output (per their own internal specification) seems like something that should live in a plugin rather than in the core of Codex itself.

@msmhrt (Author) commented Apr 29, 2025

Hi @bolinfest,

Thank you for your comment and for sharing the insight about the planned "plugins" concept. That sounds like an interesting direction for enhancing extensibility in the long run.

Regarding the immediate issue with the apply_patch command, I wanted to provide a bit more context on the problems it's causing. Specifically, there are two main concerns:

  1. Strict Input Requirements: The command currently lacks tolerance for patch formats that slightly deviate from the expected structure, even if they are conceptually valid.
  2. Cryptic Error Messages: The error messages provided upon failure are often unclear and don't offer enough information to pinpoint the exact issue, making debugging very difficult, especially for an AI agent.

Together, these issues create a significant usability problem. AI agents interacting with apply_patch often get stuck in inefficient loops: they generate a patch, it fails (sometimes for unclear reasons), and the unhelpful error message leads to repeated attempts, wasted token consumption, and ultimately task failure. This severely impacts the ability to rely on Codex CLI for automated patch application right now.

I understand from your comment that you envision functionality like accepting varied patch formats potentially living in a plugin, rather than the core. That makes sense from an architectural perspective to keep the core clean.

However, given the significant usability hurdle the current behavior presents, and acknowledging that developing the plugin system might take time, I'd like to ask if exploring any interim improvements to the core apply_patch command might be possible. Would you be open to considering merging the proposed tolerance improvements PR #695, perhaps as a temporary measure or even optionally, just to alleviate the immediate pain point and make the tool more usable in the meantime? The current strictness and poor error feedback are major blockers.

Looking ahead to the plugin system, I think it's a great opportunity. To ensure plugins (and potentially core components too) work effectively in collaboration with AI agents, I'd strongly suggest considering these two principles as part of the design guidelines or best practices:

  1. Robustness/Forgiveness towards AI-generated Input: Components handling AI-generated content should anticipate and tolerate slight imperfections or variations in format, which is inherent in AI generation.
  2. Clear and Actionable Error Messages: Errors should be reported in a way that helps the user (AI or human) understand what went wrong and how to potentially fix it. This is crucial for enabling an effective learning and iteration process.

Incorporating these principles would greatly enhance the value and usability of Codex CLI as an AI-collaborative tool.

Thank you again for engaging with this feedback and considering the PR. I'm happy to discuss further.

Best regards,
