fix: invalid identifiers #133

j4k0xb · 2024-09-28T22:49:54Z

Closes #117
Also related to #111

0xdevalias · 2024-09-29T01:21:32Z

Closes #117

I would say this is a partial fix/workaround to #117, though I think there is probably a better higher level fix (that can be implemented in another PR) that I would sort of suggest should be done before we consider #117 closed entirely:

The proper fix would be to use toIdentifier from @babel/types

@j4k0xb @jehna From a 2sec search I couldn't find rendered docs, but here's the relevant source for each:

isValidIdentifier: https://github.com/babel/babel/blob/a2025d7d695b0f3b0506f66225d6b15bcfa1cf6a/packages/babel-types/src/validators/isValidIdentifier.ts#L7-L25

toIdentifier: https://github.com/babel/babel/blob/a2025d7d695b0f3b0506f66225d6b15bcfa1cf6a/packages/babel-types/src/converters/toIdentifier.ts#L4-L26

toIdentifier definitely seems like a more robust approach than the current 'prefix with _' approach for sure.

Though I wonder if a 'proper fix' should also involve tweaking how we prompt for/filter the suggestions coming back from the LLM itself as well. Like instead of just forcing an invalid suggestion to be valid (with toIdentifier), we could detect that it's invalid (with isValidIdentifier) and then provide that feedback to the LLM, asking it to give a new suggestion; probably with some max retry limit; after which we could fall back to using the invalid suggestion run through toIdentifier, or log a warning and leave it un-renamed or similar.

Originally posted by @0xdevalias in #117 (comment)

Also related to #111

For context/continuity, while it's related to the 2nd part of the bugs identified in #111, your other fix PR is related to the original error on it:

fix: empty code error #134

jehna · 2024-10-06T20:58:43Z

@0xdevalias good point! I opened another issue for that implementation at #147

j4k0xb · 2024-10-06T21:28:57Z

src/plugins/local-llm-rename/visit-all-identifiers.test.ts

+  assert.equal(result, "const thisKLength = 1;");
+});
+
+test("should handle space in identifier name (happens for some reason though it shouldn't)", async () => {


humanify/src/plugins/local-llm-rename/unminify-variable-name.ts

Line 25 in 73f0424

gbnf`A good name would be '${/[a-zA-Z] [a-zA-Z0-9]{2,12}/}'`

maybe because the regex has a space
and would ^ $ make a difference?

This regex is actually a representation of gbnf, which is a superset of regex that allows ~~strings~~ edit: spaces in between. If it were a normal regex, all variable names should have a space in them, right?

To me this seems like an issue with the llama.cpp gbnf handling

Another thing I'll need to check from the previous issues is that whether the users were using local or openai mode. Local mode is much more robust, openai does not have any actual guarantees about returning a valid identifier name.

@jehna From memory, for at least some of the ones I was looking at/debugging, I believe they were using openai, not local.

j4k0xb force-pushed the fix/invalid-identifiers branch from cb4ea98 to 4eca63a Compare September 28, 2024 23:15

0xdevalias mentioned this pull request Sep 29, 2024

Syntax Error on Babel? #117

Closed

0xdevalias mentioned this pull request Sep 29, 2024

Error: Failed to stringify code #111

Closed

jehna added the bug Something isn't working label Oct 6, 2024

j4k0xb and others added 2 commits October 6, 2024 23:45

fix: invalid identifiers

b7e5d63

Add more invalid identifier tests

a317fd3

jehna force-pushed the fix/invalid-identifiers branch from e31ab3a to a317fd3 Compare October 6, 2024 20:45

jehna merged commit 7f9e0b4 into jehna:main Oct 6, 2024
3 of 4 checks passed

j4k0xb commented Oct 6, 2024

View reviewed changes

j4k0xb deleted the fix/invalid-identifiers branch October 6, 2024 21:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: invalid identifiers #133

fix: invalid identifiers #133

j4k0xb commented Sep 28, 2024 •

edited

Loading

0xdevalias commented Sep 29, 2024 •

edited

Loading

jehna commented Oct 6, 2024

j4k0xb Oct 6, 2024 •

edited

Loading

jehna Oct 6, 2024 •

edited

Loading

jehna Oct 6, 2024

0xdevalias Oct 15, 2024

fix: invalid identifiers #133

fix: invalid identifiers #133

Conversation

j4k0xb commented Sep 28, 2024 • edited Loading

0xdevalias commented Sep 29, 2024 • edited Loading

jehna commented Oct 6, 2024

j4k0xb Oct 6, 2024 • edited Loading

Choose a reason for hiding this comment

jehna Oct 6, 2024 • edited Loading

Choose a reason for hiding this comment

jehna Oct 6, 2024

Choose a reason for hiding this comment

0xdevalias Oct 15, 2024

Choose a reason for hiding this comment

j4k0xb commented Sep 28, 2024 •

edited

Loading

0xdevalias commented Sep 29, 2024 •

edited

Loading

j4k0xb Oct 6, 2024 •

edited

Loading

jehna Oct 6, 2024 •

edited

Loading