fix: empty code error #134

j4k0xb · 2024-09-29T00:36:36Z

Closes #111

0xdevalias · 2024-09-29T01:26:46Z

Surfacing some of my notes/comments from debugging for additional context/musings on the solution:

So the above seems to be the debugging as to the 'why' of what is going on; though I'm not familiar enough with the codebase to suggest where the best point of implementing the fix is.

If I was to suggest the most minimal/localised fix, it would be to the if (!stringified?.code) check within visitAllIdentifiers (Ref)

Though an argument could also be made that filtering out empty/similar files at the unminify level could have it's merits (Ref); though that would then make it less generic/flexible, as theoretically a plugin in the chain could make it a non-empty file before the openaiRename actually needs to process it.

So I still think the best fix is probably within visitAllIdentifiers.

In fact, in looking through related issues, this just seems to be a variation of the same root cause that I raised in this issue:

Better error handling when humanify local ran on empty file (Error: Failed to stringify code) #54

I also think the general CLI output should be improved to explain what is happening better. eg. a message before running webcrack, outputting the actual filenames (not just the file number), etc.

These issues also seem to tangentially relate to some aspect of this latter suggestion:

Where is the output? #92

Improve model download speed, progress display, etc #58

Originally posted by @0xdevalias in #111 (comment)

But still not a fan of skipping files...

@j4k0xb I guess that really depends where 'skipping' is implemented. The hacky workaround in unminify (Ref) was never suggested as a proper fix to this, it was only a hacky workaround, as I said at the bottom of it:

While this wouldn't really be suited for a proper fix, it should hopefully be enough to do what you need to work around this bug in the meantime.

But for example, because unminify reduces over the passed in plugins, and feeds the code into them; the openaiRename plugin:

humanify/src/plugins/openai/openai-rename.ts

Lines 6 to 20 in 64a1b95

export function openaiRename({

apiKey,

model

}: {

apiKey: string;

model: string;

}) {

const client = new OpenAI({ apiKey });

return async (code: string): Promise<string> => {

return await visitAllIdentifiers(

code,

async (name, surroundingCode) => {

verbose.log(`Renaming ${name}`);

verbose.log("Context: ", surroundingCode);

Or even visitAllIdentifiers itself:

humanify/src/plugins/local-llm-rename/visit-all-identifiers.ts

Lines 15 to 25 in 64a1b95

export async function visitAllIdentifiers(

code: string,

visitor: Visitor,

onProgress?: (percentageDone: number) => void

) {

const ast = await parseAsync(code);

const visited = new Set<string>();

const renames = new Set<string>();

if (!ast) {

throw new Error("Failed to parse code");

}

Could have an 'early guard statement' when code is empty, that provides some feedback and then returns before even needing to attempt to parse the AST.

My suggestion is about identifying the appropriate place to check that the inputs are relevant, and react appropriately if not. There's no reason to parse the AST and have all the other code in that file run just to turn it back into an empty string if we can just handle that case specifically at the beginning. While it's potentially a negligible performance gain, at the very least it makes the handling explicit, and makes the code easier to reason about/less chances for bugs to occur because of it.

Like without thinking too deeply about it, I would probably implement something like this:
   export async function visitAllIdentifiers( 
     code: string, 
     visitor: Visitor, 
     onProgress?: (percentageDone: number) => void 
   ) {
+    if (code === "") return ""
+
     const ast = await parseAsync(code); 
     const visited = new Set<string>(); 
     const renames = new Set<string>(); 
     if (!ast) { 
       throw new Error("Failed to parse code"); 
     }
And because visitAllIdentifiers doesn't seem to currently have any patterns around logging, to keep with that, I would also make a change in openaiRename (and other similar plugins) like this:
  export function openaiRename({ 
     apiKey, 
     model 
   }: { 
     apiKey: string; 
     model: string; 
   }) { 
     const client = new OpenAI({ apiKey }); 
    
     return async (code: string): Promise<string> => { 
+    if (code === "") {
+      verbose.log("Skipping empty file");
+      return code
+    }
+
       return await visitAllIdentifiers( 
         code, 
         async (name, surroundingCode) => { 
           verbose.log(`Renaming ${name}`); 
           verbose.log("Context: ", surroundingCode);
Originally posted by @0xdevalias in #54 (comment)

jehna · 2024-10-06T21:06:31Z

@0xdevalias thanks for pointing this out, added a commit to verify that there's no extra processing on the top level

@j4k0xb thank you so much for your great pull requests! Creating a new release with them soon

No need to unminify empty files

0xdevalias · 2024-10-15T01:17:56Z

src/unminify.ts

+    if (code.trim().length === 0) {
+      verbose.log(`Skipping empty file ${file.path}`);
+      continue;
+    }


@0xdevalias thanks for pointing this out, added a commit to verify that there's no extra processing on the top level

Originally posted by @jehna in #134 (comment)

@jehna This was a suggestion I made only as a quick/dirty hacky workaround. By skipping files at the unminify level you technically weaken the flexibility of the plugins; as it means a plugin can no longer take potentially empty code as input, and have valid code as output. While that's technically not an issue with how things are currently implemented; the other solutions I mentioned would solve this better/more flexibly.

Better error handling when humanify local ran on empty file (Error: Failed to stringify code) #54 (comment)

This was referenced Sep 29, 2024

fix: invalid identifiers #133

Merged

Error: Failed to stringify code #111

Closed

0xdevalias mentioned this pull request Sep 29, 2024

Better error handling when humanify local ran on empty file (Error: Failed to stringify code) #54

Closed

jehna force-pushed the fix/empty-code branch from 4a4b05c to 9be0712 Compare October 6, 2024 21:04

j4k0xb and others added 2 commits October 7, 2024 00:12

fix: empty code error

3f329ec

Add empty file check from top level

6fac3f7

No need to unminify empty files

jehna force-pushed the fix/empty-code branch from 9be0712 to 6fac3f7 Compare October 6, 2024 21:12

jehna added the bug Something isn't working label Oct 6, 2024

jehna merged commit 96c093b into jehna:main Oct 6, 2024
5 checks passed

j4k0xb deleted the fix/empty-code branch October 6, 2024 21:32

0xdevalias reviewed Oct 15, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: empty code error #134

fix: empty code error #134

j4k0xb commented Sep 29, 2024

0xdevalias commented Sep 29, 2024 •

edited

Loading

jehna commented Oct 6, 2024

0xdevalias Oct 15, 2024

fix: empty code error #134

fix: empty code error #134

Conversation

j4k0xb commented Sep 29, 2024

0xdevalias commented Sep 29, 2024 • edited Loading

jehna commented Oct 6, 2024

0xdevalias Oct 15, 2024

Choose a reason for hiding this comment

0xdevalias commented Sep 29, 2024 •

edited

Loading