Add failing tests for #1173: off-by-one discrepancy in the location of reported parser errors #1175

hal7df · 2023-12-22T14:13:47Z

This adds (currently failing) test cases for the issue identified in #1173. I've tried my best at adding a plethora of different variations of invalid JSON, but it's possible I might still be missing some edge cases. I also have written this to test every variation of invalid JSON against the four parser backends identified by @cowtowncoder in the issue comments.

hal7df · 2023-12-22T14:15:29Z

Curiously, DataInput-based parsers pass all of these tests, because they don't provide byte-/character-level position information. It's possible that they might still be susceptible to the off-by-one error if the error occurs across a line boundary, but I haven't been able to get that behavior to occur yet.

hal7df · 2023-12-22T14:18:43Z

src/test/java/com/fasterxml/jackson/failing/read/LocationOfError1173Test.java

+            26,
+            24,
+            1,
+            25


Parser backends aren't consistent about the column in this case, because the exact definition of "column" seems to depend on whether the parser is operating on a byte array or a character array. I'm open to suggestions about what to do with this case.

Column should be exactly same for byte/char -backed input, for ASCII characters (for multi-byte characters there is indeed difference). But I think few tests use characters outside ASCII range..

cowtowncoder · 2023-12-23T22:22:47Z

src/test/java/com/fasterxml/jackson/failing/read/LocationOfError1173Test.java

+
+                byte[] inputBytes = input.getBytes(StandardCharsets.UTF_8);
+                feeder.feedInput(inputBytes, 0, inputBytes.length);
+                feeder.endOfInput();


I don't think this works the way you'd expect... I think endOfInput() should not be called until all input is consumed.

Or maybe I misremember this part, as test appears to work fine :)

cowtowncoder

Ok parameterization etc make things bit hard to follow but I assume things are fine.
I do have suspicion async-parser case won't work as expected but I'll merge and see how things fare.

Add failing tests for FasterXML#1173: parse error location

6f059b2

hal7df commented Dec 22, 2023

View reviewed changes

hal7df mentioned this pull request Dec 22, 2023

JsonLocation consistently off by one character for many invalid JSON parsing cases #1173

Closed

cowtowncoder reviewed Dec 23, 2023

View reviewed changes

cowtowncoder approved these changes Dec 23, 2023

View reviewed changes

cowtowncoder merged commit 9fb3f21 into FasterXML:2.16 Dec 23, 2023
5 checks passed

cowtowncoder added a commit that referenced this pull request Dec 23, 2023

Minor tweaking post-merge wrt #1175

aa5c887

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add failing tests for #1173: off-by-one discrepancy in the location of reported parser errors #1175

Add failing tests for #1173: off-by-one discrepancy in the location of reported parser errors #1175

hal7df commented Dec 22, 2023

hal7df commented Dec 22, 2023

hal7df Dec 22, 2023

cowtowncoder Dec 23, 2023

cowtowncoder Dec 23, 2023

cowtowncoder Dec 24, 2023

cowtowncoder left a comment

+,
+,
+,
+

Add failing tests for #1173: off-by-one discrepancy in the location of reported parser errors #1175

Add failing tests for #1173: off-by-one discrepancy in the location of reported parser errors #1175

Conversation

hal7df commented Dec 22, 2023

hal7df commented Dec 22, 2023

hal7df Dec 22, 2023

Choose a reason for hiding this comment

cowtowncoder Dec 23, 2023

Choose a reason for hiding this comment

cowtowncoder Dec 23, 2023

Choose a reason for hiding this comment

cowtowncoder Dec 24, 2023

Choose a reason for hiding this comment

cowtowncoder left a comment

Choose a reason for hiding this comment