Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clean up GenerateBreakTest #975

Merged
merged 32 commits into from
Nov 28, 2024
Merged
Changes from 1 commit
Commits
Show all changes
32 commits
Select commit Hold shift + click to select a range
c48a170
Eggsperiment
eggrobin Nov 14, 2024
a66f92e
Merge remote-tracking branch 'la-vache/main' into generate-old-monkeys
eggrobin Nov 16, 2024
bf59f08
meow
eggrobin Nov 19, 2024
50a3157
Use remap rules for word and sentence too
eggrobin Nov 19, 2024
0205cab
No CM1 or ZWJ_O
eggrobin Nov 19, 2024
523cd0a
Regenerate UCD
eggrobin Nov 19, 2024
1e3d34f
Merge branch 'more-remapping-less-renaming' into generate-old-monkeys
eggrobin Nov 19, 2024
3a61d91
meow
eggrobin Nov 25, 2024
4433cc2
Some segmenter changes
eggrobin Nov 25, 2024
dc33cd0
Merge remote-tracking branch 'la-vache/main' into more-remapping-less…
eggrobin Nov 25, 2024
52e9dbd
^ rather than a variable called Not, UnicodeSet unions rather than |
eggrobin Nov 25, 2024
d6f96c4
Regenerate UCD
eggrobin Nov 25, 2024
cce6869
Merge branch 'more-remapping-less-renaming' into generate-old-monkeys
eggrobin Nov 25, 2024
5f254b9
Not bad but I should do something about QUmPi_Pf
eggrobin Nov 25, 2024
f6597bb
Better.
eggrobin Nov 26, 2024
533377d
Document the thing
eggrobin Nov 26, 2024
52a89cd
Regenerate UCD
eggrobin Nov 26, 2024
6fa6120
spotless
eggrobin Nov 26, 2024
94fb39a
Spotless and remove commented-out code.
eggrobin Nov 26, 2024
c46754f
Dead code elimination
eggrobin Nov 26, 2024
d8e4c4b
Pick the sample cleverly
eggrobin Nov 26, 2024
d22dfbe
Single assignment
eggrobin Nov 26, 2024
0ab3a58
spots
eggrobin Nov 26, 2024
6bcd20f
orig
eggrobin Nov 26, 2024
4ed94be
aaaaa
eggrobin Nov 26, 2024
dc77ece
Lex separately from name resolution…
eggrobin Nov 27, 2024
e213674
Unused variable, name gcb=XX, and a comment.
eggrobin Nov 27, 2024
9d2ddb8
No codepoint left behind
eggrobin Nov 27, 2024
ab96b84
Throw on undefined variables
eggrobin Nov 27, 2024
1bcb433
Suignardian pair table in LineBreakTest.html
eggrobin Nov 27, 2024
a901062
Merge remote-tracking branch 'la-vache/main' into generate-old-monkeys
eggrobin Nov 27, 2024
29fe201
Make it compile
eggrobin Nov 27, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
Regenerate UCD
eggrobin committed Nov 19, 2024
commit 523cd0a9c7b6d71bf40a7dfcf3ee6ab980b34ec3
2 changes: 1 addition & 1 deletion unicodetools/data/ucd/dev/auxiliary/GraphemeBreakTest.html
Original file line number Diff line number Diff line change
@@ -7,7 +7,7 @@
<body bgcolor='#FFFFFF'>
<h2>Grapheme_Cluster_Break Chart</h2>
<p><b>Unicode Version:</b> 17.0.0</p>
<p><b>Date:</b> 2024-10-30, 21:25:11 GMT</p>
<p><b>Date:</b> 2024-11-19, 12:23:00 GMT</p>
<p>This page illustrates the application of the Grapheme_Cluster_Break specification. The material here is informative, not normative.</p> <p>The first chart shows where breaks would appear between different sample characters or strings. The sample characters are chosen mechanically to represent the different properties used by the specification.</p><p>Each cell shows the break-status for the position between the character(s) in its row header and the character(s) in its column header. The × symbol indicates no break, while the ÷ symbol indicated a break. The cells with × are also shaded to make it easier to scan the table. For example, in the cell at the intersection of the row headed by “CR” and the column headed by “LF”, there is a × symbol, indicating that there is no break between CR and LF.</p>
<p>After the heavy blue line in the table are additional rows, either with different sample characters or for sequences. Some column headers may be composed, reflecting “treat as” or “ignore” rules.</p>
<p>If your browser handles titles (tooltips), then hovering the mouse over the row header will show a sample character of that type. Hovering over a column header will show the sample character, plus its abbreviated general category and script. Hovering over the intersected cells shows the rule number that produces the break-status. For example, hovering over the cell at the intersection of LVT and T shows ×, with the rule 8.0. Checking below the table, rule 8.0 is “( LVT | T) × T”, which is the one that applies to that case. Note that a rule is invoked only when no lower-numbered rules have applied.</p>
258 changes: 129 additions & 129 deletions unicodetools/data/ucd/dev/auxiliary/LineBreakTest.html

Large diffs are not rendered by default.

17,958 changes: 8,979 additions & 8,979 deletions unicodetools/data/ucd/dev/auxiliary/LineBreakTest.txt

Large diffs are not rendered by default.

810 changes: 405 additions & 405 deletions unicodetools/data/ucd/dev/auxiliary/SentenceBreakTest.html

Large diffs are not rendered by default.

900 changes: 450 additions & 450 deletions unicodetools/data/ucd/dev/auxiliary/SentenceBreakTest.txt

Large diffs are not rendered by default.

112 changes: 56 additions & 56 deletions unicodetools/data/ucd/dev/auxiliary/WordBreakTest.html

Large diffs are not rendered by default.

2,346 changes: 1,173 additions & 1,173 deletions unicodetools/data/ucd/dev/auxiliary/WordBreakTest.txt

Large diffs are not rendered by default.

Loading