From 3b5a91318b4fb42d11601a15b81fa60e52d12176 Mon Sep 17 00:00:00 2001 From: Robin Leroy Date: Fri, 13 Dec 2024 12:29:57 +0100 Subject: [PATCH 1/8] UnicodeData.txt lines pieced together from L2/24-151R and the decision --- unicodetools/data/ucd/dev/UnicodeData.txt | 2 ++ 1 file changed, 2 insertions(+) diff --git a/unicodetools/data/ucd/dev/UnicodeData.txt b/unicodetools/data/ucd/dev/UnicodeData.txt index dfe2b67a7..7d8a20b46 100644 --- a/unicodetools/data/ucd/dev/UnicodeData.txt +++ b/unicodetools/data/ucd/dev/UnicodeData.txt @@ -1,3 +1,5 @@ +1F1AE;TOMOBIKI SYMBOL;So;0;L;;;;;N;;;;; +1F7DA;BLACK CIRCLE WITH WHITE VERTICAL BAR;So;0;L;;;;;N;;;;; 0000;;Cc;0;BN;;;;;N;NULL;;;; 0001;;Cc;0;BN;;;;;N;START OF HEADING;;;; 0002;;Cc;0;BN;;;;;N;START OF TEXT;;;; From c897e1f556fc65bbac6b54148d75ef6358bc0bc6 Mon Sep 17 00:00:00 2001 From: Robin Leroy Date: Fri, 13 Dec 2024 12:33:13 +0100 Subject: [PATCH 2/8] Both ID as they were by default --- unicodetools/data/ucd/dev/LineBreak.txt | 9 +++++---- 1 file changed, 5 insertions(+), 4 deletions(-) diff --git a/unicodetools/data/ucd/dev/LineBreak.txt b/unicodetools/data/ucd/dev/LineBreak.txt index 1b00f178a..37157f3f3 100644 --- a/unicodetools/data/ucd/dev/LineBreak.txt +++ b/unicodetools/data/ucd/dev/LineBreak.txt @@ -1,5 +1,5 @@ # LineBreak-17.0.0.txt -# Date: 2024-11-16, 02:53:11 GMT +# Date: 2024-12-13, 11:31:33 GMT # © 2024 Unicode®, Inc. # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. # For terms of use and license, see https://www.unicode.org/terms_of_use.html @@ -3522,7 +3522,8 @@ FFFD ; AI # So REPLACEMENT CHARACTER 1F16A..1F16F ; AL # So [6] RAISED MC SIGN..CIRCLED HUMAN FIGURE 1F170..1F1AC ; AI # So [61] NEGATIVE SQUARED LATIN CAPITAL LETTER A..SQUARED VOD 1F1AD ; AL # So MASK WORK SYMBOL -1F1AE..1F1E5 ; ID # Cn [56] .. +1F1AE ; ID # So TOMOBIKI SYMBOL +1F1AF..1F1E5 ; ID # Cn [55] .. 1F1E6..1F1FF ; RI # So [26] REGIONAL INDICATOR SYMBOL LETTER A..REGIONAL INDICATOR SYMBOL LETTER Z 1F200..1F202 ; ID # So [3] SQUARE HIRAGANA HOKA..SQUARED KATAKANA SA 1F203..1F20F ; ID # Cn [13] .. @@ -3624,8 +3625,8 @@ FFFD ; AI # So REPLACEMENT CHARACTER 1F777..1F77A ; AL # So [4] VESTA FORM TWO..PARTHENOPE FORM TWO 1F77B..1F77F ; ID # So [5] HAUMEA..ORCUS 1F780..1F7D4 ; AL # So [85] BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE..HEAVY TWELVE POINTED PINWHEEL STAR -1F7D5..1F7D9 ; ID # So [5] CIRCLED TRIANGLE..NINE POINTED WHITE STAR -1F7DA..1F7DF ; ID # Cn [6] .. +1F7D5..1F7DA ; ID # So [6] CIRCLED TRIANGLE..BLACK CIRCLE WITH WHITE VERTICAL BAR +1F7DB..1F7DF ; ID # Cn [5] .. 1F7E0..1F7EB ; ID # So [12] LARGE ORANGE CIRCLE..LARGE BROWN SQUARE 1F7EC..1F7EF ; ID # Cn [4] .. 1F7F0 ; ID # So HEAVY EQUALS SIGN From 8d32693f5caedbc75709ed510a9d8540c9af71dd Mon Sep 17 00:00:00 2001 From: Robin Leroy Date: Fri, 13 Dec 2024 12:37:54 +0100 Subject: [PATCH 3/8] common --- unicodetools/data/ucd/dev/Scripts.txt | 2 ++ 1 file changed, 2 insertions(+) diff --git a/unicodetools/data/ucd/dev/Scripts.txt b/unicodetools/data/ucd/dev/Scripts.txt index 21224d93a..39c9a9432 100644 --- a/unicodetools/data/ucd/dev/Scripts.txt +++ b/unicodetools/data/ucd/dev/Scripts.txt @@ -1,3 +1,5 @@ +1F1AE;Common +1F7DA;Common # Scripts-17.0.0.txt # Date: 2024-11-16, 02:53:45 GMT # © 2024 Unicode®, Inc. From 7b275e7d2a040db97fbab350ba7670c76d1b98f4 Mon Sep 17 00:00:00 2001 From: Robin Leroy Date: Fri, 13 Dec 2024 12:41:23 +0100 Subject: [PATCH 4/8] Regenerate UCD --- unicodetools/data/ucd/dev/DerivedAge.txt | 6 ++++-- .../data/ucd/dev/DerivedCoreProperties.txt | 8 ++++---- unicodetools/data/ucd/dev/EastAsianWidth.txt | 6 +++--- unicodetools/data/ucd/dev/Scripts.txt | 10 ++++------ unicodetools/data/ucd/dev/UnicodeData.txt | 4 ++-- unicodetools/data/ucd/dev/VerticalOrientation.txt | 10 +++++----- .../data/ucd/dev/extracted/DerivedBidiClass.txt | 6 ++++-- .../ucd/dev/extracted/DerivedCombiningClass.txt | 8 ++++---- .../ucd/dev/extracted/DerivedEastAsianWidth.txt | 8 ++++---- .../ucd/dev/extracted/DerivedGeneralCategory.txt | 14 +++++++------- .../data/ucd/dev/extracted/DerivedLineBreak.txt | 7 ++++--- .../data/ucd/dev/extracted/DerivedName.txt | 6 ++++-- 12 files changed, 49 insertions(+), 44 deletions(-) diff --git a/unicodetools/data/ucd/dev/DerivedAge.txt b/unicodetools/data/ucd/dev/DerivedAge.txt index f6edbad34..e89b19925 100644 --- a/unicodetools/data/ucd/dev/DerivedAge.txt +++ b/unicodetools/data/ucd/dev/DerivedAge.txt @@ -1,5 +1,5 @@ # DerivedAge-17.0.0.txt -# Date: 2024-11-16, 02:52:39 GMT +# Date: 2024-12-13, 11:39:18 GMT # © 2024 Unicode®, Inc. # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. # For terms of use and license, see https://www.unicode.org/terms_of_use.html @@ -2101,8 +2101,10 @@ FDC8..FDCE ; 17.0 # [7] ARABIC LIGATURE RAHIMAHU ALLAAH TAAALAA..ARABIC LIG 1E6C0..1E6DE ; 17.0 # [31] TAI YO LETTER LOW KO..TAI YO LETTER HIGH KVO 1E6E0..1E6F5 ; 17.0 # [22] TAI YO LETTER AA..TAI YO SIGN OM 1E6FE..1E6FF ; 17.0 # [2] TAI YO SYMBOL MUEANG..TAI YO XAM LAI +1F1AE ; 17.0 # TOMOBIKI SYMBOL 1F6D8 ; 17.0 # LANDSLIDE 1F777..1F77A ; 17.0 # [4] VESTA FORM TWO..PARTHENOPE FORM TWO +1F7DA ; 17.0 # BLACK CIRCLE WITH WHITE VERTICAL BAR 1F8D0..1F8D8 ; 17.0 # [9] LONG RIGHTWARDS ARROW OVER LONG LEFTWARDS ARROW..LONG LEFT RIGHT ARROW WITH DEPENDENT LOBE 1FA54..1FA57 ; 17.0 # [4] WHITE CHESS FERZ..BLACK CHESS ALFIL 1FA8A ; 17.0 # TROMBONE @@ -2116,6 +2118,6 @@ FDC8..FDCE ; 17.0 # [7] ARABIC LIGATURE RAHIMAHU ALLAAH TAAALAA..ARABIC LIG 2B73A..2B73E ; 17.0 # [5] CJK UNIFIED IDEOGRAPH-2B73A..CJK UNIFIED IDEOGRAPH-2B73E 323B0..33479 ; 17.0 # [4298] CJK UNIFIED IDEOGRAPH-323B0..CJK UNIFIED IDEOGRAPH-33479 -# Total code points: 4836 +# Total code points: 4838 # EOF diff --git a/unicodetools/data/ucd/dev/DerivedCoreProperties.txt b/unicodetools/data/ucd/dev/DerivedCoreProperties.txt index b40874a5a..0c6ae2d0a 100644 --- a/unicodetools/data/ucd/dev/DerivedCoreProperties.txt +++ b/unicodetools/data/ucd/dev/DerivedCoreProperties.txt @@ -1,5 +1,5 @@ # DerivedCoreProperties-17.0.0.txt -# Date: 2024-11-16, 02:53:03 GMT +# Date: 2024-12-13, 11:39:47 GMT # © 2024 Unicode®, Inc. # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. # For terms of use and license, see https://www.unicode.org/terms_of_use.html @@ -12971,7 +12971,7 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME 1F0C1..1F0CF ; Grapheme_Base # So [15] PLAYING CARD ACE OF DIAMONDS..PLAYING CARD BLACK JOKER 1F0D1..1F0F5 ; Grapheme_Base # So [37] PLAYING CARD ACE OF CLUBS..PLAYING CARD TRUMP-21 1F100..1F10C ; Grapheme_Base # No [13] DIGIT ZERO FULL STOP..DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT ZERO -1F10D..1F1AD ; Grapheme_Base # So [161] CIRCLED ZERO WITH SLASH..MASK WORK SYMBOL +1F10D..1F1AE ; Grapheme_Base # So [162] CIRCLED ZERO WITH SLASH..TOMOBIKI SYMBOL 1F1E6..1F202 ; Grapheme_Base # So [29] REGIONAL INDICATOR SYMBOL LETTER A..SQUARED KATAKANA SA 1F210..1F23B ; Grapheme_Base # So [44] SQUARED CJK UNIFIED IDEOGRAPH-624B..SQUARED CJK UNIFIED IDEOGRAPH-914D 1F240..1F248 ; Grapheme_Base # So [9] TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-672C..TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-6557 @@ -12982,7 +12982,7 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME 1F400..1F6D8 ; Grapheme_Base # So [729] RAT..LANDSLIDE 1F6DC..1F6EC ; Grapheme_Base # So [17] WIRELESS..AIRPLANE ARRIVING 1F6F0..1F6FC ; Grapheme_Base # So [13] SATELLITE..ROLLER SKATE -1F700..1F7D9 ; Grapheme_Base # So [218] ALCHEMICAL SYMBOL FOR QUINTESSENCE..NINE POINTED WHITE STAR +1F700..1F7DA ; Grapheme_Base # So [219] ALCHEMICAL SYMBOL FOR QUINTESSENCE..BLACK CIRCLE WITH WHITE VERTICAL BAR 1F7E0..1F7EB ; Grapheme_Base # So [12] LARGE ORANGE CIRCLE..LARGE BROWN SQUARE 1F7F0 ; Grapheme_Base # So HEAVY EQUALS SIGN 1F800..1F80B ; Grapheme_Base # So [12] LEFTWARDS ARROW WITH SMALL TRIANGLE ARROWHEAD..DOWNWARDS ARROW WITH LARGE TRIANGLE ARROWHEAD @@ -13016,7 +13016,7 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME 30000..3134A ; Grapheme_Base # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A 31350..33479 ; Grapheme_Base # Lo [8490] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-33479 -# Total code points: 157523 +# Total code points: 157525 # ================================================ diff --git a/unicodetools/data/ucd/dev/EastAsianWidth.txt b/unicodetools/data/ucd/dev/EastAsianWidth.txt index d86aea8f4..32286a164 100644 --- a/unicodetools/data/ucd/dev/EastAsianWidth.txt +++ b/unicodetools/data/ucd/dev/EastAsianWidth.txt @@ -1,5 +1,5 @@ # EastAsianWidth-17.0.0.txt -# Date: 2024-11-16, 02:53:10 GMT +# Date: 2024-12-13, 11:39:53 GMT # © 2024 Unicode®, Inc. # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. # For terms of use and license, see https://www.unicode.org/terms_of_use.html @@ -2612,7 +2612,7 @@ FFFD ; A # So REPLACEMENT CHARACTER 1F18F..1F190 ; A # So [2] NEGATIVE SQUARED WC..SQUARE DJ 1F191..1F19A ; W # So [10] SQUARED CL..SQUARED VS 1F19B..1F1AC ; A # So [18] SQUARED THREE D..SQUARED VOD -1F1AD ; N # So MASK WORK SYMBOL +1F1AD..1F1AE ; N # So [2] MASK WORK SYMBOL..TOMOBIKI SYMBOL 1F1E6..1F1FF ; N # So [26] REGIONAL INDICATOR SYMBOL LETTER A..REGIONAL INDICATOR SYMBOL LETTER Z 1F200..1F202 ; W # So [3] SQUARE HIRAGANA HOKA..SQUARED KATAKANA SA 1F210..1F23B ; W # So [44] SQUARED CJK UNIFIED IDEOGRAPH-624B..SQUARED CJK UNIFIED IDEOGRAPH-914D @@ -2671,7 +2671,7 @@ FFFD ; A # So REPLACEMENT CHARACTER 1F6F0..1F6F3 ; N # So [4] SATELLITE..PASSENGER SHIP 1F6F4..1F6FC ; W # So [9] SCOOTER..ROLLER SKATE 1F700..1F77F ; N # So [128] ALCHEMICAL SYMBOL FOR QUINTESSENCE..ORCUS -1F780..1F7D9 ; N # So [90] BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE..NINE POINTED WHITE STAR +1F780..1F7DA ; N # So [91] BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE..BLACK CIRCLE WITH WHITE VERTICAL BAR 1F7E0..1F7EB ; W # So [12] LARGE ORANGE CIRCLE..LARGE BROWN SQUARE 1F7F0 ; W # So HEAVY EQUALS SIGN 1F800..1F80B ; N # So [12] LEFTWARDS ARROW WITH SMALL TRIANGLE ARROWHEAD..DOWNWARDS ARROW WITH LARGE TRIANGLE ARROWHEAD diff --git a/unicodetools/data/ucd/dev/Scripts.txt b/unicodetools/data/ucd/dev/Scripts.txt index 39c9a9432..40516236c 100644 --- a/unicodetools/data/ucd/dev/Scripts.txt +++ b/unicodetools/data/ucd/dev/Scripts.txt @@ -1,7 +1,5 @@ -1F1AE;Common -1F7DA;Common # Scripts-17.0.0.txt -# Date: 2024-11-16, 02:53:45 GMT +# Date: 2024-12-13, 11:40:28 GMT # © 2024 Unicode®, Inc. # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. # For terms of use and license, see https://www.unicode.org/terms_of_use.html @@ -601,7 +599,7 @@ FFFC..FFFD ; Common # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEMENT CHAR 1F0C1..1F0CF ; Common # So [15] PLAYING CARD ACE OF DIAMONDS..PLAYING CARD BLACK JOKER 1F0D1..1F0F5 ; Common # So [37] PLAYING CARD ACE OF CLUBS..PLAYING CARD TRUMP-21 1F100..1F10C ; Common # No [13] DIGIT ZERO FULL STOP..DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT ZERO -1F10D..1F1AD ; Common # So [161] CIRCLED ZERO WITH SLASH..MASK WORK SYMBOL +1F10D..1F1AE ; Common # So [162] CIRCLED ZERO WITH SLASH..TOMOBIKI SYMBOL 1F1E6..1F1FF ; Common # So [26] REGIONAL INDICATOR SYMBOL LETTER A..REGIONAL INDICATOR SYMBOL LETTER Z 1F201..1F202 ; Common # So [2] SQUARED KATAKANA KOKO..SQUARED KATAKANA SA 1F210..1F23B ; Common # So [44] SQUARED CJK UNIFIED IDEOGRAPH-624B..SQUARED CJK UNIFIED IDEOGRAPH-914D @@ -613,7 +611,7 @@ FFFC..FFFD ; Common # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEMENT CHAR 1F400..1F6D8 ; Common # So [729] RAT..LANDSLIDE 1F6DC..1F6EC ; Common # So [17] WIRELESS..AIRPLANE ARRIVING 1F6F0..1F6FC ; Common # So [13] SATELLITE..ROLLER SKATE -1F700..1F7D9 ; Common # So [218] ALCHEMICAL SYMBOL FOR QUINTESSENCE..NINE POINTED WHITE STAR +1F700..1F7DA ; Common # So [219] ALCHEMICAL SYMBOL FOR QUINTESSENCE..BLACK CIRCLE WITH WHITE VERTICAL BAR 1F7E0..1F7EB ; Common # So [12] LARGE ORANGE CIRCLE..LARGE BROWN SQUARE 1F7F0 ; Common # So HEAVY EQUALS SIGN 1F800..1F80B ; Common # So [12] LEFTWARDS ARROW WITH SMALL TRIANGLE ARROWHEAD..DOWNWARDS ARROW WITH LARGE TRIANGLE ARROWHEAD @@ -640,7 +638,7 @@ FFFC..FFFD ; Common # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEMENT CHAR E0001 ; Common # Cf LANGUAGE TAG E0020..E007F ; Common # Cf [96] TAG SPACE..CANCEL TAG -# Total code points: 9123 +# Total code points: 9125 # ================================================ diff --git a/unicodetools/data/ucd/dev/UnicodeData.txt b/unicodetools/data/ucd/dev/UnicodeData.txt index 7d8a20b46..d11b9914f 100644 --- a/unicodetools/data/ucd/dev/UnicodeData.txt +++ b/unicodetools/data/ucd/dev/UnicodeData.txt @@ -1,5 +1,3 @@ -1F1AE;TOMOBIKI SYMBOL;So;0;L;;;;;N;;;;; -1F7DA;BLACK CIRCLE WITH WHITE VERTICAL BAR;So;0;L;;;;;N;;;;; 0000;;Cc;0;BN;;;;;N;NULL;;;; 0001;;Cc;0;BN;;;;;N;START OF HEADING;;;; 0002;;Cc;0;BN;;;;;N;START OF TEXT;;;; @@ -37484,6 +37482,7 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;; 1F1AB;SQUARED UHD;So;0;L;;;;;N;;;;; 1F1AC;SQUARED VOD;So;0;L;;;;;N;;;;; 1F1AD;MASK WORK SYMBOL;So;0;ON;;;;;N;;;;; +1F1AE;TOMOBIKI SYMBOL;So;0;L;;;;;N;;;;; 1F1E6;REGIONAL INDICATOR SYMBOL LETTER A;So;0;L;;;;;N;;;;; 1F1E7;REGIONAL INDICATOR SYMBOL LETTER B;So;0;L;;;;;N;;;;; 1F1E8;REGIONAL INDICATOR SYMBOL LETTER C;So;0;L;;;;;N;;;;; @@ -38807,6 +38806,7 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;; 1F7D7;CIRCLED SQUARE;So;0;ON;;;;;N;;;;; 1F7D8;NEGATIVE CIRCLED SQUARE;So;0;ON;;;;;N;;;;; 1F7D9;NINE POINTED WHITE STAR;So;0;ON;;;;;N;;;;; +1F7DA;BLACK CIRCLE WITH WHITE VERTICAL BAR;So;0;L;;;;;N;;;;; 1F7E0;LARGE ORANGE CIRCLE;So;0;ON;;;;;N;;;;; 1F7E1;LARGE YELLOW CIRCLE;So;0;ON;;;;;N;;;;; 1F7E2;LARGE GREEN CIRCLE;So;0;ON;;;;;N;;;;; diff --git a/unicodetools/data/ucd/dev/VerticalOrientation.txt b/unicodetools/data/ucd/dev/VerticalOrientation.txt index 557aa4454..828efa053 100644 --- a/unicodetools/data/ucd/dev/VerticalOrientation.txt +++ b/unicodetools/data/ucd/dev/VerticalOrientation.txt @@ -1,5 +1,5 @@ # VerticalOrientation-17.0.0.txt -# Date: 2024-11-16, 02:53:48 GMT +# Date: 2024-12-13, 11:40:32 GMT # © 2024 Unicode®, Inc. # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. # For terms of use and license, see https://www.unicode.org/terms_of_use.html @@ -2461,8 +2461,8 @@ FFFC..FFFD ; U # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEMENT CHARA 1F0D1..1F0F5 ; U # So [37] PLAYING CARD ACE OF CLUBS..PLAYING CARD TRUMP-21 1F0F6..1F0FF ; U # Cn [10] .. 1F100..1F10C ; U # No [13] DIGIT ZERO FULL STOP..DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT ZERO -1F10D..1F1AD ; U # So [161] CIRCLED ZERO WITH SLASH..MASK WORK SYMBOL -1F1AE..1F1E5 ; U # Cn [56] .. +1F10D..1F1AE ; U # So [162] CIRCLED ZERO WITH SLASH..TOMOBIKI SYMBOL +1F1AF..1F1E5 ; U # Cn [55] .. 1F1E6..1F1FF ; U # So [26] REGIONAL INDICATOR SYMBOL LETTER A..REGIONAL INDICATOR SYMBOL LETTER Z 1F200..1F201 ; Tu # So [2] SQUARE HIRAGANA HOKA..SQUARED KATAKANA KOKO 1F202 ; U # So SQUARED KATAKANA SA @@ -2487,8 +2487,8 @@ FFFC..FFFD ; U # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEMENT CHARA 1F6F0..1F6FC ; U # So [13] SATELLITE..ROLLER SKATE 1F6FD..1F6FF ; U # Cn [3] .. 1F700..1F77F ; U # So [128] ALCHEMICAL SYMBOL FOR QUINTESSENCE..ORCUS -1F780..1F7D9 ; U # So [90] BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE..NINE POINTED WHITE STAR -1F7DA..1F7DF ; U # Cn [6] .. +1F780..1F7DA ; U # So [91] BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE..BLACK CIRCLE WITH WHITE VERTICAL BAR +1F7DB..1F7DF ; U # Cn [5] .. 1F7E0..1F7EB ; U # So [12] LARGE ORANGE CIRCLE..LARGE BROWN SQUARE 1F7EC..1F7EF ; U # Cn [4] .. 1F7F0 ; U # So HEAVY EQUALS SIGN diff --git a/unicodetools/data/ucd/dev/extracted/DerivedBidiClass.txt b/unicodetools/data/ucd/dev/extracted/DerivedBidiClass.txt index 7dfa50cd7..07d5900f2 100644 --- a/unicodetools/data/ucd/dev/extracted/DerivedBidiClass.txt +++ b/unicodetools/data/ucd/dev/extracted/DerivedBidiClass.txt @@ -1,5 +1,5 @@ # DerivedBidiClass-17.0.0.txt -# Date: 2024-12-11, 16:17:55 GMT +# Date: 2024-12-13, 11:39:44 GMT # © 2024 Unicode®, Inc. # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. # For terms of use and license, see https://www.unicode.org/terms_of_use.html @@ -1218,10 +1218,12 @@ FFDA..FFDC ; L # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL LETTER 1F110..1F12E ; L # So [31] PARENTHESIZED LATIN CAPITAL LETTER A..CIRCLED WZ 1F130..1F169 ; L # So [58] SQUARED LATIN CAPITAL LETTER A..NEGATIVE CIRCLED LATIN CAPITAL LETTER Z 1F170..1F1AC ; L # So [61] NEGATIVE SQUARED LATIN CAPITAL LETTER A..SQUARED VOD +1F1AE ; L # So TOMOBIKI SYMBOL 1F1E6..1F202 ; L # So [29] REGIONAL INDICATOR SYMBOL LETTER A..SQUARED KATAKANA SA 1F210..1F23B ; L # So [44] SQUARED CJK UNIFIED IDEOGRAPH-624B..SQUARED CJK UNIFIED IDEOGRAPH-914D 1F240..1F248 ; L # So [9] TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-672C..TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-6557 1F250..1F251 ; L # So [2] CIRCLED IDEOGRAPH ADVANTAGE..CIRCLED IDEOGRAPH ACCEPT +1F7DA ; L # So BLACK CIRCLE WITH WHITE VERTICAL BAR 20000..2A6DF ; L # Lo [42720] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6DF 2A700..2B73E ; L # Lo [4159] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B73E 2B740..2B81D ; L # Lo [222] CJK UNIFIED IDEOGRAPH-2B740..CJK UNIFIED IDEOGRAPH-2B81D @@ -1234,7 +1236,7 @@ FFDA..FFDC ; L # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL LETTER F0000..FFFFD ; L # Co [65534] .. 100000..10FFFD; L # Co [65534] .. -# The above property value applies to 810584 code points not listed here. +# The above property value applies to 810582 code points not listed here. # Total code points: 1095402 # ================================================ diff --git a/unicodetools/data/ucd/dev/extracted/DerivedCombiningClass.txt b/unicodetools/data/ucd/dev/extracted/DerivedCombiningClass.txt index 1d9d2477d..c57044086 100644 --- a/unicodetools/data/ucd/dev/extracted/DerivedCombiningClass.txt +++ b/unicodetools/data/ucd/dev/extracted/DerivedCombiningClass.txt @@ -1,5 +1,5 @@ # DerivedCombiningClass-17.0.0.txt -# Date: 2024-11-16, 02:53:02 GMT +# Date: 2024-12-13, 11:39:46 GMT # © 2024 Unicode®, Inc. # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. # For terms of use and license, see https://www.unicode.org/terms_of_use.html @@ -2045,7 +2045,7 @@ FFFC..FFFD ; 0 # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEMENT CHARACTER 1F0C1..1F0CF ; 0 # So [15] PLAYING CARD ACE OF DIAMONDS..PLAYING CARD BLACK JOKER 1F0D1..1F0F5 ; 0 # So [37] PLAYING CARD ACE OF CLUBS..PLAYING CARD TRUMP-21 1F100..1F10C ; 0 # No [13] DIGIT ZERO FULL STOP..DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT ZERO -1F10D..1F1AD ; 0 # So [161] CIRCLED ZERO WITH SLASH..MASK WORK SYMBOL +1F10D..1F1AE ; 0 # So [162] CIRCLED ZERO WITH SLASH..TOMOBIKI SYMBOL 1F1E6..1F202 ; 0 # So [29] REGIONAL INDICATOR SYMBOL LETTER A..SQUARED KATAKANA SA 1F210..1F23B ; 0 # So [44] SQUARED CJK UNIFIED IDEOGRAPH-624B..SQUARED CJK UNIFIED IDEOGRAPH-914D 1F240..1F248 ; 0 # So [9] TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-672C..TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-6557 @@ -2056,7 +2056,7 @@ FFFC..FFFD ; 0 # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEMENT CHARACTER 1F400..1F6D8 ; 0 # So [729] RAT..LANDSLIDE 1F6DC..1F6EC ; 0 # So [17] WIRELESS..AIRPLANE ARRIVING 1F6F0..1F6FC ; 0 # So [13] SATELLITE..ROLLER SKATE -1F700..1F7D9 ; 0 # So [218] ALCHEMICAL SYMBOL FOR QUINTESSENCE..NINE POINTED WHITE STAR +1F700..1F7DA ; 0 # So [219] ALCHEMICAL SYMBOL FOR QUINTESSENCE..BLACK CIRCLE WITH WHITE VERTICAL BAR 1F7E0..1F7EB ; 0 # So [12] LARGE ORANGE CIRCLE..LARGE BROWN SQUARE 1F7F0 ; 0 # So HEAVY EQUALS SIGN 1F800..1F80B ; 0 # So [12] LEFTWARDS ARROW WITH SMALL TRIANGLE ARROWHEAD..DOWNWARDS ARROW WITH LARGE TRIANGLE ARROWHEAD @@ -2095,7 +2095,7 @@ E0100..E01EF ; 0 # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256 F0000..FFFFD ; 0 # Co [65534] .. 100000..10FFFD; 0 # Co [65534] .. -# The above property value applies to 816745 code points not listed here. +# The above property value applies to 816743 code points not listed here. # Total code points: 1113143 # ================================================ diff --git a/unicodetools/data/ucd/dev/extracted/DerivedEastAsianWidth.txt b/unicodetools/data/ucd/dev/extracted/DerivedEastAsianWidth.txt index f805c9c6e..052ebb230 100644 --- a/unicodetools/data/ucd/dev/extracted/DerivedEastAsianWidth.txt +++ b/unicodetools/data/ucd/dev/extracted/DerivedEastAsianWidth.txt @@ -1,5 +1,5 @@ # DerivedEastAsianWidth-17.0.0.txt -# Date: 2024-11-16, 02:53:05 GMT +# Date: 2024-12-13, 11:39:49 GMT # © 2024 Unicode®, Inc. # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. # For terms of use and license, see https://www.unicode.org/terms_of_use.html @@ -2098,7 +2098,7 @@ FFFC ; N # So OBJECT REPLACEMENT CHARACTER 1F10D..1F10F ; N # So [3] CIRCLED ZERO WITH SLASH..CIRCLED DOLLAR SIGN WITH OVERLAID BACKSLASH 1F12E..1F12F ; N # So [2] CIRCLED WZ..COPYLEFT SYMBOL 1F16A..1F16F ; N # So [6] RAISED MC SIGN..CIRCLED HUMAN FIGURE -1F1AD ; N # So MASK WORK SYMBOL +1F1AD..1F1AE ; N # So [2] MASK WORK SYMBOL..TOMOBIKI SYMBOL 1F1E6..1F1FF ; N # So [26] REGIONAL INDICATOR SYMBOL LETTER A..REGIONAL INDICATOR SYMBOL LETTER Z 1F321..1F32C ; N # So [12] THERMOMETER..WIND BLOWING FACE 1F336 ; N # So HOT PEPPER @@ -2123,7 +2123,7 @@ FFFC ; N # So OBJECT REPLACEMENT CHARACTER 1F6D3..1F6D4 ; N # So [2] STUPA..PAGODA 1F6E0..1F6EA ; N # So [11] HAMMER AND WRENCH..NORTHEAST-POINTING AIRPLANE 1F6F0..1F6F3 ; N # So [4] SATELLITE..PASSENGER SHIP -1F700..1F7D9 ; N # So [218] ALCHEMICAL SYMBOL FOR QUINTESSENCE..NINE POINTED WHITE STAR +1F700..1F7DA ; N # So [219] ALCHEMICAL SYMBOL FOR QUINTESSENCE..BLACK CIRCLE WITH WHITE VERTICAL BAR 1F800..1F80B ; N # So [12] LEFTWARDS ARROW WITH SMALL TRIANGLE ARROWHEAD..DOWNWARDS ARROW WITH LARGE TRIANGLE ARROWHEAD 1F810..1F847 ; N # So [56] LEFTWARDS ARROW WITH SMALL EQUILATERAL ARROWHEAD..DOWNWARDS HEAVY ARROW 1F850..1F859 ; N # So [10] LEFTWARDS SANS-SERIF ARROW..UP DOWN SANS-SERIF ARROW @@ -2144,7 +2144,7 @@ FFFC ; N # So OBJECT REPLACEMENT CHARACTER E0001 ; N # Cf LANGUAGE TAG E0020..E007F ; N # Cf [96] TAG SPACE..CANCEL TAG -# The above property value applies to 760566 code points not listed here. +# The above property value applies to 760564 code points not listed here. # Total code points: 792267 # ================================================ diff --git a/unicodetools/data/ucd/dev/extracted/DerivedGeneralCategory.txt b/unicodetools/data/ucd/dev/extracted/DerivedGeneralCategory.txt index fab067f6d..2d351c1bf 100644 --- a/unicodetools/data/ucd/dev/extracted/DerivedGeneralCategory.txt +++ b/unicodetools/data/ucd/dev/extracted/DerivedGeneralCategory.txt @@ -1,5 +1,5 @@ # DerivedGeneralCategory-17.0.0.txt -# Date: 2024-11-16, 02:53:06 GMT +# Date: 2024-12-13, 11:39:49 GMT # © 2024 Unicode®, Inc. # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. # For terms of use and license, see https://www.unicode.org/terms_of_use.html @@ -708,7 +708,7 @@ FFFE..FFFF ; Cn # [2] .. 1F0C0 ; Cn # 1F0D0 ; Cn # 1F0F6..1F0FF ; Cn # [10] .. -1F1AE..1F1E5 ; Cn # [56] .. +1F1AF..1F1E5 ; Cn # [55] .. 1F203..1F20F ; Cn # [13] .. 1F23C..1F23F ; Cn # [4] .. 1F249..1F24F ; Cn # [7] .. @@ -717,7 +717,7 @@ FFFE..FFFF ; Cn # [2] .. 1F6D9..1F6DB ; Cn # [3] .. 1F6ED..1F6EF ; Cn # [3] .. 1F6FD..1F6FF ; Cn # [3] .. -1F7DA..1F7DF ; Cn # [6] .. +1F7DB..1F7DF ; Cn # [5] .. 1F7EC..1F7EF ; Cn # [4] .. 1F7F1..1F7FF ; Cn # [15] .. 1F80C..1F80F ; Cn # [4] .. @@ -754,7 +754,7 @@ E01F0..EFFFF ; Cn # [65040] .. FFFFE..FFFFF ; Cn # [2] .. 10FFFE..10FFFF; Cn # [2] .. -# Total code points: 814697 +# Total code points: 814695 # ================================================ @@ -4305,7 +4305,7 @@ FFFC..FFFD ; So # [2] OBJECT REPLACEMENT CHARACTER..REPLACEMENT CHARACTER 1F0B1..1F0BF ; So # [15] PLAYING CARD ACE OF HEARTS..PLAYING CARD RED JOKER 1F0C1..1F0CF ; So # [15] PLAYING CARD ACE OF DIAMONDS..PLAYING CARD BLACK JOKER 1F0D1..1F0F5 ; So # [37] PLAYING CARD ACE OF CLUBS..PLAYING CARD TRUMP-21 -1F10D..1F1AD ; So # [161] CIRCLED ZERO WITH SLASH..MASK WORK SYMBOL +1F10D..1F1AE ; So # [162] CIRCLED ZERO WITH SLASH..TOMOBIKI SYMBOL 1F1E6..1F202 ; So # [29] REGIONAL INDICATOR SYMBOL LETTER A..SQUARED KATAKANA SA 1F210..1F23B ; So # [44] SQUARED CJK UNIFIED IDEOGRAPH-624B..SQUARED CJK UNIFIED IDEOGRAPH-914D 1F240..1F248 ; So # [9] TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-672C..TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-6557 @@ -4315,7 +4315,7 @@ FFFC..FFFD ; So # [2] OBJECT REPLACEMENT CHARACTER..REPLACEMENT CHARACTER 1F400..1F6D8 ; So # [729] RAT..LANDSLIDE 1F6DC..1F6EC ; So # [17] WIRELESS..AIRPLANE ARRIVING 1F6F0..1F6FC ; So # [13] SATELLITE..ROLLER SKATE -1F700..1F7D9 ; So # [218] ALCHEMICAL SYMBOL FOR QUINTESSENCE..NINE POINTED WHITE STAR +1F700..1F7DA ; So # [219] ALCHEMICAL SYMBOL FOR QUINTESSENCE..BLACK CIRCLE WITH WHITE VERTICAL BAR 1F7E0..1F7EB ; So # [12] LARGE ORANGE CIRCLE..LARGE BROWN SQUARE 1F7F0 ; So # HEAVY EQUALS SIGN 1F800..1F80B ; So # [12] LEFTWARDS ARROW WITH SMALL TRIANGLE ARROWHEAD..DOWNWARDS ARROW WITH LARGE TRIANGLE ARROWHEAD @@ -4338,7 +4338,7 @@ FFFC..FFFD ; So # [2] OBJECT REPLACEMENT CHARACTER..REPLACEMENT CHARACTER 1FB94..1FBEF ; So # [92] LEFT HALF INVERSE MEDIUM SHADE AND RIGHT HALF BLOCK..TOP LEFT JUSTIFIED LOWER RIGHT QUARTER BLACK CIRCLE 1FBFA ; So # ALARM BELL SYMBOL -# Total code points: 7469 +# Total code points: 7471 # ================================================ diff --git a/unicodetools/data/ucd/dev/extracted/DerivedLineBreak.txt b/unicodetools/data/ucd/dev/extracted/DerivedLineBreak.txt index 1b8181cb8..aee9715de 100644 --- a/unicodetools/data/ucd/dev/extracted/DerivedLineBreak.txt +++ b/unicodetools/data/ucd/dev/extracted/DerivedLineBreak.txt @@ -1,5 +1,5 @@ # DerivedLineBreak-17.0.0.txt -# Date: 2024-11-16, 02:53:07 GMT +# Date: 2024-12-13, 11:39:51 GMT # © 2024 Unicode®, Inc. # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. # For terms of use and license, see https://www.unicode.org/terms_of_use.html @@ -1803,6 +1803,7 @@ FFE4 ; ID # So FULLWIDTH BROKEN BAR 1F0B1..1F0BF ; ID # So [15] PLAYING CARD ACE OF HEARTS..PLAYING CARD RED JOKER 1F0C1..1F0CF ; ID # So [15] PLAYING CARD ACE OF DIAMONDS..PLAYING CARD BLACK JOKER 1F0D1..1F0F5 ; ID # So [37] PLAYING CARD ACE OF CLUBS..PLAYING CARD TRUMP-21 +1F1AE ; ID # So TOMOBIKI SYMBOL 1F200..1F202 ; ID # So [3] SQUARE HIRAGANA HOKA..SQUARED KATAKANA SA 1F210..1F23B ; ID # So [44] SQUARED CJK UNIFIED IDEOGRAPH-624B..SQUARED CJK UNIFIED IDEOGRAPH-914D 1F240..1F248 ; ID # So [9] TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-672C..TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-6557 @@ -1850,7 +1851,7 @@ FFE4 ; ID # So FULLWIDTH BROKEN BAR 1F6F0..1F6FC ; ID # So [13] SATELLITE..ROLLER SKATE 1F774..1F776 ; ID # So [3] LOT OF FORTUNE..LUNAR ECLIPSE 1F77B..1F77F ; ID # So [5] HAUMEA..ORCUS -1F7D5..1F7D9 ; ID # So [5] CIRCLED TRIANGLE..NINE POINTED WHITE STAR +1F7D5..1F7DA ; ID # So [6] CIRCLED TRIANGLE..BLACK CIRCLE WITH WHITE VERTICAL BAR 1F7E0..1F7EB ; ID # So [12] LARGE ORANGE CIRCLE..LARGE BROWN SQUARE 1F7F0 ; ID # So HEAVY EQUALS SIGN 1F90D..1F90E ; ID # So [2] WHITE HEART..BROWN HEART @@ -1884,7 +1885,7 @@ FFE4 ; ID # So FULLWIDTH BROKEN BAR 30000..3134A ; ID # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A 31350..33479 ; ID # Lo [8490] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-33479 -# The above property value applies to 57546 code points not listed here. +# The above property value applies to 57544 code points not listed here. # Total code points: 172561 # ================================================ diff --git a/unicodetools/data/ucd/dev/extracted/DerivedName.txt b/unicodetools/data/ucd/dev/extracted/DerivedName.txt index 7948745ff..b87b0c2a8 100644 --- a/unicodetools/data/ucd/dev/extracted/DerivedName.txt +++ b/unicodetools/data/ucd/dev/extracted/DerivedName.txt @@ -1,5 +1,5 @@ # DerivedName-17.0.0.txt -# Date: 2024-11-16, 02:53:08 GMT +# Date: 2024-12-13, 11:39:51 GMT # © 2024 Unicode®, Inc. # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. # For terms of use and license, see https://www.unicode.org/terms_of_use.html @@ -43287,6 +43287,7 @@ FFFD ; REPLACEMENT CHARACTER 1F1AB ; SQUARED UHD 1F1AC ; SQUARED VOD 1F1AD ; MASK WORK SYMBOL +1F1AE ; TOMOBIKI SYMBOL 1F1E6 ; REGIONAL INDICATOR SYMBOL LETTER A 1F1E7 ; REGIONAL INDICATOR SYMBOL LETTER B 1F1E8 ; REGIONAL INDICATOR SYMBOL LETTER C @@ -44610,6 +44611,7 @@ FFFD ; REPLACEMENT CHARACTER 1F7D7 ; CIRCLED SQUARE 1F7D8 ; NEGATIVE CIRCLED SQUARE 1F7D9 ; NINE POINTED WHITE STAR +1F7DA ; BLACK CIRCLE WITH WHITE VERTICAL BAR 1F7E0 ; LARGE ORANGE CIRCLE 1F7E1 ; LARGE YELLOW CIRCLE 1F7E2 ; LARGE GREEN CIRCLE @@ -45870,6 +45872,6 @@ E01ED ; VARIATION SELECTOR-254 E01EE ; VARIATION SELECTOR-255 E01EF ; VARIATION SELECTOR-256 -# Total code points: 159834 +# Total code points: 159836 # EOF From fa4656eefa87f9dd0343ea6e4500b17f8e2151b1 Mon Sep 17 00:00:00 2001 From: Robin Leroy Date: Fri, 13 Dec 2024 12:57:12 +0100 Subject: [PATCH 5/8] Should be W --- unicodetools/data/ucd/dev/EastAsianWidth.txt | 6 ++++-- 1 file changed, 4 insertions(+), 2 deletions(-) diff --git a/unicodetools/data/ucd/dev/EastAsianWidth.txt b/unicodetools/data/ucd/dev/EastAsianWidth.txt index 32286a164..f57f1a2b9 100644 --- a/unicodetools/data/ucd/dev/EastAsianWidth.txt +++ b/unicodetools/data/ucd/dev/EastAsianWidth.txt @@ -2612,7 +2612,8 @@ FFFD ; A # So REPLACEMENT CHARACTER 1F18F..1F190 ; A # So [2] NEGATIVE SQUARED WC..SQUARE DJ 1F191..1F19A ; W # So [10] SQUARED CL..SQUARED VS 1F19B..1F1AC ; A # So [18] SQUARED THREE D..SQUARED VOD -1F1AD..1F1AE ; N # So [2] MASK WORK SYMBOL..TOMOBIKI SYMBOL +1F1AD ; N # So [2] MASK WORK SYMBOL..TOMOBIKI SYMBOL +1F1AE ; W 1F1E6..1F1FF ; N # So [26] REGIONAL INDICATOR SYMBOL LETTER A..REGIONAL INDICATOR SYMBOL LETTER Z 1F200..1F202 ; W # So [3] SQUARE HIRAGANA HOKA..SQUARED KATAKANA SA 1F210..1F23B ; W # So [44] SQUARED CJK UNIFIED IDEOGRAPH-624B..SQUARED CJK UNIFIED IDEOGRAPH-914D @@ -2671,7 +2672,8 @@ FFFD ; A # So REPLACEMENT CHARACTER 1F6F0..1F6F3 ; N # So [4] SATELLITE..PASSENGER SHIP 1F6F4..1F6FC ; W # So [9] SCOOTER..ROLLER SKATE 1F700..1F77F ; N # So [128] ALCHEMICAL SYMBOL FOR QUINTESSENCE..ORCUS -1F780..1F7DA ; N # So [91] BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE..BLACK CIRCLE WITH WHITE VERTICAL BAR +1F780..1F7CF ; N # So [91] BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE..BLACK CIRCLE WITH WHITE VERTICAL BAR +1F7DA; W 1F7E0..1F7EB ; W # So [12] LARGE ORANGE CIRCLE..LARGE BROWN SQUARE 1F7F0 ; W # So HEAVY EQUALS SIGN 1F800..1F80B ; N # So [12] LEFTWARDS ARROW WITH SMALL TRIANGLE ARROWHEAD..DOWNWARDS ARROW WITH LARGE TRIANGLE ARROWHEAD From a0884f6795fd7d70e006391952f57e8338956f52 Mon Sep 17 00:00:00 2001 From: Robin Leroy Date: Fri, 13 Dec 2024 13:13:50 +0100 Subject: [PATCH 6/8] Test --- .../text/UCD/AdditionComparisons/170.txt | 33 +++++++++++++++++++ 1 file changed, 33 insertions(+) create mode 100644 unicodetools/src/main/resources/org/unicode/text/UCD/AdditionComparisons/170.txt diff --git a/unicodetools/src/main/resources/org/unicode/text/UCD/AdditionComparisons/170.txt b/unicodetools/src/main/resources/org/unicode/text/UCD/AdditionComparisons/170.txt new file mode 100644 index 000000000..00567b54a --- /dev/null +++ b/unicodetools/src/main/resources/org/unicode/text/UCD/AdditionComparisons/170.txt @@ -0,0 +1,33 @@ +# Symbols: Two Japanese traditional calendar symbols (1F1AE, 1F7DA) +# https://github.com/unicode-org/utc-release-management/issues/170 + +# Names always differ. +# Age always differs since these tests are comparing additions to pre-existing characters. +Ignoring Name Age: + +# Ignore the security and IDNA properties, as these are not yet included for provisionally assigned characters. +Ignoring Confusable_MA Identifier_Status Identifier_Type Idn_Status Idn_Mapping Idn_2008: + +# The other 六曜 symbols [L2/24-151R, p. 1] are all propertywise alike. +Propertywise [\x{25D0} ◐ \N{CIRCLE WITH LEFT HALF BLACK} + \x{25D1} ◑ \N{CIRCLE WITH RIGHT HALF BLACK} + \x{25CF} ● \N{BLACK CIRCLE} + \x{25CB} ○ \N{WHITE CIRCLE}] AreAlike + +Ignoring Block: +Propertywise [\x{1F1AE} \N{TOMOBIKI SYMBOL} + \x{1F7DA} \N{BLACK CIRCLE WITH WHITE VERTICAL BAR}] +CorrespondTo [\x{25D0} ◐ \N{CIRCLE WITH LEFT HALF BLACK}] + UpTo: Bidi_Class (Left_To_Right vs Other_Neutral), + East_Asian_Width (Wide vs Ambiguous), + Line_Break (Ideographic vs Ambiguous), + Extended_Pictographic (Yes vs No), + Math (No vs Yes), + Other_Math (No vs Yes), + Pattern_Syntax (No vs Yes) + +end Ignoring; + +end Ignoring; + +end Ignoring; \ No newline at end of file From 137558a91fb8fc9ed39454ec23398854931e319a Mon Sep 17 00:00:00 2001 From: Robin Leroy Date: Fri, 13 Dec 2024 13:17:37 +0100 Subject: [PATCH 7/8] Regenerate UCD --- unicodetools/data/ucd/dev/EastAsianWidth.txt | 10 +++++----- .../data/ucd/dev/extracted/DerivedEastAsianWidth.txt | 12 +++++++----- 2 files changed, 12 insertions(+), 10 deletions(-) diff --git a/unicodetools/data/ucd/dev/EastAsianWidth.txt b/unicodetools/data/ucd/dev/EastAsianWidth.txt index f57f1a2b9..e442c5b39 100644 --- a/unicodetools/data/ucd/dev/EastAsianWidth.txt +++ b/unicodetools/data/ucd/dev/EastAsianWidth.txt @@ -1,5 +1,5 @@ # EastAsianWidth-17.0.0.txt -# Date: 2024-12-13, 11:39:53 GMT +# Date: 2024-12-13, 12:16:27 GMT # © 2024 Unicode®, Inc. # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. # For terms of use and license, see https://www.unicode.org/terms_of_use.html @@ -2612,8 +2612,8 @@ FFFD ; A # So REPLACEMENT CHARACTER 1F18F..1F190 ; A # So [2] NEGATIVE SQUARED WC..SQUARE DJ 1F191..1F19A ; W # So [10] SQUARED CL..SQUARED VS 1F19B..1F1AC ; A # So [18] SQUARED THREE D..SQUARED VOD -1F1AD ; N # So [2] MASK WORK SYMBOL..TOMOBIKI SYMBOL -1F1AE ; W +1F1AD ; N # So MASK WORK SYMBOL +1F1AE ; W # So TOMOBIKI SYMBOL 1F1E6..1F1FF ; N # So [26] REGIONAL INDICATOR SYMBOL LETTER A..REGIONAL INDICATOR SYMBOL LETTER Z 1F200..1F202 ; W # So [3] SQUARE HIRAGANA HOKA..SQUARED KATAKANA SA 1F210..1F23B ; W # So [44] SQUARED CJK UNIFIED IDEOGRAPH-624B..SQUARED CJK UNIFIED IDEOGRAPH-914D @@ -2672,8 +2672,8 @@ FFFD ; A # So REPLACEMENT CHARACTER 1F6F0..1F6F3 ; N # So [4] SATELLITE..PASSENGER SHIP 1F6F4..1F6FC ; W # So [9] SCOOTER..ROLLER SKATE 1F700..1F77F ; N # So [128] ALCHEMICAL SYMBOL FOR QUINTESSENCE..ORCUS -1F780..1F7CF ; N # So [91] BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE..BLACK CIRCLE WITH WHITE VERTICAL BAR -1F7DA; W +1F780..1F7D9 ; N # So [90] BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE..NINE POINTED WHITE STAR +1F7DA ; W # So BLACK CIRCLE WITH WHITE VERTICAL BAR 1F7E0..1F7EB ; W # So [12] LARGE ORANGE CIRCLE..LARGE BROWN SQUARE 1F7F0 ; W # So HEAVY EQUALS SIGN 1F800..1F80B ; N # So [12] LEFTWARDS ARROW WITH SMALL TRIANGLE ARROWHEAD..DOWNWARDS ARROW WITH LARGE TRIANGLE ARROWHEAD diff --git a/unicodetools/data/ucd/dev/extracted/DerivedEastAsianWidth.txt b/unicodetools/data/ucd/dev/extracted/DerivedEastAsianWidth.txt index 052ebb230..f16b278d5 100644 --- a/unicodetools/data/ucd/dev/extracted/DerivedEastAsianWidth.txt +++ b/unicodetools/data/ucd/dev/extracted/DerivedEastAsianWidth.txt @@ -1,5 +1,5 @@ # DerivedEastAsianWidth-17.0.0.txt -# Date: 2024-12-13, 11:39:49 GMT +# Date: 2024-12-13, 12:16:21 GMT # © 2024 Unicode®, Inc. # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. # For terms of use and license, see https://www.unicode.org/terms_of_use.html @@ -2098,7 +2098,7 @@ FFFC ; N # So OBJECT REPLACEMENT CHARACTER 1F10D..1F10F ; N # So [3] CIRCLED ZERO WITH SLASH..CIRCLED DOLLAR SIGN WITH OVERLAID BACKSLASH 1F12E..1F12F ; N # So [2] CIRCLED WZ..COPYLEFT SYMBOL 1F16A..1F16F ; N # So [6] RAISED MC SIGN..CIRCLED HUMAN FIGURE -1F1AD..1F1AE ; N # So [2] MASK WORK SYMBOL..TOMOBIKI SYMBOL +1F1AD ; N # So MASK WORK SYMBOL 1F1E6..1F1FF ; N # So [26] REGIONAL INDICATOR SYMBOL LETTER A..REGIONAL INDICATOR SYMBOL LETTER Z 1F321..1F32C ; N # So [12] THERMOMETER..WIND BLOWING FACE 1F336 ; N # So HOT PEPPER @@ -2123,7 +2123,7 @@ FFFC ; N # So OBJECT REPLACEMENT CHARACTER 1F6D3..1F6D4 ; N # So [2] STUPA..PAGODA 1F6E0..1F6EA ; N # So [11] HAMMER AND WRENCH..NORTHEAST-POINTING AIRPLANE 1F6F0..1F6F3 ; N # So [4] SATELLITE..PASSENGER SHIP -1F700..1F7DA ; N # So [219] ALCHEMICAL SYMBOL FOR QUINTESSENCE..BLACK CIRCLE WITH WHITE VERTICAL BAR +1F700..1F7D9 ; N # So [218] ALCHEMICAL SYMBOL FOR QUINTESSENCE..NINE POINTED WHITE STAR 1F800..1F80B ; N # So [12] LEFTWARDS ARROW WITH SMALL TRIANGLE ARROWHEAD..DOWNWARDS ARROW WITH LARGE TRIANGLE ARROWHEAD 1F810..1F847 ; N # So [56] LEFTWARDS ARROW WITH SMALL EQUILATERAL ARROWHEAD..DOWNWARDS HEAVY ARROW 1F850..1F859 ; N # So [10] LEFTWARDS SANS-SERIF ARROW..UP DOWN SANS-SERIF ARROW @@ -2145,7 +2145,7 @@ E0001 ; N # Cf LANGUAGE TAG E0020..E007F ; N # Cf [96] TAG SPACE..CANCEL TAG # The above property value applies to 760564 code points not listed here. -# Total code points: 792267 +# Total code points: 792265 # ================================================ @@ -2567,6 +2567,7 @@ FE6A..FE6B ; W # Po [2] SMALL PERCENT SIGN..SMALL COMMERCIAL AT 1F0CF ; W # So PLAYING CARD BLACK JOKER 1F18E ; W # So NEGATIVE SQUARED AB 1F191..1F19A ; W # So [10] SQUARED CL..SQUARED VS +1F1AE ; W # So TOMOBIKI SYMBOL 1F200..1F202 ; W # So [3] SQUARE HIRAGANA HOKA..SQUARED KATAKANA SA 1F210..1F23B ; W # So [44] SQUARED CJK UNIFIED IDEOGRAPH-624B..SQUARED CJK UNIFIED IDEOGRAPH-914D 1F240..1F248 ; W # So [9] TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-672C..TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-6557 @@ -2599,6 +2600,7 @@ FE6A..FE6B ; W # Po [2] SMALL PERCENT SIGN..SMALL COMMERCIAL AT 1F6DC..1F6DF ; W # So [4] WIRELESS..RING BUOY 1F6EB..1F6EC ; W # So [2] AIRPLANE DEPARTURE..AIRPLANE ARRIVING 1F6F4..1F6FC ; W # So [9] SCOOTER..ROLLER SKATE +1F7DA ; W # So BLACK CIRCLE WITH WHITE VERTICAL BAR 1F7E0..1F7EB ; W # So [12] LARGE ORANGE CIRCLE..LARGE BROWN SQUARE 1F7F0 ; W # So HEAVY EQUALS SIGN 1F90C..1F93A ; W # So [47] PINCHED FINGERS..FENCER @@ -2622,7 +2624,7 @@ FE6A..FE6B ; W # Po [2] SMALL PERCENT SIGN..SMALL COMMERCIAL AT 31350..33479 ; W # Lo [8490] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-33479 # The above property value applies to 56179 code points not listed here. -# Total code points: 182768 +# Total code points: 182770 # ================================================ From cd87456648c396cf136c686d38fd5e0e58b9c48f Mon Sep 17 00:00:00 2001 From: Robin Leroy Date: Thu, 19 Dec 2024 22:35:53 +0100 Subject: [PATCH 8/8] do not expect ExtPict as amended in PAG discussion --- .../resources/org/unicode/text/UCD/AdditionComparisons/170.txt | 1 - 1 file changed, 1 deletion(-) diff --git a/unicodetools/src/main/resources/org/unicode/text/UCD/AdditionComparisons/170.txt b/unicodetools/src/main/resources/org/unicode/text/UCD/AdditionComparisons/170.txt index 00567b54a..1fce65cd8 100644 --- a/unicodetools/src/main/resources/org/unicode/text/UCD/AdditionComparisons/170.txt +++ b/unicodetools/src/main/resources/org/unicode/text/UCD/AdditionComparisons/170.txt @@ -21,7 +21,6 @@ CorrespondTo [\x{25D0} ◐ \N{CIRCLE WITH LEFT HALF BLACK}] UpTo: Bidi_Class (Left_To_Right vs Other_Neutral), East_Asian_Width (Wide vs Ambiguous), Line_Break (Ideographic vs Ambiguous), - Extended_Pictographic (Yes vs No), Math (No vs Yes), Other_Math (No vs Yes), Pattern_Syntax (No vs Yes)