Update to Unicode 16.0.0 [BZ #32168]

Unicode 16.0.0 Support: Character encoding, character type info, and
transliteration tables are all updated to Unicode 16.0.0, using
the generator scripts contributed by Mike FABIAN (Red Hat).

Changes in CHARMAP and WIDTH:

    Total added characters in newly generated CHARMAP: 5185
    Total removed characters in newly generated WIDTH: 1
    Total added characters in newly generated WIDTH: 170

The removed character from WIDTH is U+1171E AHOM CONSONANT SIGN MEDIAL RA.
It changed like this:

UnicodeData.txt 15.1.0: 1171E;AHOM CONSONANT SIGN MEDIAL RA;Mn;0;NSM;;;;;N;;;;;
UnicodeData.txt 16.0.0: 1171E;AHOM CONSONANT SIGN MEDIAL RA;Mc;0;L;;;;;N;;;;;

EastAsianWidth.txt 15.1.0: 1171D..1171F   ; N  # Mn     [3] AHOM CONSONANT SIGN MEDIAL LA..AHOM CONSONANT SIGN MEDIAL LIGATING RA
EastAsianWidth.txt 16.0.0: 1171E          ; N  # Mc         AHOM CONSONANT SIGN MEDIAL RA

I.e it changed from Mn (Mark Nonspacing) to Mc (Mark Spacing
combining). So it should now have width 1 instead of 0, therefore it
is OK that it was removed from WIDTH, characters not in WIDTH get
width 1 by default.

Nothing suspicious when browsing the list of the 170 added characters.

Changes in ctype:

    alpha: Added 4452 characters in new ctype which were not in old ctype
    combining: Added 51 characters in new ctype which were not in old ctype
    combining_level3: Added 43 characters in new ctype which were not in old ctype
    graph: Added 5185 characters in new ctype which were not in old ctype
    lower: Added 25 characters in new ctype which were not in old ctype
    print: Added 5185 characters in new ctype which were not in old ctype
    punct: Missing 33 characters of old ctype in new ctype
    punct: Added 766 characters in new ctype which were not in old ctype
    tolower: Added 27 characters in new ctype which were not in old ctype
    totitle: Added 27 characters in new ctype which were not in old ctype
    toupper: Added 27 characters in new ctype which were not in old ctype
    upper: Added 27 characters in new ctype which were not in old ctype

Nothing suspicous in the additions.

About the 33 characters removed from `punct`:

U+0363 - U+036F are identical in UnicodeData.txt. Difference in DerivedCoreProperties.txt:

DerivedCoreProperties.txt 15.1.0: not there.
DerivedCoreProperties.txt 16.0.0: 0363..036F    ; Alphabetic # Mn  [13] COMBINING LATIN SMALL LETTER A..COMBINING LATIN SMALL LETTER X

So that’s the reason why they are added to `alpha` and removed from `punct`.

Same for U+1DD3 - U+1DE6, they are identical in UnicodeData.txt but there is a difference in DerivedCoreProperties.txt:

DerivedCoreProperties.txt 15.1.0: 1DE7..1DF4    ; Alphabetic # Mn  [14] COMBINING LATIN SMALL LETTER ALPHA..COMBINING LATIN SMALL LETTER U WITH DIAERESIS
DerivedCoreProperties.txt 16.0.0: 1DD3..1DF4    ; Alphabetic # Mn  [34] COMBINING LATIN SMALL LETTER FLATTENED OPEN A ABOVE..COMBINING LATIN SMALL LETTER U WITH DIAERESIS

So they became `Alphabetic` and were thus added to `alpha` and removed from `punct`.

Resolves: BZ #32168

Reviewed-by: Carlos O'Donell <carlos@redhat.com>
This commit is contained in:
Mike FABIAN 2024-09-12 15:02:55 +02:00
parent f47596fcfe
commit a7b5eb821d
15 changed files with 14404 additions and 3099 deletions

View File

@ -169,7 +169,7 @@ gettext:
# The following files are shared with the upstream Unicode project and must be
# updated regularly to stay in sync with the upstream unicode releases.
#
# Merged from Unicode 15.1.0 release.
# Merged from Unicode 16.0.0 release.
unicode:
localedata/unicode-gen/UnicodeData.txt
localedata/unicode-gen/unicode-license.txt

File diff suppressed because it is too large Load Diff

File diff suppressed because it is too large Load Diff

File diff suppressed because it is too large Load Diff

View File

@ -9,7 +9,7 @@ comment_char %
% otherwise be governed by that license.
% Transliterations of encircled characters.
% Generated automatically from UnicodeData.txt by gen_translit_circle.py on 2023-09-15 for Unicode 15.1.0.
% Generated automatically from UnicodeData.txt by gen_translit_circle.py on 2024-09-12 for Unicode 16.0.0.
LC_CTYPE

View File

@ -9,7 +9,7 @@ comment_char %
% otherwise be governed by that license.
% Transliterations of CJK compatibility characters.
% Generated automatically from UnicodeData.txt by gen_translit_cjk_compat.py on 2023-09-15 for Unicode 15.1.0.
% Generated automatically from UnicodeData.txt by gen_translit_cjk_compat.py on 2024-09-12 for Unicode 16.0.0.
LC_CTYPE

View File

@ -10,7 +10,7 @@ comment_char %
% Transliterations that remove all combining characters (accents,
% pronounciation marks, etc.).
% Generated automatically from UnicodeData.txt by gen_translit_combining.py on 2023-09-15 for Unicode 15.1.0.
% Generated automatically from UnicodeData.txt by gen_translit_combining.py on 2024-09-12 for Unicode 16.0.0.
LC_CTYPE
@ -446,6 +446,8 @@ translit_start
<U06EC> ""
% ARABIC SMALL LOW MEEM
<U06ED> ""
% ARABIC PEPET
<U0897> ""
% ARABIC SMALL HIGH WORD AL-JUZ
<U0898> ""
% ARABIC SMALL LOW WORD ISHMAAM
@ -878,10 +880,22 @@ translit_start
<U00010D26> ""
% HANIFI ROHINGYA SIGN TASSI
<U00010D27> ""
% GARAY VOWEL SIGN E
<U00010D69> ""
% GARAY CONSONANT GEMINATION MARK
<U00010D6A> ""
% GARAY COMBINING DOT ABOVE
<U00010D6B> ""
% GARAY COMBINING DOUBLE DOT ABOVE
<U00010D6C> ""
% GARAY CONSONANT NASALIZATION MARK
<U00010D6D> ""
% YEZIDI COMBINING HAMZA MARK
<U00010EAB> ""
% YEZIDI COMBINING MADDA MARK
<U00010EAC> ""
% ARABIC COMBINING ALEF OVERLAY
<U00010EFC> ""
% ARABIC SMALL LOW WORD SAKTA
<U00010EFD> ""
% ARABIC SMALL LOW WORD QASR
@ -920,6 +934,48 @@ translit_start
<U00010F85> ""
% COMBINING BINDU BELOW
<U0001133B> ""
% TULU-TIGALARI VOWEL SIGN AA
<U000113B8> ""
% TULU-TIGALARI VOWEL SIGN I
<U000113B9> ""
% TULU-TIGALARI VOWEL SIGN II
<U000113BA> ""
% TULU-TIGALARI VOWEL SIGN U
<U000113BB> ""
% TULU-TIGALARI VOWEL SIGN UU
<U000113BC> ""
% TULU-TIGALARI VOWEL SIGN VOCALIC R
<U000113BD> ""
% TULU-TIGALARI VOWEL SIGN VOCALIC RR
<U000113BE> ""
% TULU-TIGALARI VOWEL SIGN VOCALIC L
<U000113BF> ""
% TULU-TIGALARI VOWEL SIGN VOCALIC LL
<U000113C0> ""
% TULU-TIGALARI VOWEL SIGN EE
<U000113C2> ""
% TULU-TIGALARI VOWEL SIGN AI
<U000113C5> ""
% TULU-TIGALARI VOWEL SIGN OO
<U000113C7> ""
% TULU-TIGALARI VOWEL SIGN AU
<U000113C8> ""
% TULU-TIGALARI AU LENGTH MARK
<U000113C9> ""
% TULU-TIGALARI SIGN CANDRA ANUNASIKA
<U000113CA> ""
% TULU-TIGALARI SIGN ANUSVARA
<U000113CC> ""
% TULU-TIGALARI SIGN VISARGA
<U000113CD> ""
% TULU-TIGALARI SIGN VIRAMA
<U000113CE> ""
% TULU-TIGALARI SIGN LOOPED VIRAMA
<U000113CF> ""
% TULU-TIGALARI CONJOINER
<U000113D0> ""
% TULU-TIGALARI GEMINATION MARK
<U000113D2> ""
% NEWA VOWEL SIGN AA
<U00011435> ""
% NEWA VOWEL SIGN I
@ -1346,6 +1402,8 @@ translit_start
<U00011F41> ""
% KAWI CONJOINER
<U00011F42> ""
% KAWI SIGN NUKTA
<U00011F5A> ""
% EGYPTIAN HIEROGLYPH MIRROR HORIZONTALLY
<U00013440> ""
% EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP START
@ -1378,6 +1436,42 @@ translit_start
<U00013454> ""
% EGYPTIAN HIEROGLYPH MODIFIER DAMAGED
<U00013455> ""
% GURUNG KHEMA VOWEL SIGN AA
<U0001611E> ""
% GURUNG KHEMA VOWEL SIGN I
<U0001611F> ""
% GURUNG KHEMA VOWEL SIGN II
<U00016120> ""
% GURUNG KHEMA VOWEL SIGN U
<U00016121> ""
% GURUNG KHEMA VOWEL SIGN UU
<U00016122> ""
% GURUNG KHEMA VOWEL SIGN E
<U00016123> ""
% GURUNG KHEMA VOWEL SIGN EE
<U00016124> ""
% GURUNG KHEMA VOWEL SIGN AI
<U00016125> ""
% GURUNG KHEMA VOWEL SIGN O
<U00016126> ""
% GURUNG KHEMA VOWEL SIGN OO
<U00016127> ""
% GURUNG KHEMA VOWEL SIGN AU
<U00016128> ""
% GURUNG KHEMA VOWEL LENGTH MARK
<U00016129> ""
% GURUNG KHEMA CONSONANT SIGN MEDIAL YA
<U0001612A> ""
% GURUNG KHEMA CONSONANT SIGN MEDIAL VA
<U0001612B> ""
% GURUNG KHEMA CONSONANT SIGN MEDIAL HA
<U0001612C> ""
% GURUNG KHEMA SIGN ANUSVARA
<U0001612D> ""
% GURUNG KHEMA CONSONANT SIGN MEDIAL RA
<U0001612E> ""
% GURUNG KHEMA SIGN THOLHOMA
<U0001612F> ""
% KHITAN SMALL SCRIPT FILLER
<U00016FE4> ""
% VIETNAMESE ALTERNATE READING MARK CA
@ -1636,6 +1730,10 @@ translit_start
<U0001E4EE> ""
% NAG MUNDARI SIGN SUTUH
<U0001E4EF> ""
% OL ONAL SIGN MU
<U0001E5EE> ""
% OL ONAL SIGN IKIR
<U0001E5EF> ""
% ADLAM ALIF LENGTHENER
<U0001E944> ""
% ADLAM VOWEL LENGTHENER
@ -3705,6 +3803,24 @@ translit_start
<UFB4D> <U05DB>
% HEBREW LETTER PE WITH RAFE
<UFB4E> <U05E4>
% TODHRI LETTER EI
<U000105C9> <U000105D2>
% TODHRI LETTER U
<U000105E4> <U000105DA>
% TULU-TIGALARI LETTER II
<U00011383> <U00011382>
% TULU-TIGALARI LETTER UU
<U00011385> <U00011384>
% TULU-TIGALARI LETTER AI
<U0001138E> <U0001138B>
% TULU-TIGALARI LETTER AU
<U00011391> <U00011390>
% KIRAT RAI VOWEL SIGN AI
<U00016D68> "<U00016D67><U00016D67>"
% KIRAT RAI VOWEL SIGN O
<U00016D69> "<U00016D63><U00016D67>"
% KIRAT RAI VOWEL SIGN AU
<U00016D6A> "<U00016D63><U00016D67><U00016D67>"
translit_end

View File

@ -9,7 +9,7 @@ comment_char %
% otherwise be governed by that license.
% Transliterations of compatibility characters and ligatures.
% Generated automatically from UnicodeData.txt by gen_translit_compat.py on 2023-09-15 for Unicode 15.1.0.
% Generated automatically from UnicodeData.txt by gen_translit_compat.py on 2024-09-12 for Unicode 16.0.0.
LC_CTYPE

View File

@ -9,7 +9,7 @@ comment_char %
% otherwise be governed by that license.
% Transliterations of font equivalents.
% Generated automatically from UnicodeData.txt by gen_translit_font.py on 2023-09-15 for Unicode 15.1.0.
% Generated automatically from UnicodeData.txt by gen_translit_font.py on 2024-09-12 for Unicode 16.0.0.
LC_CTYPE
@ -62,6 +62,42 @@ translit_start
<UFB27> <U05E8> % HEBREW LETTER WIDE RESH
<UFB28> <U05EA> % HEBREW LETTER WIDE TAV
<UFB29> <U002B> % HEBREW LETTER ALTERNATIVE PLUS SIGN
<U0001CCD6> <U0041> % OUTLINED LATIN CAPITAL LETTER A
<U0001CCD7> <U0042> % OUTLINED LATIN CAPITAL LETTER B
<U0001CCD8> <U0043> % OUTLINED LATIN CAPITAL LETTER C
<U0001CCD9> <U0044> % OUTLINED LATIN CAPITAL LETTER D
<U0001CCDA> <U0045> % OUTLINED LATIN CAPITAL LETTER E
<U0001CCDB> <U0046> % OUTLINED LATIN CAPITAL LETTER F
<U0001CCDC> <U0047> % OUTLINED LATIN CAPITAL LETTER G
<U0001CCDD> <U0048> % OUTLINED LATIN CAPITAL LETTER H
<U0001CCDE> <U0049> % OUTLINED LATIN CAPITAL LETTER I
<U0001CCDF> <U004A> % OUTLINED LATIN CAPITAL LETTER J
<U0001CCE0> <U004B> % OUTLINED LATIN CAPITAL LETTER K
<U0001CCE1> <U004C> % OUTLINED LATIN CAPITAL LETTER L
<U0001CCE2> <U004D> % OUTLINED LATIN CAPITAL LETTER M
<U0001CCE3> <U004E> % OUTLINED LATIN CAPITAL LETTER N
<U0001CCE4> <U004F> % OUTLINED LATIN CAPITAL LETTER O
<U0001CCE5> <U0050> % OUTLINED LATIN CAPITAL LETTER P
<U0001CCE6> <U0051> % OUTLINED LATIN CAPITAL LETTER Q
<U0001CCE7> <U0052> % OUTLINED LATIN CAPITAL LETTER R
<U0001CCE8> <U0053> % OUTLINED LATIN CAPITAL LETTER S
<U0001CCE9> <U0054> % OUTLINED LATIN CAPITAL LETTER T
<U0001CCEA> <U0055> % OUTLINED LATIN CAPITAL LETTER U
<U0001CCEB> <U0056> % OUTLINED LATIN CAPITAL LETTER V
<U0001CCEC> <U0057> % OUTLINED LATIN CAPITAL LETTER W
<U0001CCED> <U0058> % OUTLINED LATIN CAPITAL LETTER X
<U0001CCEE> <U0059> % OUTLINED LATIN CAPITAL LETTER Y
<U0001CCEF> <U005A> % OUTLINED LATIN CAPITAL LETTER Z
<U0001CCF0> <U0030> % OUTLINED DIGIT ZERO
<U0001CCF1> <U0031> % OUTLINED DIGIT ONE
<U0001CCF2> <U0032> % OUTLINED DIGIT TWO
<U0001CCF3> <U0033> % OUTLINED DIGIT THREE
<U0001CCF4> <U0034> % OUTLINED DIGIT FOUR
<U0001CCF5> <U0035> % OUTLINED DIGIT FIVE
<U0001CCF6> <U0036> % OUTLINED DIGIT SIX
<U0001CCF7> <U0037> % OUTLINED DIGIT SEVEN
<U0001CCF8> <U0038> % OUTLINED DIGIT EIGHT
<U0001CCF9> <U0039> % OUTLINED DIGIT NINE
<U0001D400> <U0041> % MATHEMATICAL BOLD CAPITAL A
<U0001D401> <U0042> % MATHEMATICAL BOLD CAPITAL B
<U0001D402> <U0043> % MATHEMATICAL BOLD CAPITAL C

View File

@ -9,7 +9,7 @@ comment_char %
% otherwise be governed by that license.
% Transliterations of fractions.
% Generated automatically from UnicodeData.txt by gen_translit_fraction.py on 2023-09-15 for Unicode 15.1.0.
% Generated automatically from UnicodeData.txt by gen_translit_fraction.py on 2024-09-12 for Unicode 16.0.0.
% The replacements have been surrounded with spaces, because fractions are
% often preceded by a decimal number and followed by a unit or a math symbol.

File diff suppressed because it is too large Load Diff

View File

@ -1,8 +1,8 @@
# EastAsianWidth-15.1.0.txt
# Date: 2023-07-28, 23:34:08 GMT
# © 2023 Unicode®, Inc.
# EastAsianWidth-16.0.0.txt
# Date: 2024-04-30, 21:48:20 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use, see https://www.unicode.org/terms_of_use.html
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
#
# Unicode Character Database
# For documentation, see https://www.unicode.org/reports/tr44/
@ -334,7 +334,7 @@
0888 ; N # Sk ARABIC RAISED ROUND DOT
0889..088E ; N # Lo [6] ARABIC LETTER NOON WITH INVERTED SMALL V..ARABIC VERTICAL TAIL
0890..0891 ; N # Cf [2] ARABIC POUND MARK ABOVE..ARABIC PIASTRE MARK ABOVE
0898..089F ; N # Mn [8] ARABIC SMALL HIGH WORD AL-JUZ..ARABIC HALF MADDA OVER MADDA
0897..089F ; N # Mn [9] ARABIC PEPET..ARABIC HALF MADDA OVER MADDA
08A0..08C8 ; N # Lo [41] ARABIC LETTER BEH WITH SMALL V BELOW..ARABIC LETTER GRAF
08C9 ; N # Lm ARABIC SMALL FARSI YEH
08CA..08E1 ; N # Mn [24] ARABIC SMALL HIGH FARSI YEH..ARABIC SMALL HIGH SIGN SAFHA
@ -819,12 +819,13 @@
1B42 ; N # Mn BALINESE VOWEL SIGN PEPET
1B43..1B44 ; N # Mc [2] BALINESE VOWEL SIGN PEPET TEDUNG..BALINESE ADEG ADEG
1B45..1B4C ; N # Lo [8] BALINESE LETTER KAF SASAK..BALINESE LETTER ARCHAIC JNYA
1B4E..1B4F ; N # Po [2] BALINESE INVERTED CARIK SIKI..BALINESE INVERTED CARIK PAREREN
1B50..1B59 ; N # Nd [10] BALINESE DIGIT ZERO..BALINESE DIGIT NINE
1B5A..1B60 ; N # Po [7] BALINESE PANTI..BALINESE PAMENENG
1B61..1B6A ; N # So [10] BALINESE MUSICAL SYMBOL DONG..BALINESE MUSICAL SYMBOL DANG GEDE
1B6B..1B73 ; N # Mn [9] BALINESE MUSICAL SYMBOL COMBINING TEGEH..BALINESE MUSICAL SYMBOL COMBINING GONG
1B74..1B7C ; N # So [9] BALINESE MUSICAL SYMBOL RIGHT-HAND OPEN DUG..BALINESE MUSICAL SYMBOL LEFT-HAND OPEN PING
1B7D..1B7E ; N # Po [2] BALINESE PANTI LANTANG..BALINESE PAMADA LANTANG
1B7D..1B7F ; N # Po [3] BALINESE PANTI LANTANG..BALINESE PANTI BAWAK
1B80..1B81 ; N # Mn [2] SUNDANESE SIGN PANYECEK..SUNDANESE SIGN PANGLAYAR
1B82 ; N # Mc SUNDANESE SIGN PANGWISAD
1B83..1BA0 ; N # Lo [30] SUNDANESE LETTER A..SUNDANESE LETTER HA
@ -859,7 +860,7 @@
1C5A..1C77 ; N # Lo [30] OL CHIKI LETTER LA..OL CHIKI LETTER OH
1C78..1C7D ; N # Lm [6] OL CHIKI MU TTUDDAG..OL CHIKI AHAD
1C7E..1C7F ; N # Po [2] OL CHIKI PUNCTUATION MUCAAD..OL CHIKI PUNCTUATION DOUBLE MUCAAD
1C80..1C88 ; N # Ll [9] CYRILLIC SMALL LETTER ROUNDED VE..CYRILLIC SMALL LETTER UNBLENDED UK
1C80..1C8A ; N # L& [11] CYRILLIC SMALL LETTER ROUNDED VE..CYRILLIC SMALL LETTER TJE
1C90..1CBA ; N # Lu [43] GEORGIAN MTAVRULI CAPITAL LETTER AN..GEORGIAN MTAVRULI CAPITAL LETTER AIN
1CBD..1CBF ; N # Lu [3] GEORGIAN MTAVRULI CAPITAL LETTER AEN..GEORGIAN MTAVRULI CAPITAL LETTER LABIAL SIGN
1CC0..1CC7 ; N # Po [8] SUNDANESE PUNCTUATION BINDU SURYA..SUNDANESE PUNCTUATION BINDU BA SATANGA
@ -1142,7 +1143,7 @@
23F1..23F2 ; N # So [2] STOPWATCH..TIMER CLOCK
23F3 ; W # So HOURGLASS WITH FLOWING SAND
23F4..23FF ; N # So [12] BLACK MEDIUM LEFT-POINTING TRIANGLE..OBSERVER EYE SYMBOL
2400..2426 ; N # So [39] SYMBOL FOR NULL..SYMBOL FOR SUBSTITUTE FORM TWO
2400..2429 ; N # So [42] SYMBOL FOR NULL..SYMBOL FOR DELETE MEDIUM SHADE FORM
2440..244A ; N # So [11] OCR HOOK..OCR DOUBLE BACKSLASH
2460..249B ; A # No [60] CIRCLED DIGIT ONE..NUMBER TWENTY FULL STOP
249C..24E9 ; A # So [78] PARENTHESIZED LATIN SMALL LETTER A..CIRCLED LATIN SMALL LETTER Z
@ -1195,7 +1196,9 @@
261C ; A # So WHITE LEFT POINTING INDEX
261D ; N # So WHITE UP POINTING INDEX
261E ; A # So WHITE RIGHT POINTING INDEX
261F..263F ; N # So [33] WHITE DOWN POINTING INDEX..MERCURY
261F..262F ; N # So [17] WHITE DOWN POINTING INDEX..YIN YANG
2630..2637 ; W # So [8] TRIGRAM FOR HEAVEN..TRIGRAM FOR EARTH
2638..263F ; N # So [8] WHEEL OF DHARMA..MERCURY
2640 ; A # So FEMALE SIGN
2641 ; N # So EARTH
2642 ; A # So MALE SIGN
@ -1213,7 +1216,9 @@
266F ; A # Sm MUSIC SHARP SIGN
2670..267E ; N # So [15] WEST SYRIAC CROSS..PERMANENT PAPER SIGN
267F ; W # So WHEELCHAIR SYMBOL
2680..2692 ; N # So [19] DIE FACE-1..HAMMER AND PICK
2680..2689 ; N # So [10] DIE FACE-1..BLACK CIRCLE WITH TWO WHITE DOTS
268A..268F ; W # So [6] MONOGRAM FOR YANG..DIGRAM FOR GREATER YIN
2690..2692 ; N # So [3] WHITE FLAG..HAMMER AND PICK
2693 ; W # So ANCHOR
2694..269D ; N # So [10] CROSSED SWORDS..OUTLINED WHITE STAR
269E..269F ; A # So [2] THREE LINES CONVERGING RIGHT..THREE LINES CONVERGING LEFT
@ -1487,7 +1492,7 @@
3192..3195 ; W # No [4] IDEOGRAPHIC ANNOTATION ONE MARK..IDEOGRAPHIC ANNOTATION FOUR MARK
3196..319F ; W # So [10] IDEOGRAPHIC ANNOTATION TOP MARK..IDEOGRAPHIC ANNOTATION MAN MARK
31A0..31BF ; W # Lo [32] BOPOMOFO LETTER BU..BOPOMOFO LETTER AH
31C0..31E3 ; W # So [36] CJK STROKE T..CJK STROKE Q
31C0..31E5 ; W # So [38] CJK STROKE T..CJK STROKE SZP
31EF ; W # So IDEOGRAPHIC DESCRIPTION CHARACTER SUBTRACTION
31F0..31FF ; W # Lo [16] KATAKANA LETTER SMALL KU..KATAKANA LETTER SMALL RO
3200..321E ; W # So [31] PARENTHESIZED HANGUL KIYEOK..PARENTHESIZED KOREAN CHARACTER O HU
@ -1503,7 +1508,7 @@
32C0..32FF ; W # So [64] IDEOGRAPHIC TELEGRAPH SYMBOL FOR JANUARY..SQUARE ERA NAME REIWA
3300..33FF ; W # So [256] SQUARE APAATO..SQUARE GAL
3400..4DBF ; W # Lo [6592] CJK UNIFIED IDEOGRAPH-3400..CJK UNIFIED IDEOGRAPH-4DBF
4DC0..4DFF ; N # So [64] HEXAGRAM FOR THE CREATIVE HEAVEN..HEXAGRAM FOR BEFORE COMPLETION
4DC0..4DFF ; W # So [64] HEXAGRAM FOR THE CREATIVE HEAVEN..HEXAGRAM FOR BEFORE COMPLETION
4E00..9FFF ; W # Lo [20992] CJK UNIFIED IDEOGRAPH-4E00..CJK UNIFIED IDEOGRAPH-9FFF
A000..A014 ; W # Lo [21] YI SYLLABLE IT..YI SYLLABLE E
A015 ; W # Lm YI SYLLABLE WU
@ -1543,10 +1548,10 @@ A788 ; N # Lm MODIFIER LETTER LOW CIRCUMFLEX ACCENT
A789..A78A ; N # Sk [2] MODIFIER LETTER COLON..MODIFIER LETTER SHORT EQUALS SIGN
A78B..A78E ; N # L& [4] LATIN CAPITAL LETTER SALTILLO..LATIN SMALL LETTER L WITH RETROFLEX HOOK AND BELT
A78F ; N # Lo LATIN LETTER SINOLOGICAL DOT
A790..A7CA ; N # L& [59] LATIN CAPITAL LETTER N WITH DESCENDER..LATIN SMALL LETTER S WITH SHORT STROKE OVERLAY
A790..A7CD ; N # L& [62] LATIN CAPITAL LETTER N WITH DESCENDER..LATIN SMALL LETTER S WITH DIAGONAL STROKE
A7D0..A7D1 ; N # L& [2] LATIN CAPITAL LETTER CLOSED INSULAR G..LATIN SMALL LETTER CLOSED INSULAR G
A7D3 ; N # Ll LATIN SMALL LETTER DOUBLE THORN
A7D5..A7D9 ; N # L& [5] LATIN SMALL LETTER DOUBLE WYNN..LATIN SMALL LETTER SIGMOID S
A7D5..A7DC ; N # L& [8] LATIN SMALL LETTER DOUBLE WYNN..LATIN CAPITAL LETTER LAMBDA WITH STROKE
A7F2..A7F4 ; N # Lm [3] MODIFIER LETTER CAPITAL C..MODIFIER LETTER CAPITAL Q
A7F5..A7F6 ; N # L& [2] LATIN CAPITAL LETTER REVERSED HALF H..LATIN SMALL LETTER REVERSED HALF H
A7F7 ; N # Lo LATIN EPIGRAPHIC LETTER SIDEWAYS I
@ -1870,6 +1875,7 @@ FFFD ; A # So REPLACEMENT CHARACTER
105A3..105B1 ; N # Ll [15] VITHKUQI SMALL LETTER HA..VITHKUQI SMALL LETTER RE
105B3..105B9 ; N # Ll [7] VITHKUQI SMALL LETTER SE..VITHKUQI SMALL LETTER XE
105BB..105BC ; N # Ll [2] VITHKUQI SMALL LETTER Y..VITHKUQI SMALL LETTER ZE
105C0..105F3 ; N # Lo [52] TODHRI LETTER A..TODHRI LETTER OO
10600..10736 ; N # Lo [311] LINEAR A SIGN AB001..LINEAR A SIGN A664
10740..10755 ; N # Lo [22] LINEAR A SIGN A701 A..LINEAR A SIGN A732 JE
10760..10767 ; N # Lo [8] LINEAR A SIGN A800..LINEAR A SIGN A807
@ -1942,12 +1948,23 @@ FFFD ; A # So REPLACEMENT CHARACTER
10D00..10D23 ; N # Lo [36] HANIFI ROHINGYA LETTER A..HANIFI ROHINGYA MARK NA KHONNA
10D24..10D27 ; N # Mn [4] HANIFI ROHINGYA SIGN HARBAHAY..HANIFI ROHINGYA SIGN TASSI
10D30..10D39 ; N # Nd [10] HANIFI ROHINGYA DIGIT ZERO..HANIFI ROHINGYA DIGIT NINE
10D40..10D49 ; N # Nd [10] GARAY DIGIT ZERO..GARAY DIGIT NINE
10D4A..10D4D ; N # Lo [4] GARAY VOWEL SIGN A..GARAY VOWEL SIGN EE
10D4E ; N # Lm GARAY VOWEL LENGTH MARK
10D4F ; N # Lo GARAY SUKUN
10D50..10D65 ; N # Lu [22] GARAY CAPITAL LETTER A..GARAY CAPITAL LETTER OLD NA
10D69..10D6D ; N # Mn [5] GARAY VOWEL SIGN E..GARAY CONSONANT NASALIZATION MARK
10D6E ; N # Pd GARAY HYPHEN
10D6F ; N # Lm GARAY REDUPLICATION MARK
10D70..10D85 ; N # Ll [22] GARAY SMALL LETTER A..GARAY SMALL LETTER OLD NA
10D8E..10D8F ; N # Sm [2] GARAY PLUS SIGN..GARAY MINUS SIGN
10E60..10E7E ; N # No [31] RUMI DIGIT ONE..RUMI FRACTION TWO THIRDS
10E80..10EA9 ; N # Lo [42] YEZIDI LETTER ELIF..YEZIDI LETTER ET
10EAB..10EAC ; N # Mn [2] YEZIDI COMBINING HAMZA MARK..YEZIDI COMBINING MADDA MARK
10EAD ; N # Pd YEZIDI HYPHENATION MARK
10EB0..10EB1 ; N # Lo [2] YEZIDI LETTER LAM WITH DOT ABOVE..YEZIDI LETTER YOT WITH CIRCUMFLEX ABOVE
10EFD..10EFF ; N # Mn [3] ARABIC SMALL LOW WORD SAKTA..ARABIC SMALL LOW WORD MADDA
10EC2..10EC4 ; N # Lo [3] ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW..ARABIC LETTER KAF WITH TWO DOTS VERTICALLY BELOW
10EFC..10EFF ; N # Mn [4] ARABIC COMBINING ALEF OVERLAY..ARABIC SMALL LOW WORD MADDA
10F00..10F1C ; N # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
10F1D..10F26 ; N # No [10] OLD SOGDIAN NUMBER ONE..OLD SOGDIAN FRACTION ONE HALF
10F27 ; N # Lo OLD SOGDIAN LIGATURE AYIN-DALETH
@ -2064,6 +2081,26 @@ FFFD ; A # So REPLACEMENT CHARACTER
11362..11363 ; N # Mc [2] GRANTHA VOWEL SIGN VOCALIC L..GRANTHA VOWEL SIGN VOCALIC LL
11366..1136C ; N # Mn [7] COMBINING GRANTHA DIGIT ZERO..COMBINING GRANTHA DIGIT SIX
11370..11374 ; N # Mn [5] COMBINING GRANTHA LETTER A..COMBINING GRANTHA LETTER PA
11380..11389 ; N # Lo [10] TULU-TIGALARI LETTER A..TULU-TIGALARI LETTER VOCALIC LL
1138B ; N # Lo TULU-TIGALARI LETTER EE
1138E ; N # Lo TULU-TIGALARI LETTER AI
11390..113B5 ; N # Lo [38] TULU-TIGALARI LETTER OO..TULU-TIGALARI LETTER LLLA
113B7 ; N # Lo TULU-TIGALARI SIGN AVAGRAHA
113B8..113BA ; N # Mc [3] TULU-TIGALARI VOWEL SIGN AA..TULU-TIGALARI VOWEL SIGN II
113BB..113C0 ; N # Mn [6] TULU-TIGALARI VOWEL SIGN U..TULU-TIGALARI VOWEL SIGN VOCALIC LL
113C2 ; N # Mc TULU-TIGALARI VOWEL SIGN EE
113C5 ; N # Mc TULU-TIGALARI VOWEL SIGN AI
113C7..113CA ; N # Mc [4] TULU-TIGALARI VOWEL SIGN OO..TULU-TIGALARI SIGN CANDRA ANUNASIKA
113CC..113CD ; N # Mc [2] TULU-TIGALARI SIGN ANUSVARA..TULU-TIGALARI SIGN VISARGA
113CE ; N # Mn TULU-TIGALARI SIGN VIRAMA
113CF ; N # Mc TULU-TIGALARI SIGN LOOPED VIRAMA
113D0 ; N # Mn TULU-TIGALARI CONJOINER
113D1 ; N # Lo TULU-TIGALARI REPHA
113D2 ; N # Mn TULU-TIGALARI GEMINATION MARK
113D3 ; N # Lo TULU-TIGALARI SIGN PLUTA
113D4..113D5 ; N # Po [2] TULU-TIGALARI DANDA..TULU-TIGALARI DOUBLE DANDA
113D7..113D8 ; N # Po [2] TULU-TIGALARI SIGN OM PUSHPIKA..TULU-TIGALARI SIGN SHRII PUSHPIKA
113E1..113E2 ; N # Mn [2] TULU-TIGALARI VEDIC TONE SVARITA..TULU-TIGALARI VEDIC TONE ANUDATTA
11400..11434 ; N # Lo [53] NEWA LETTER A..NEWA LETTER HA
11435..11437 ; N # Mc [3] NEWA VOWEL SIGN AA..NEWA VOWEL SIGN II
11438..1143F ; N # Mn [8] NEWA VOWEL SIGN U..NEWA VOWEL SIGN AI
@ -2123,8 +2160,11 @@ FFFD ; A # So REPLACEMENT CHARACTER
116B8 ; N # Lo TAKRI LETTER ARCHAIC KHA
116B9 ; N # Po TAKRI ABBREVIATION SIGN
116C0..116C9 ; N # Nd [10] TAKRI DIGIT ZERO..TAKRI DIGIT NINE
116D0..116E3 ; N # Nd [20] MYANMAR PAO DIGIT ZERO..MYANMAR EASTERN PWO KAREN DIGIT NINE
11700..1171A ; N # Lo [27] AHOM LETTER KA..AHOM LETTER ALTERNATE BA
1171D..1171F ; N # Mn [3] AHOM CONSONANT SIGN MEDIAL LA..AHOM CONSONANT SIGN MEDIAL LIGATING RA
1171D ; N # Mn AHOM CONSONANT SIGN MEDIAL LA
1171E ; N # Mc AHOM CONSONANT SIGN MEDIAL RA
1171F ; N # Mn AHOM CONSONANT SIGN MEDIAL LIGATING RA
11720..11721 ; N # Mc [2] AHOM VOWEL SIGN A..AHOM VOWEL SIGN AA
11722..11725 ; N # Mn [4] AHOM VOWEL SIGN I..AHOM VOWEL SIGN UU
11726 ; N # Mc AHOM VOWEL SIGN E
@ -2195,6 +2235,9 @@ FFFD ; A # So REPLACEMENT CHARACTER
11AB0..11ABF ; N # Lo [16] CANADIAN SYLLABICS NATTILIK HI..CANADIAN SYLLABICS SPA
11AC0..11AF8 ; N # Lo [57] PAU CIN HAU LETTER PA..PAU CIN HAU GLOTTAL STOP FINAL
11B00..11B09 ; N # Po [10] DEVANAGARI HEAD MARK..DEVANAGARI SIGN MINDU
11BC0..11BE0 ; N # Lo [33] SUNUWAR LETTER DEVI..SUNUWAR LETTER KLOKO
11BE1 ; N # Po SUNUWAR SIGN PVO
11BF0..11BF9 ; N # Nd [10] SUNUWAR DIGIT ZERO..SUNUWAR DIGIT NINE
11C00..11C08 ; N # Lo [9] BHAIKSUKI LETTER A..BHAIKSUKI LETTER VOCALIC L
11C0A..11C2E ; N # Lo [37] BHAIKSUKI LETTER E..BHAIKSUKI LETTER HA
11C2F ; N # Mc BHAIKSUKI VOWEL SIGN AA
@ -2253,6 +2296,7 @@ FFFD ; A # So REPLACEMENT CHARACTER
11F42 ; N # Mn KAWI CONJOINER
11F43..11F4F ; N # Po [13] KAWI DANDA..KAWI PUNCTUATION CLOSING SPIRAL
11F50..11F59 ; N # Nd [10] KAWI DIGIT ZERO..KAWI DIGIT NINE
11F5A ; N # Mn KAWI SIGN NUKTA
11FB0 ; N # Lo LISU LETTER YHA
11FC0..11FD4 ; N # No [21] TAMIL FRACTION ONE THREE-HUNDRED-AND-TWENTIETH..TAMIL FRACTION DOWNSCALING FACTOR KIIZH
11FD5..11FDC ; N # So [8] TAMIL SIGN NEL..TAMIL SIGN MUKKURUNI
@ -2270,7 +2314,13 @@ FFFD ; A # So REPLACEMENT CHARACTER
13440 ; N # Mn EGYPTIAN HIEROGLYPH MIRROR HORIZONTALLY
13441..13446 ; N # Lo [6] EGYPTIAN HIEROGLYPH FULL BLANK..EGYPTIAN HIEROGLYPH WIDE LOST SIGN
13447..13455 ; N # Mn [15] EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP START..EGYPTIAN HIEROGLYPH MODIFIER DAMAGED
13460..143FA ; N # Lo [3995] EGYPTIAN HIEROGLYPH-13460..EGYPTIAN HIEROGLYPH-143FA
14400..14646 ; N # Lo [583] ANATOLIAN HIEROGLYPH A001..ANATOLIAN HIEROGLYPH A530
16100..1611D ; N # Lo [30] GURUNG KHEMA LETTER A..GURUNG KHEMA LETTER SA
1611E..16129 ; N # Mn [12] GURUNG KHEMA VOWEL SIGN AA..GURUNG KHEMA VOWEL LENGTH MARK
1612A..1612C ; N # Mc [3] GURUNG KHEMA CONSONANT SIGN MEDIAL YA..GURUNG KHEMA CONSONANT SIGN MEDIAL HA
1612D..1612F ; N # Mn [3] GURUNG KHEMA SIGN ANUSVARA..GURUNG KHEMA SIGN THOLHOMA
16130..16139 ; N # Nd [10] GURUNG KHEMA DIGIT ZERO..GURUNG KHEMA DIGIT NINE
16800..16A38 ; N # Lo [569] BAMUM LETTER PHASE-A NGKUE MFON..BAMUM LETTER PHASE-F VUEQ
16A40..16A5E ; N # Lo [31] MRO LETTER TA..MRO LETTER TEK
16A60..16A69 ; N # Nd [10] MRO DIGIT ZERO..MRO DIGIT NINE
@ -2291,6 +2341,11 @@ FFFD ; A # So REPLACEMENT CHARACTER
16B5B..16B61 ; N # No [7] PAHAWH HMONG NUMBER TENS..PAHAWH HMONG NUMBER TRILLIONS
16B63..16B77 ; N # Lo [21] PAHAWH HMONG SIGN VOS LUB..PAHAWH HMONG SIGN CIM NRES TOS
16B7D..16B8F ; N # Lo [19] PAHAWH HMONG CLAN SIGN TSHEEJ..PAHAWH HMONG CLAN SIGN VWJ
16D40..16D42 ; N # Lm [3] KIRAT RAI SIGN ANUSVARA..KIRAT RAI SIGN VISARGA
16D43..16D6A ; N # Lo [40] KIRAT RAI LETTER A..KIRAT RAI VOWEL SIGN AU
16D6B..16D6C ; N # Lm [2] KIRAT RAI SIGN VIRAMA..KIRAT RAI SIGN SAAT
16D6D..16D6F ; N # Po [3] KIRAT RAI SIGN YUPI..KIRAT RAI DOUBLE DANDA
16D70..16D79 ; N # Nd [10] KIRAT RAI DIGIT ZERO..KIRAT RAI DIGIT NINE
16E40..16E7F ; N # L& [64] MEDEFAIDRIN CAPITAL LETTER M..MEDEFAIDRIN SMALL LETTER Y
16E80..16E96 ; N # No [23] MEDEFAIDRIN DIGIT ZERO..MEDEFAIDRIN DIGIT THREE ALTERNATE FORM
16E97..16E9A ; N # Po [4] MEDEFAIDRIN COMMA..MEDEFAIDRIN EXCLAMATION OH
@ -2308,6 +2363,7 @@ FFFD ; A # So REPLACEMENT CHARACTER
17000..187F7 ; W # Lo [6136] TANGUT IDEOGRAPH-17000..TANGUT IDEOGRAPH-187F7
18800..18AFF ; W # Lo [768] TANGUT COMPONENT-001..TANGUT COMPONENT-768
18B00..18CD5 ; W # Lo [470] KHITAN SMALL SCRIPT CHARACTER-18B00..KHITAN SMALL SCRIPT CHARACTER-18CD5
18CFF ; W # Lo KHITAN SMALL SCRIPT CHARACTER-18CFF
18D00..18D08 ; W # Lo [9] TANGUT IDEOGRAPH-18D00..TANGUT IDEOGRAPH-18D08
1AFF0..1AFF3 ; W # Lm [4] KATAKANA LETTER MINNAN TONE-2..KATAKANA LETTER MINNAN TONE-5
1AFF5..1AFFB ; W # Lm [7] KATAKANA LETTER MINNAN TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-5
@ -2327,6 +2383,9 @@ FFFD ; A # So REPLACEMENT CHARACTER
1BC9D..1BC9E ; N # Mn [2] DUPLOYAN THICK LETTER SELECTOR..DUPLOYAN DOUBLE MARK
1BC9F ; N # Po DUPLOYAN PUNCTUATION CHINOOK FULL STOP
1BCA0..1BCA3 ; N # Cf [4] SHORTHAND FORMAT LETTER OVERLAP..SHORTHAND FORMAT UP STEP
1CC00..1CCEF ; N # So [240] UP-POINTING GO-KART..OUTLINED LATIN CAPITAL LETTER Z
1CCF0..1CCF9 ; N # Nd [10] OUTLINED DIGIT ZERO..OUTLINED DIGIT NINE
1CD00..1CEB3 ; N # So [436] BLOCK OCTANT-3..BLACK RIGHT TRIANGLE CARET
1CF00..1CF2D ; N # Mn [46] ZNAMENNY COMBINING MARK GORAZDO NIZKO S KRYZHEM ON LEFT..ZNAMENNY COMBINING MARK KRYZH ON LEFT
1CF30..1CF46 ; N # Mn [23] ZNAMENNY COMBINING TONAL RANGE MARK MRACHNO..ZNAMENNY PRIZNAK MODIFIER ROG
1CF50..1CFC3 ; N # So [116] ZNAMENNY NEUME KRYUK..ZNAMENNY NEUME PAUK
@ -2349,8 +2408,9 @@ FFFD ; A # So REPLACEMENT CHARACTER
1D245 ; N # So GREEK MUSICAL LEIMMA
1D2C0..1D2D3 ; N # No [20] KAKTOVIK NUMERAL ZERO..KAKTOVIK NUMERAL NINETEEN
1D2E0..1D2F3 ; N # No [20] MAYAN NUMERAL ZERO..MAYAN NUMERAL NINETEEN
1D300..1D356 ; N # So [87] MONOGRAM FOR EARTH..TETRAGRAM FOR FOSTERING
1D360..1D378 ; N # No [25] COUNTING ROD UNIT DIGIT ONE..TALLY MARK FIVE
1D300..1D356 ; W # So [87] MONOGRAM FOR EARTH..TETRAGRAM FOR FOSTERING
1D360..1D376 ; W # No [23] COUNTING ROD UNIT DIGIT ONE..IDEOGRAPHIC TALLY MARK FIVE
1D377..1D378 ; N # No [2] TALLY MARK ONE..TALLY MARK FIVE
1D400..1D454 ; N # L& [85] MATHEMATICAL BOLD CAPITAL A..MATHEMATICAL ITALIC SMALL G
1D456..1D49C ; N # L& [71] MATHEMATICAL ITALIC SMALL I..MATHEMATICAL SCRIPT CAPITAL A
1D49E..1D49F ; N # Lu [2] MATHEMATICAL SCRIPT CAPITAL C..MATHEMATICAL SCRIPT CAPITAL D
@ -2431,6 +2491,11 @@ FFFD ; A # So REPLACEMENT CHARACTER
1E4EB ; N # Lm NAG MUNDARI SIGN OJOD
1E4EC..1E4EF ; N # Mn [4] NAG MUNDARI SIGN MUHOR..NAG MUNDARI SIGN SUTUH
1E4F0..1E4F9 ; N # Nd [10] NAG MUNDARI DIGIT ZERO..NAG MUNDARI DIGIT NINE
1E5D0..1E5ED ; N # Lo [30] OL ONAL LETTER O..OL ONAL LETTER EG
1E5EE..1E5EF ; N # Mn [2] OL ONAL SIGN MU..OL ONAL SIGN IKIR
1E5F0 ; N # Lo OL ONAL SIGN HODDOND
1E5F1..1E5FA ; N # Nd [10] OL ONAL DIGIT ZERO..OL ONAL DIGIT NINE
1E5FF ; N # Po OL ONAL ABBREVIATION SIGN
1E7E0..1E7E6 ; N # Lo [7] ETHIOPIC SYLLABLE HHYA..ETHIOPIC SYLLABLE HHYO
1E7E8..1E7EB ; N # Lo [4] ETHIOPIC SYLLABLE GURAGE HHWA..ETHIOPIC SYLLABLE HHWE
1E7ED..1E7EE ; N # Lo [2] ETHIOPIC SYLLABLE GURAGE MWI..ETHIOPIC SYLLABLE GURAGE MWEE
@ -2574,7 +2639,8 @@ FFFD ; A # So REPLACEMENT CHARACTER
1F850..1F859 ; N # So [10] LEFTWARDS SANS-SERIF ARROW..UP DOWN SANS-SERIF ARROW
1F860..1F887 ; N # So [40] WIDE-HEADED LEFTWARDS LIGHT BARB ARROW..WIDE-HEADED SOUTH WEST VERY HEAVY BARB ARROW
1F890..1F8AD ; N # So [30] LEFTWARDS TRIANGLE ARROWHEAD..WHITE ARROW SHAFT WIDTH TWO THIRDS
1F8B0..1F8B1 ; N # So [2] ARROW POINTING UPWARDS THEN NORTH WEST..ARROW POINTING RIGHTWARDS THEN CURVING SOUTH WEST
1F8B0..1F8BB ; N # So [12] ARROW POINTING UPWARDS THEN NORTH WEST..SOUTH WEST ARROW FROM BAR
1F8C0..1F8C1 ; N # So [2] LEFTWARDS ARROW FROM DOWNWARDS ARROW..RIGHTWARDS ARROW FROM DOWNWARDS ARROW
1F900..1F90B ; N # So [12] CIRCLED CROSS FORMEE WITH FOUR DOTS..DOWNWARD FACING NOTCHED HOOK WITH DOT
1F90C..1F93A ; W # So [47] PINCHED FINGERS..FENCER
1F93B ; N # So MODERN PENTATHLON
@ -2584,14 +2650,13 @@ FFFD ; A # So REPLACEMENT CHARACTER
1FA00..1FA53 ; N # So [84] NEUTRAL CHESS KING..BLACK CHESS KNIGHT-BISHOP
1FA60..1FA6D ; N # So [14] XIANGQI RED GENERAL..XIANGQI BLACK SOLDIER
1FA70..1FA7C ; W # So [13] BALLET SHOES..CRUTCH
1FA80..1FA88 ; W # So [9] YO-YO..FLUTE
1FA90..1FABD ; W # So [46] RINGED PLANET..WING
1FABF..1FAC5 ; W # So [7] GOOSE..PERSON WITH CROWN
1FACE..1FADB ; W # So [14] MOOSE..PEA POD
1FAE0..1FAE8 ; W # So [9] MELTING FACE..SHAKING FACE
1FA80..1FA89 ; W # So [10] YO-YO..HARP
1FA8F..1FAC6 ; W # So [56] SHOVEL..FINGERPRINT
1FACE..1FADC ; W # So [15] MOOSE..ROOT VEGETABLE
1FADF..1FAE9 ; W # So [11] SPLATTER..FACE WITH BAGS UNDER EYES
1FAF0..1FAF8 ; W # So [9] HAND WITH INDEX FINGER AND THUMB CROSSED..RIGHTWARDS PUSHING HAND
1FB00..1FB92 ; N # So [147] BLOCK SEXTANT-1..UPPER HALF INVERSE MEDIUM SHADE AND LOWER HALF BLOCK
1FB94..1FBCA ; N # So [55] LEFT HALF INVERSE MEDIUM SHADE AND RIGHT HALF BLOCK..WHITE UP-POINTING CHEVRON
1FB94..1FBEF ; N # So [92] LEFT HALF INVERSE MEDIUM SHADE AND RIGHT HALF BLOCK..TOP LEFT JUSTIFIED LOWER RIGHT QUARTER BLACK CIRCLE
1FBF0..1FBF9 ; N # Nd [10] SEGMENTED DIGIT ZERO..SEGMENTED DIGIT NINE
20000..2A6DF ; W # Lo [42720] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6DF
2A6E0..2A6FF ; W # Cn [32] <reserved-2A6E0>..<reserved-2A6FF>

View File

@ -1,8 +1,8 @@
# HangulSyllableType-15.1.0.txt
# Date: 2023-01-05, 20:34:42 GMT
# © 2023 Unicode®, Inc.
# HangulSyllableType-16.0.0.txt
# Date: 2024-04-30, 21:48:21 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use, see https://www.unicode.org/terms_of_use.html
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
#
# Unicode Character Database
# For documentation, see https://www.unicode.org/reports/tr44/

View File

@ -36,7 +36,7 @@
# files for making modifications.
UNICODE_VERSION = 15.1.0
UNICODE_VERSION = 16.0.0
PYTHON3 = python3
WGET = wget

File diff suppressed because it is too large Load Diff