mirror of
https://sourceware.org/git/glibc.git
synced 2024-11-21 12:30:06 +00:00
Bug 23308: Update to Unicode 11.0.0
Unicode 11.0.0 Support: Character encoding, character type info, and transliteration tables are all updated to Unicode 11.0.0, using the generator scripts contributed by Mike FABIAN (Red Hat). Some info about the number of characters added: Total added characters in newly generated CHARMAP: 684 Total added characters in newly generated WIDTH: 119 alpha: Added 380 characters in new ctype which were not in old ctype combining: Added 56 characters in new ctype which were not in old ctype combining_level3: Added 37 characters in new ctype which were not in old ctype graph: Added 684 characters in new ctype which were not in old ctype lower: Added 82 characters in new ctype which were not in old ctype print: Added 684 characters in new ctype which were not in old ctype punct: Added 304 characters in new ctype which were not in old ctype tolower: Added 79 characters in new ctype which were not in old ctype totitle: Added 33 characters in new ctype which were not in old ctype toupper: Added 79 characters in new ctype which were not in old ctype upper: Added 79 characters in new ctype which were not in old ctype No characters were removed. [BZ #23308] * unicode-gen/Makefile (UNICODE_VERSION): Set to 11.0.0. * localedata/unicode-gen/DerivedCoreProperties.txt: Update to Unicode 11.0.0. * localedata/unicode-gen/EastAsianWidth.txt: likewise. * localedata/unicode-gen/PropList.txt: likewise. * localedata/unicode-gen/UnicodeData.txt: likewise. * localedata/charmaps/UTF-8: Regenerate. * localedata/locales/i18n_ctype: likewise. * localedata/locales/tr_TR: likewise. * localedata/locales/translit_circle: likewise. * localedata/locales/translit_cjk_compat: likewise. * localedata/locales/translit_combining: likewise. * localedata/locales/translit_compat: likewise. * localedata/locales/translit_font: likewise. * localedata/locales/translit_fraction: likewise.
This commit is contained in:
parent
5a35750665
commit
b11643c21c
18
ChangeLog
18
ChangeLog
@ -1,3 +1,21 @@
|
||||
2018-06-26 Mike FABIAN <mfabian@redhat.com>
|
||||
|
||||
[BZ #23308]
|
||||
* unicode-gen/Makefile (UNICODE_VERSION): Set to 11.0.0.
|
||||
* localedata/unicode-gen/DerivedCoreProperties.txt: Update to Unicode 11.0.0.
|
||||
* localedata/unicode-gen/EastAsianWidth.txt: likewise.
|
||||
* localedata/unicode-gen/PropList.txt: likewise.
|
||||
* localedata/unicode-gen/UnicodeData.txt: likewise.
|
||||
* localedata/charmaps/UTF-8: Regenerate.
|
||||
* localedata/locales/i18n_ctype: likewise.
|
||||
* localedata/locales/tr_TR: likewise.
|
||||
* localedata/locales/translit_circle: likewise.
|
||||
* localedata/locales/translit_cjk_compat: likewise.
|
||||
* localedata/locales/translit_combining: likewise.
|
||||
* localedata/locales/translit_compat: likewise.
|
||||
* localedata/locales/translit_font: likewise.
|
||||
* localedata/locales/translit_fraction: likewise.
|
||||
|
||||
2018-07-03 Florian Weimer <fweimer@redhat.com>
|
||||
|
||||
[BZ #23363]
|
||||
|
4
NEWS
4
NEWS
@ -9,6 +9,10 @@ Version 2.28
|
||||
|
||||
Major new features:
|
||||
|
||||
* Unicode 11.0.0 Support: Character encoding, character type info, and
|
||||
transliteration tables are all updated to Unicode 11.0.0, using
|
||||
generator scripts contributed by Mike FABIAN (Red Hat).
|
||||
|
||||
* <math.h> functions that round their results to a narrower type are added
|
||||
from TS 18661-1:2014 and TS 18661-3:2015:
|
||||
|
||||
|
File diff suppressed because it is too large
Load Diff
File diff suppressed because it is too large
Load Diff
File diff suppressed because it is too large
Load Diff
@ -9,7 +9,7 @@ comment_char %
|
||||
% otherwise be governed by that license.
|
||||
|
||||
% Transliterations of encircled characters.
|
||||
% Generated automatically from UnicodeData.txt by gen_translit_circle.py on 2017-10-23 for Unicode 10.0.0.
|
||||
% Generated automatically from UnicodeData.txt by gen_translit_circle.py on 2018-06-20 for Unicode 11.0.0.
|
||||
|
||||
LC_CTYPE
|
||||
|
||||
|
@ -9,7 +9,7 @@ comment_char %
|
||||
% otherwise be governed by that license.
|
||||
|
||||
% Transliterations of CJK compatibility characters.
|
||||
% Generated automatically from UnicodeData.txt by gen_translit_cjk_compat.py on 2017-10-23 for Unicode 10.0.0.
|
||||
% Generated automatically from UnicodeData.txt by gen_translit_cjk_compat.py on 2018-06-20 for Unicode 11.0.0.
|
||||
|
||||
LC_CTYPE
|
||||
|
||||
|
@ -10,7 +10,7 @@ comment_char %
|
||||
|
||||
% Transliterations that remove all combining characters (accents,
|
||||
% pronounciation marks, etc.).
|
||||
% Generated automatically from UnicodeData.txt by gen_translit_combining.py on 2017-10-23 for Unicode 10.0.0.
|
||||
% Generated automatically from UnicodeData.txt by gen_translit_combining.py on 2018-06-20 for Unicode 11.0.0.
|
||||
|
||||
LC_CTYPE
|
||||
|
||||
@ -446,6 +446,8 @@ translit_start
|
||||
<U06EC> ""
|
||||
% ARABIC SMALL LOW MEEM
|
||||
<U06ED> ""
|
||||
% ARABIC SMALL LOW WAW
|
||||
<U08D3> ""
|
||||
% ARABIC SMALL HIGH WORD AR-RUB
|
||||
<U08D4> ""
|
||||
% ARABIC SMALL HIGH SAD
|
||||
@ -800,6 +802,38 @@ translit_start
|
||||
<U00010379> ""
|
||||
% COMBINING OLD PERMIC LETTER SII
|
||||
<U0001037A> ""
|
||||
% HANIFI ROHINGYA SIGN HARBAHAY
|
||||
<U00010D24> ""
|
||||
% HANIFI ROHINGYA SIGN TAHALA
|
||||
<U00010D25> ""
|
||||
% HANIFI ROHINGYA SIGN TANA
|
||||
<U00010D26> ""
|
||||
% HANIFI ROHINGYA SIGN TASSI
|
||||
<U00010D27> ""
|
||||
% SOGDIAN COMBINING DOT BELOW
|
||||
<U00010F46> ""
|
||||
% SOGDIAN COMBINING TWO DOTS BELOW
|
||||
<U00010F47> ""
|
||||
% SOGDIAN COMBINING DOT ABOVE
|
||||
<U00010F48> ""
|
||||
% SOGDIAN COMBINING TWO DOTS ABOVE
|
||||
<U00010F49> ""
|
||||
% SOGDIAN COMBINING CURVE ABOVE
|
||||
<U00010F4A> ""
|
||||
% SOGDIAN COMBINING CURVE BELOW
|
||||
<U00010F4B> ""
|
||||
% SOGDIAN COMBINING HOOK ABOVE
|
||||
<U00010F4C> ""
|
||||
% SOGDIAN COMBINING HOOK BELOW
|
||||
<U00010F4D> ""
|
||||
% SOGDIAN COMBINING LONG HOOK BELOW
|
||||
<U00010F4E> ""
|
||||
% SOGDIAN COMBINING RESH BELOW
|
||||
<U00010F4F> ""
|
||||
% SOGDIAN COMBINING STROKE BELOW
|
||||
<U00010F50> ""
|
||||
% COMBINING BINDU BELOW
|
||||
<U0001133B> ""
|
||||
% NEWA VOWEL SIGN AA
|
||||
<U00011435> ""
|
||||
% NEWA VOWEL SIGN I
|
||||
@ -836,6 +870,38 @@ translit_start
|
||||
<U00011445> ""
|
||||
% NEWA SIGN NUKTA
|
||||
<U00011446> ""
|
||||
% NEWA SANDHI MARK
|
||||
<U0001145E> ""
|
||||
% DOGRA VOWEL SIGN AA
|
||||
<U0001182C> ""
|
||||
% DOGRA VOWEL SIGN I
|
||||
<U0001182D> ""
|
||||
% DOGRA VOWEL SIGN II
|
||||
<U0001182E> ""
|
||||
% DOGRA VOWEL SIGN U
|
||||
<U0001182F> ""
|
||||
% DOGRA VOWEL SIGN UU
|
||||
<U00011830> ""
|
||||
% DOGRA VOWEL SIGN VOCALIC R
|
||||
<U00011831> ""
|
||||
% DOGRA VOWEL SIGN VOCALIC RR
|
||||
<U00011832> ""
|
||||
% DOGRA VOWEL SIGN E
|
||||
<U00011833> ""
|
||||
% DOGRA VOWEL SIGN AI
|
||||
<U00011834> ""
|
||||
% DOGRA VOWEL SIGN O
|
||||
<U00011835> ""
|
||||
% DOGRA VOWEL SIGN AU
|
||||
<U00011836> ""
|
||||
% DOGRA SIGN ANUSVARA
|
||||
<U00011837> ""
|
||||
% DOGRA SIGN VISARGA
|
||||
<U00011838> ""
|
||||
% DOGRA SIGN VIRAMA
|
||||
<U00011839> ""
|
||||
% DOGRA SIGN NUKTA
|
||||
<U0001183A> ""
|
||||
% ZANABAZAR SQUARE VOWEL SIGN I
|
||||
<U00011A01> ""
|
||||
% ZANABAZAR SQUARE VOWEL SIGN UE
|
||||
@ -1072,6 +1138,38 @@ translit_start
|
||||
<U00011D45> ""
|
||||
% MASARAM GONDI RA-KARA
|
||||
<U00011D47> ""
|
||||
% GUNJALA GONDI VOWEL SIGN AA
|
||||
<U00011D8A> ""
|
||||
% GUNJALA GONDI VOWEL SIGN I
|
||||
<U00011D8B> ""
|
||||
% GUNJALA GONDI VOWEL SIGN II
|
||||
<U00011D8C> ""
|
||||
% GUNJALA GONDI VOWEL SIGN U
|
||||
<U00011D8D> ""
|
||||
% GUNJALA GONDI VOWEL SIGN UU
|
||||
<U00011D8E> ""
|
||||
% GUNJALA GONDI VOWEL SIGN EE
|
||||
<U00011D90> ""
|
||||
% GUNJALA GONDI VOWEL SIGN AI
|
||||
<U00011D91> ""
|
||||
% GUNJALA GONDI VOWEL SIGN OO
|
||||
<U00011D93> ""
|
||||
% GUNJALA GONDI VOWEL SIGN AU
|
||||
<U00011D94> ""
|
||||
% GUNJALA GONDI SIGN ANUSVARA
|
||||
<U00011D95> ""
|
||||
% GUNJALA GONDI SIGN VISARGA
|
||||
<U00011D96> ""
|
||||
% GUNJALA GONDI VIRAMA
|
||||
<U00011D97> ""
|
||||
% MAKASAR VOWEL SIGN I
|
||||
<U00011EF3> ""
|
||||
% MAKASAR VOWEL SIGN U
|
||||
<U00011EF4> ""
|
||||
% MAKASAR VOWEL SIGN E
|
||||
<U00011EF5> ""
|
||||
% MAKASAR VOWEL SIGN O
|
||||
<U00011EF6> ""
|
||||
% COMBINING GREEK MUSICAL TRISEME
|
||||
<U0001D242> ""
|
||||
% COMBINING GREEK MUSICAL TETRASEME
|
||||
|
@ -9,7 +9,7 @@ comment_char %
|
||||
% otherwise be governed by that license.
|
||||
|
||||
% Transliterations of compatibility characters and ligatures.
|
||||
% Generated automatically from UnicodeData.txt by gen_translit_compat.py on 2017-10-23 for Unicode 10.0.0.
|
||||
% Generated automatically from UnicodeData.txt by gen_translit_compat.py on 2018-06-20 for Unicode 11.0.0.
|
||||
|
||||
LC_CTYPE
|
||||
|
||||
|
@ -9,7 +9,7 @@ comment_char %
|
||||
% otherwise be governed by that license.
|
||||
|
||||
% Transliterations of font equivalents.
|
||||
% Generated automatically from UnicodeData.txt by gen_translit_font.py on 2017-10-23 for Unicode 10.0.0.
|
||||
% Generated automatically from UnicodeData.txt by gen_translit_font.py on 2018-06-20 for Unicode 11.0.0.
|
||||
|
||||
LC_CTYPE
|
||||
|
||||
|
@ -9,7 +9,7 @@ comment_char %
|
||||
% otherwise be governed by that license.
|
||||
|
||||
% Transliterations of fractions.
|
||||
% Generated automatically from UnicodeData.txt by gen_translit_fraction.py on 2017-10-23 for Unicode 10.0.0.
|
||||
% Generated automatically from UnicodeData.txt by gen_translit_fraction.py on 2018-06-20 for Unicode 11.0.0.
|
||||
% The replacements have been surrounded with spaces, because fractions are
|
||||
% often preceded by a decimal number and followed by a unit or a math symbol.
|
||||
|
||||
|
File diff suppressed because it is too large
Load Diff
@ -1,6 +1,6 @@
|
||||
# EastAsianWidth-10.0.0.txt
|
||||
# Date: 2017-03-08, 02:00:00 GMT [KW, LI]
|
||||
# © 2017 Unicode®, Inc.
|
||||
# EastAsianWidth-11.0.0.txt
|
||||
# Date: 2018-05-14, 09:41:59 GMT [KW, LI]
|
||||
# © 2018 Unicode®, Inc.
|
||||
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
|
||||
# For terms of use, see http://www.unicode.org/terms_of_use.html
|
||||
#
|
||||
@ -123,7 +123,7 @@
|
||||
00FC;A # Ll LATIN SMALL LETTER U WITH DIAERESIS
|
||||
00FD;N # Ll LATIN SMALL LETTER Y WITH ACUTE
|
||||
00FE;A # Ll LATIN SMALL LETTER THORN
|
||||
00FF;N # L& LATIN SMALL LETTER Y WITH DIAERESIS
|
||||
00FF;N # Ll LATIN SMALL LETTER Y WITH DIAERESIS
|
||||
0100;N # Lu LATIN CAPITAL LETTER A WITH MACRON
|
||||
0101;A # Ll LATIN SMALL LETTER A WITH MACRON
|
||||
0102..0110;N # L& [15] LATIN CAPITAL LETTER A WITH BREVE..LATIN CAPITAL LETTER D WITH STROKE
|
||||
@ -247,7 +247,7 @@
|
||||
0531..0556;N # Lu [38] ARMENIAN CAPITAL LETTER AYB..ARMENIAN CAPITAL LETTER FEH
|
||||
0559;N # Lm ARMENIAN MODIFIER LETTER LEFT HALF RING
|
||||
055A..055F;N # Po [6] ARMENIAN APOSTROPHE..ARMENIAN ABBREVIATION MARK
|
||||
0561..0587;N # Ll [39] ARMENIAN SMALL LETTER AYB..ARMENIAN SMALL LIGATURE ECH YIWN
|
||||
0560..0588;N # Ll [41] ARMENIAN SMALL LETTER TURNED AYB..ARMENIAN SMALL LETTER YI WITH STROKE
|
||||
0589;N # Po ARMENIAN FULL STOP
|
||||
058A;N # Pd ARMENIAN HYPHEN
|
||||
058D..058E;N # So [2] RIGHT-FACING ARMENIAN ETERNITY SIGN..LEFT-FACING ARMENIAN ETERNITY SIGN
|
||||
@ -262,7 +262,7 @@
|
||||
05C6;N # Po HEBREW PUNCTUATION NUN HAFUKHA
|
||||
05C7;N # Mn HEBREW POINT QAMATS QATAN
|
||||
05D0..05EA;N # Lo [27] HEBREW LETTER ALEF..HEBREW LETTER TAV
|
||||
05F0..05F2;N # Lo [3] HEBREW LIGATURE YIDDISH DOUBLE VAV..HEBREW LIGATURE YIDDISH DOUBLE YOD
|
||||
05EF..05F2;N # Lo [4] HEBREW YOD TRIANGLE..HEBREW LIGATURE YIDDISH DOUBLE YOD
|
||||
05F3..05F4;N # Po [2] HEBREW PUNCTUATION GERESH..HEBREW PUNCTUATION GERSHAYIM
|
||||
0600..0605;N # Cf [6] ARABIC NUMBER SIGN..ARABIC NUMBER MARK ABOVE
|
||||
0606..0608;N # Sm [3] ARABIC-INDIC CUBE ROOT..ARABIC RAY
|
||||
@ -316,6 +316,8 @@
|
||||
07F6;N # So NKO SYMBOL OO DENNEN
|
||||
07F7..07F9;N # Po [3] NKO SYMBOL GBAKURUNEN..NKO EXCLAMATION MARK
|
||||
07FA;N # Lm NKO LAJANYALAN
|
||||
07FD;N # Mn NKO DANTAYALAN
|
||||
07FE..07FF;N # Sc [2] NKO DOROME SIGN..NKO TAMAN SIGN
|
||||
0800..0815;N # Lo [22] SAMARITAN LETTER ALAF..SAMARITAN LETTER TAAF
|
||||
0816..0819;N # Mn [4] SAMARITAN MARK IN..SAMARITAN MARK DAGESH
|
||||
081A;N # Lm SAMARITAN MODIFIER LETTER EPENTHETIC YUT
|
||||
@ -331,7 +333,7 @@
|
||||
0860..086A;N # Lo [11] SYRIAC LETTER MALAYALAM NGA..SYRIAC LETTER MALAYALAM SSA
|
||||
08A0..08B4;N # Lo [21] ARABIC LETTER BEH WITH SMALL V BELOW..ARABIC LETTER KAF WITH DOT BELOW
|
||||
08B6..08BD;N # Lo [8] ARABIC LETTER BEH WITH SMALL MEEM ABOVE..ARABIC LETTER AFRICAN NOON
|
||||
08D4..08E1;N # Mn [14] ARABIC SMALL HIGH WORD AR-RUB..ARABIC SMALL HIGH SIGN SAFHA
|
||||
08D3..08E1;N # Mn [15] ARABIC SMALL LOW WAW..ARABIC SMALL HIGH SIGN SAFHA
|
||||
08E2;N # Cf ARABIC DISPUTED END OF AYAH
|
||||
08E3..08FF;N # Mn [29] ARABIC TURNED DAMMA BELOW..ARABIC MARK SIDEWAYS NOON GHUNNA
|
||||
0900..0902;N # Mn [3] DEVANAGARI SIGN INVERTED CANDRABINDU..DEVANAGARI SIGN ANUSVARA
|
||||
@ -384,6 +386,7 @@
|
||||
09FB;N # Sc BENGALI GANDA MARK
|
||||
09FC;N # Lo BENGALI LETTER VEDIC ANUSVARA
|
||||
09FD;N # Po BENGALI ABBREVIATION SIGN
|
||||
09FE;N # Mn BENGALI SANDHI MARK
|
||||
0A01..0A02;N # Mn [2] GURMUKHI SIGN ADAK BINDI..GURMUKHI SIGN BINDI
|
||||
0A03;N # Mc GURMUKHI SIGN VISARGA
|
||||
0A05..0A0A;N # Lo [6] GURMUKHI LETTER A..GURMUKHI LETTER UU
|
||||
@ -405,6 +408,7 @@
|
||||
0A70..0A71;N # Mn [2] GURMUKHI TIPPI..GURMUKHI ADDAK
|
||||
0A72..0A74;N # Lo [3] GURMUKHI IRI..GURMUKHI EK ONKAR
|
||||
0A75;N # Mn GURMUKHI SIGN YAKASH
|
||||
0A76;N # Po GURMUKHI ABBREVIATION SIGN
|
||||
0A81..0A82;N # Mn [2] GUJARATI SIGN CANDRABINDU..GUJARATI SIGN ANUSVARA
|
||||
0A83;N # Mc GUJARATI SIGN VISARGA
|
||||
0A85..0A8D;N # Lo [9] GUJARATI LETTER A..GUJARATI VOWEL CANDRA E
|
||||
@ -481,6 +485,7 @@
|
||||
0BFA;N # So TAMIL NUMBER SIGN
|
||||
0C00;N # Mn TELUGU SIGN COMBINING CANDRABINDU ABOVE
|
||||
0C01..0C03;N # Mc [3] TELUGU SIGN CANDRABINDU..TELUGU SIGN VISARGA
|
||||
0C04;N # Mn TELUGU SIGN COMBINING ANUSVARA ABOVE
|
||||
0C05..0C0C;N # Lo [8] TELUGU LETTER A..TELUGU LETTER VOCALIC L
|
||||
0C0E..0C10;N # Lo [3] TELUGU LETTER E..TELUGU LETTER AI
|
||||
0C12..0C28;N # Lo [23] TELUGU LETTER O..TELUGU LETTER NA
|
||||
@ -500,6 +505,7 @@
|
||||
0C80;N # Lo KANNADA SIGN SPACING CANDRABINDU
|
||||
0C81;N # Mn KANNADA SIGN CANDRABINDU
|
||||
0C82..0C83;N # Mc [2] KANNADA SIGN ANUSVARA..KANNADA SIGN VISARGA
|
||||
0C84;N # Po KANNADA SIGN SIDDHAM
|
||||
0C85..0C8C;N # Lo [8] KANNADA LETTER A..KANNADA LETTER VOCALIC L
|
||||
0C8E..0C90;N # Lo [3] KANNADA LETTER E..KANNADA LETTER AI
|
||||
0C92..0CA8;N # Lo [23] KANNADA LETTER O..KANNADA LETTER NA
|
||||
@ -666,10 +672,10 @@
|
||||
10A0..10C5;N # Lu [38] GEORGIAN CAPITAL LETTER AN..GEORGIAN CAPITAL LETTER HOE
|
||||
10C7;N # Lu GEORGIAN CAPITAL LETTER YN
|
||||
10CD;N # Lu GEORGIAN CAPITAL LETTER AEN
|
||||
10D0..10FA;N # Lo [43] GEORGIAN LETTER AN..GEORGIAN LETTER AIN
|
||||
10D0..10FA;N # Ll [43] GEORGIAN LETTER AN..GEORGIAN LETTER AIN
|
||||
10FB;N # Po GEORGIAN PARAGRAPH SEPARATOR
|
||||
10FC;N # Lm MODIFIER LETTER GEORGIAN NAR
|
||||
10FD..10FF;N # Lo [3] GEORGIAN LETTER AEN..GEORGIAN LETTER LABIAL SIGN
|
||||
10FD..10FF;N # Ll [3] GEORGIAN LETTER AEN..GEORGIAN LETTER LABIAL SIGN
|
||||
1100..115F;W # Lo [96] HANGUL CHOSEONG KIYEOK..HANGUL CHOSEONG FILLER
|
||||
1160..11FF;N # Lo [160] HANGUL JUNGSEONG FILLER..HANGUL JONGSEONG SSANGNIEUN
|
||||
1200..1248;N # Lo [73] ETHIOPIC SYLLABLE HA..ETHIOPIC SYLLABLE QWA
|
||||
@ -742,7 +748,7 @@
|
||||
1810..1819;N # Nd [10] MONGOLIAN DIGIT ZERO..MONGOLIAN DIGIT NINE
|
||||
1820..1842;N # Lo [35] MONGOLIAN LETTER A..MONGOLIAN LETTER CHI
|
||||
1843;N # Lm MONGOLIAN LETTER TODO LONG VOWEL SIGN
|
||||
1844..1877;N # Lo [52] MONGOLIAN LETTER TODO E..MONGOLIAN LETTER MANCHU ZHA
|
||||
1844..1878;N # Lo [53] MONGOLIAN LETTER TODO E..MONGOLIAN LETTER CHA WITH TWO DOTS
|
||||
1880..1884;N # Lo [5] MONGOLIAN LETTER ALI GALI ANUSVARA ONE..MONGOLIAN LETTER ALI GALI INVERTED UBADAMA
|
||||
1885..1886;N # Mn [2] MONGOLIAN LETTER ALI GALI BALUDA..MONGOLIAN LETTER ALI GALI THREE BALUDA
|
||||
1887..18A8;N # Lo [34] MONGOLIAN LETTER ALI GALI A..MONGOLIAN LETTER MANCHU ALI GALI BHA
|
||||
@ -846,6 +852,8 @@
|
||||
1C78..1C7D;N # Lm [6] OL CHIKI MU TTUDDAG..OL CHIKI AHAD
|
||||
1C7E..1C7F;N # Po [2] OL CHIKI PUNCTUATION MUCAAD..OL CHIKI PUNCTUATION DOUBLE MUCAAD
|
||||
1C80..1C88;N # Ll [9] CYRILLIC SMALL LETTER ROUNDED VE..CYRILLIC SMALL LETTER UNBLENDED UK
|
||||
1C90..1CBA;N # Lu [43] GEORGIAN MTAVRULI CAPITAL LETTER AN..GEORGIAN MTAVRULI CAPITAL LETTER AIN
|
||||
1CBD..1CBF;N # Lu [3] GEORGIAN MTAVRULI CAPITAL LETTER AEN..GEORGIAN MTAVRULI CAPITAL LETTER LABIAL SIGN
|
||||
1CC0..1CC7;N # Po [8] SUNDANESE PUNCTUATION BINDU SURYA..SUNDANESE PUNCTUATION BINDU BA SATANGA
|
||||
1CD0..1CD2;N # Mn [3] VEDIC TONE KARSHANA..VEDIC TONE PRENKHA
|
||||
1CD3;N # Po VEDIC SIGN NIHSHVASA
|
||||
@ -1332,10 +1340,8 @@
|
||||
2B56..2B59;A # So [4] HEAVY OVAL WITH OVAL INSIDE..HEAVY CIRCLED SALTIRE
|
||||
2B5A..2B73;N # So [26] SLANTED NORTH ARROW WITH HOOKED HEAD..DOWNWARDS TRIANGLE-HEADED ARROW TO BAR
|
||||
2B76..2B95;N # So [32] NORTH WEST TRIANGLE-HEADED ARROW TO BAR..RIGHTWARDS BLACK ARROW
|
||||
2B98..2BB9;N # So [34] THREE-D TOP-LIGHTED LEFTWARDS EQUILATERAL ARROWHEAD..UP ARROWHEAD IN A RECTANGLE BOX
|
||||
2BBD..2BC8;N # So [12] BALLOT BOX WITH LIGHT X..BLACK MEDIUM RIGHT-POINTING TRIANGLE CENTRED
|
||||
2BCA..2BD2;N # So [9] TOP HALF BLACK CIRCLE..GROUP MARK
|
||||
2BEC..2BEF;N # So [4] LEFTWARDS TWO-HEADED ARROW WITH TRIANGLE ARROWHEADS..DOWNWARDS TWO-HEADED ARROW WITH TRIANGLE ARROWHEADS
|
||||
2B98..2BC8;N # So [49] THREE-D TOP-LIGHTED LEFTWARDS EQUILATERAL ARROWHEAD..BLACK MEDIUM RIGHT-POINTING TRIANGLE CENTRED
|
||||
2BCA..2BFE;N # So [53] TOP HALF BLACK CIRCLE..REVERSED RIGHT ANGLE
|
||||
2C00..2C2E;N # Lu [47] GLAGOLITIC CAPITAL LETTER AZU..GLAGOLITIC CAPITAL LETTER LATINATE MYSLITE
|
||||
2C30..2C5E;N # Ll [47] GLAGOLITIC SMALL LETTER AZU..GLAGOLITIC SMALL LETTER LATINATE MYSLITE
|
||||
2C60..2C7B;N # L& [28] LATIN CAPITAL LETTER L WITH DOUBLE BAR..LATIN LETTER SMALL CAPITAL TURNED E
|
||||
@ -1403,7 +1409,7 @@
|
||||
2E40;N # Pd DOUBLE HYPHEN
|
||||
2E41;N # Po REVERSED COMMA
|
||||
2E42;N # Ps DOUBLE LOW-REVERSED-9 QUOTATION MARK
|
||||
2E43..2E49;N # Po [7] DASH WITH LEFT UPTURN..DOUBLE STACKED COMMA
|
||||
2E43..2E4E;N # Po [12] DASH WITH LEFT UPTURN..PUNCTUS ELEVATUS MARK
|
||||
2E80..2E99;W # So [26] CJK RADICAL REPEAT..CJK RADICAL RAP
|
||||
2E9B..2EF3;W # So [89] CJK RADICAL CHOKE..CJK RADICAL C-SIMPLIFIED TURTLE
|
||||
2F00..2FD5;W # So [214] KANGXI RADICAL ONE..KANGXI RADICAL FLUTE
|
||||
@ -1459,7 +1465,7 @@
|
||||
30FB;W # Po KATAKANA MIDDLE DOT
|
||||
30FC..30FE;W # Lm [3] KATAKANA-HIRAGANA PROLONGED SOUND MARK..KATAKANA VOICED ITERATION MARK
|
||||
30FF;W # Lo KATAKANA DIGRAPH KOTO
|
||||
3105..312E;W # Lo [42] BOPOMOFO LETTER B..BOPOMOFO LETTER O WITH DOT ABOVE
|
||||
3105..312F;W # Lo [43] BOPOMOFO LETTER B..BOPOMOFO LETTER NN
|
||||
3131..318E;W # Lo [94] HANGUL LETTER KIYEOK..HANGUL LETTER ARAEAE
|
||||
3190..3191;W # So [2] IDEOGRAPHIC ANNOTATION LINKING MARK..IDEOGRAPHIC ANNOTATION REVERSE MARK
|
||||
3192..3195;W # No [4] IDEOGRAPHIC ANNOTATION ONE MARK..IDEOGRAPHIC ANNOTATION FOUR MARK
|
||||
@ -1482,8 +1488,8 @@
|
||||
3400..4DB5;W # Lo [6582] CJK UNIFIED IDEOGRAPH-3400..CJK UNIFIED IDEOGRAPH-4DB5
|
||||
4DB6..4DBF;W # Cn [10] <reserved-4DB6>..<reserved-4DBF>
|
||||
4DC0..4DFF;N # So [64] HEXAGRAM FOR THE CREATIVE HEAVEN..HEXAGRAM FOR BEFORE COMPLETION
|
||||
4E00..9FEA;W # Lo [20971] CJK UNIFIED IDEOGRAPH-4E00..CJK UNIFIED IDEOGRAPH-9FEA
|
||||
9FEB..9FFF;W # Cn [21] <reserved-9FEB>..<reserved-9FFF>
|
||||
4E00..9FEF;W # Lo [20976] CJK UNIFIED IDEOGRAPH-4E00..CJK UNIFIED IDEOGRAPH-9FEF
|
||||
9FF0..9FFF;W # Cn [16] <reserved-9FF0>..<reserved-9FFF>
|
||||
A000..A014;W # Lo [21] YI SYLLABLE IT..YI SYLLABLE E
|
||||
A015;W # Lm YI SYLLABLE WU
|
||||
A016..A48C;W # Lo [1143] YI SYLLABLE BIT..YI SYLLABLE YYR
|
||||
@ -1522,8 +1528,7 @@ A788;N # Lm MODIFIER LETTER LOW CIRCUMFLEX ACCENT
|
||||
A789..A78A;N # Sk [2] MODIFIER LETTER COLON..MODIFIER LETTER SHORT EQUALS SIGN
|
||||
A78B..A78E;N # L& [4] LATIN CAPITAL LETTER SALTILLO..LATIN SMALL LETTER L WITH RETROFLEX HOOK AND BELT
|
||||
A78F;N # Lo LATIN LETTER SINOLOGICAL DOT
|
||||
A790..A7AE;N # L& [31] LATIN CAPITAL LETTER N WITH DESCENDER..LATIN CAPITAL LETTER SMALL CAPITAL I
|
||||
A7B0..A7B7;N # L& [8] LATIN CAPITAL LETTER TURNED K..LATIN SMALL LETTER OMEGA
|
||||
A790..A7B9;N # L& [42] LATIN CAPITAL LETTER N WITH DESCENDER..LATIN SMALL LETTER U WITH STROKE
|
||||
A7F7;N # Lo LATIN EPIGRAPHIC LETTER SIDEWAYS I
|
||||
A7F8..A7F9;N # Lm [2] MODIFIER LETTER CAPITAL H WITH STROKE..MODIFIER LETTER SMALL LIGATURE OE
|
||||
A7FA;N # Ll LATIN LETTER SMALL CAPITAL TURNED M
|
||||
@ -1556,7 +1561,8 @@ A8F2..A8F7;N # Lo [6] DEVANAGARI SIGN SPACING CANDRABINDU..DEVANAGARI SI
|
||||
A8F8..A8FA;N # Po [3] DEVANAGARI SIGN PUSHPIKA..DEVANAGARI CARET
|
||||
A8FB;N # Lo DEVANAGARI HEADSTROKE
|
||||
A8FC;N # Po DEVANAGARI SIGN SIDDHAM
|
||||
A8FD;N # Lo DEVANAGARI JAIN OM
|
||||
A8FD..A8FE;N # Lo [2] DEVANAGARI JAIN OM..DEVANAGARI LETTER AY
|
||||
A8FF;N # Mn DEVANAGARI VOWEL SIGN AY
|
||||
A900..A909;N # Nd [10] KAYAH LI DIGIT ZERO..KAYAH LI DIGIT NINE
|
||||
A90A..A925;N # Lo [28] KAYAH LI LETTER KA..KAYAH LI LETTER OO
|
||||
A926..A92D;N # Mn [8] KAYAH LI VOWEL UE..KAYAH LI TONE CALYA PLOPHU
|
||||
@ -1868,10 +1874,10 @@ FFFD;A # So REPLACEMENT CHARACTER
|
||||
10A0C..10A0F;N # Mn [4] KHAROSHTHI VOWEL LENGTH MARK..KHAROSHTHI SIGN VISARGA
|
||||
10A10..10A13;N # Lo [4] KHAROSHTHI LETTER KA..KHAROSHTHI LETTER GHA
|
||||
10A15..10A17;N # Lo [3] KHAROSHTHI LETTER CA..KHAROSHTHI LETTER JA
|
||||
10A19..10A33;N # Lo [27] KHAROSHTHI LETTER NYA..KHAROSHTHI LETTER TTTHA
|
||||
10A19..10A35;N # Lo [29] KHAROSHTHI LETTER NYA..KHAROSHTHI LETTER VHA
|
||||
10A38..10A3A;N # Mn [3] KHAROSHTHI SIGN BAR ABOVE..KHAROSHTHI SIGN DOT BELOW
|
||||
10A3F;N # Mn KHAROSHTHI VIRAMA
|
||||
10A40..10A47;N # No [8] KHAROSHTHI DIGIT ONE..KHAROSHTHI NUMBER ONE THOUSAND
|
||||
10A40..10A48;N # No [9] KHAROSHTHI DIGIT ONE..KHAROSHTHI FRACTION ONE HALF
|
||||
10A50..10A58;N # Po [9] KHAROSHTHI PUNCTUATION DOT..KHAROSHTHI PUNCTUATION LINES
|
||||
10A60..10A7C;N # Lo [29] OLD SOUTH ARABIAN LETTER HE..OLD SOUTH ARABIAN LETTER THETH
|
||||
10A7D..10A7E;N # No [2] OLD SOUTH ARABIAN NUMBER ONE..OLD SOUTH ARABIAN NUMBER FIFTY
|
||||
@ -1897,7 +1903,17 @@ FFFD;A # So REPLACEMENT CHARACTER
|
||||
10C80..10CB2;N # Lu [51] OLD HUNGARIAN CAPITAL LETTER A..OLD HUNGARIAN CAPITAL LETTER US
|
||||
10CC0..10CF2;N # Ll [51] OLD HUNGARIAN SMALL LETTER A..OLD HUNGARIAN SMALL LETTER US
|
||||
10CFA..10CFF;N # No [6] OLD HUNGARIAN NUMBER ONE..OLD HUNGARIAN NUMBER ONE THOUSAND
|
||||
10D00..10D23;N # Lo [36] HANIFI ROHINGYA LETTER A..HANIFI ROHINGYA MARK NA KHONNA
|
||||
10D24..10D27;N # Mn [4] HANIFI ROHINGYA SIGN HARBAHAY..HANIFI ROHINGYA SIGN TASSI
|
||||
10D30..10D39;N # Nd [10] HANIFI ROHINGYA DIGIT ZERO..HANIFI ROHINGYA DIGIT NINE
|
||||
10E60..10E7E;N # No [31] RUMI DIGIT ONE..RUMI FRACTION TWO THIRDS
|
||||
10F00..10F1C;N # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
|
||||
10F1D..10F26;N # No [10] OLD SOGDIAN NUMBER ONE..OLD SOGDIAN FRACTION ONE HALF
|
||||
10F27;N # Lo OLD SOGDIAN LIGATURE AYIN-DALETH
|
||||
10F30..10F45;N # Lo [22] SOGDIAN LETTER ALEPH..SOGDIAN INDEPENDENT SHIN
|
||||
10F46..10F50;N # Mn [11] SOGDIAN COMBINING DOT BELOW..SOGDIAN COMBINING STROKE BELOW
|
||||
10F51..10F54;N # No [4] SOGDIAN NUMBER ONE..SOGDIAN NUMBER ONE HUNDRED
|
||||
10F55..10F59;N # Po [5] SOGDIAN PUNCTUATION TWO VERTICAL BARS..SOGDIAN PUNCTUATION HALF CIRCLE WITH DOT
|
||||
11000;N # Mc BRAHMI SIGN CANDRABINDU
|
||||
11001;N # Mn BRAHMI SIGN ANUSVARA
|
||||
11002;N # Mc BRAHMI SIGN VISARGA
|
||||
@ -1917,6 +1933,7 @@ FFFD;A # So REPLACEMENT CHARACTER
|
||||
110BB..110BC;N # Po [2] KAITHI ABBREVIATION SIGN..KAITHI ENUMERATION SIGN
|
||||
110BD;N # Cf KAITHI NUMBER SIGN
|
||||
110BE..110C1;N # Po [4] KAITHI SECTION MARK..KAITHI DOUBLE DANDA
|
||||
110CD;N # Cf KAITHI NUMBER SIGN ABOVE
|
||||
110D0..110E8;N # Lo [25] SORA SOMPENG LETTER SAH..SORA SOMPENG LETTER MAE
|
||||
110F0..110F9;N # Nd [10] SORA SOMPENG DIGIT ZERO..SORA SOMPENG DIGIT NINE
|
||||
11100..11102;N # Mn [3] CHAKMA SIGN CANDRABINDU..CHAKMA SIGN VISARGA
|
||||
@ -1926,6 +1943,8 @@ FFFD;A # So REPLACEMENT CHARACTER
|
||||
1112D..11134;N # Mn [8] CHAKMA VOWEL SIGN AI..CHAKMA MAAYYAA
|
||||
11136..1113F;N # Nd [10] CHAKMA DIGIT ZERO..CHAKMA DIGIT NINE
|
||||
11140..11143;N # Po [4] CHAKMA SECTION MARK..CHAKMA QUESTION MARK
|
||||
11144;N # Lo CHAKMA LETTER LHAA
|
||||
11145..11146;N # Mc [2] CHAKMA VOWEL SIGN AA..CHAKMA VOWEL SIGN EI
|
||||
11150..11172;N # Lo [35] MAHAJANI LETTER A..MAHAJANI LETTER RRA
|
||||
11173;N # Mn MAHAJANI SIGN NUKTA
|
||||
11174..11175;N # Po [2] MAHAJANI ABBREVIATION SIGN..MAHAJANI SECTION MARK
|
||||
@ -1937,8 +1956,8 @@ FFFD;A # So REPLACEMENT CHARACTER
|
||||
111B6..111BE;N # Mn [9] SHARADA VOWEL SIGN U..SHARADA VOWEL SIGN O
|
||||
111BF..111C0;N # Mc [2] SHARADA VOWEL SIGN AU..SHARADA SIGN VIRAMA
|
||||
111C1..111C4;N # Lo [4] SHARADA SIGN AVAGRAHA..SHARADA OM
|
||||
111C5..111C9;N # Po [5] SHARADA DANDA..SHARADA SANDHI MARK
|
||||
111CA..111CC;N # Mn [3] SHARADA SIGN NUKTA..SHARADA EXTRA SHORT VOWEL MARK
|
||||
111C5..111C8;N # Po [4] SHARADA DANDA..SHARADA SEPARATOR
|
||||
111C9..111CC;N # Mn [4] SHARADA SANDHI MARK..SHARADA EXTRA SHORT VOWEL MARK
|
||||
111CD;N # Po SHARADA SUTRA MARK
|
||||
111D0..111D9;N # Nd [10] SHARADA DIGIT ZERO..SHARADA DIGIT NINE
|
||||
111DA;N # Lo SHARADA EKAM
|
||||
@ -1975,7 +1994,7 @@ FFFD;A # So REPLACEMENT CHARACTER
|
||||
1132A..11330;N # Lo [7] GRANTHA LETTER PA..GRANTHA LETTER RA
|
||||
11332..11333;N # Lo [2] GRANTHA LETTER LA..GRANTHA LETTER LLA
|
||||
11335..11339;N # Lo [5] GRANTHA LETTER VA..GRANTHA LETTER HA
|
||||
1133C;N # Mn GRANTHA SIGN NUKTA
|
||||
1133B..1133C;N # Mn [2] COMBINING BINDU BELOW..GRANTHA SIGN NUKTA
|
||||
1133D;N # Lo GRANTHA SIGN AVAGRAHA
|
||||
1133E..1133F;N # Mc [2] GRANTHA VOWEL SIGN AA..GRANTHA VOWEL SIGN I
|
||||
11340;N # Mn GRANTHA VOWEL SIGN II
|
||||
@ -2000,6 +2019,7 @@ FFFD;A # So REPLACEMENT CHARACTER
|
||||
11450..11459;N # Nd [10] NEWA DIGIT ZERO..NEWA DIGIT NINE
|
||||
1145B;N # Po NEWA PLACEHOLDER MARK
|
||||
1145D;N # Po NEWA INSERTION SIGN
|
||||
1145E;N # Mn NEWA SANDHI MARK
|
||||
11480..114AF;N # Lo [48] TIRHUTA ANJI..TIRHUTA LETTER HA
|
||||
114B0..114B2;N # Mc [3] TIRHUTA VOWEL SIGN AA..TIRHUTA VOWEL SIGN II
|
||||
114B3..114B8;N # Mn [6] TIRHUTA VOWEL SIGN U..TIRHUTA VOWEL SIGN VOCALIC LL
|
||||
@ -2043,7 +2063,7 @@ FFFD;A # So REPLACEMENT CHARACTER
|
||||
116B6;N # Mc TAKRI SIGN VIRAMA
|
||||
116B7;N # Mn TAKRI SIGN NUKTA
|
||||
116C0..116C9;N # Nd [10] TAKRI DIGIT ZERO..TAKRI DIGIT NINE
|
||||
11700..11719;N # Lo [26] AHOM LETTER KA..AHOM LETTER JHA
|
||||
11700..1171A;N # Lo [27] AHOM LETTER KA..AHOM LETTER ALTERNATE BA
|
||||
1171D..1171F;N # Mn [3] AHOM CONSONANT SIGN MEDIAL LA..AHOM CONSONANT SIGN MEDIAL LIGATING RA
|
||||
11720..11721;N # Mc [2] AHOM VOWEL SIGN A..AHOM VOWEL SIGN AA
|
||||
11722..11725;N # Mn [4] AHOM VOWEL SIGN I..AHOM VOWEL SIGN UU
|
||||
@ -2053,14 +2073,18 @@ FFFD;A # So REPLACEMENT CHARACTER
|
||||
1173A..1173B;N # No [2] AHOM NUMBER TEN..AHOM NUMBER TWENTY
|
||||
1173C..1173E;N # Po [3] AHOM SIGN SMALL SECTION..AHOM SIGN RULAI
|
||||
1173F;N # So AHOM SYMBOL VI
|
||||
11800..1182B;N # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA
|
||||
1182C..1182E;N # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
|
||||
1182F..11837;N # Mn [9] DOGRA VOWEL SIGN U..DOGRA SIGN ANUSVARA
|
||||
11838;N # Mc DOGRA SIGN VISARGA
|
||||
11839..1183A;N # Mn [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
|
||||
1183B;N # Po DOGRA ABBREVIATION SIGN
|
||||
118A0..118DF;N # L& [64] WARANG CITI CAPITAL LETTER NGAA..WARANG CITI SMALL LETTER VIYO
|
||||
118E0..118E9;N # Nd [10] WARANG CITI DIGIT ZERO..WARANG CITI DIGIT NINE
|
||||
118EA..118F2;N # No [9] WARANG CITI NUMBER TEN..WARANG CITI NUMBER NINETY
|
||||
118FF;N # Lo WARANG CITI OM
|
||||
11A00;N # Lo ZANABAZAR SQUARE LETTER A
|
||||
11A01..11A06;N # Mn [6] ZANABAZAR SQUARE VOWEL SIGN I..ZANABAZAR SQUARE VOWEL SIGN O
|
||||
11A07..11A08;N # Mc [2] ZANABAZAR SQUARE VOWEL SIGN AI..ZANABAZAR SQUARE VOWEL SIGN AU
|
||||
11A09..11A0A;N # Mn [2] ZANABAZAR SQUARE VOWEL SIGN REVERSED I..ZANABAZAR SQUARE VOWEL LENGTH MARK
|
||||
11A01..11A0A;N # Mn [10] ZANABAZAR SQUARE VOWEL SIGN I..ZANABAZAR SQUARE VOWEL LENGTH MARK
|
||||
11A0B..11A32;N # Lo [40] ZANABAZAR SQUARE LETTER KA..ZANABAZAR SQUARE LETTER KSSA
|
||||
11A33..11A38;N # Mn [6] ZANABAZAR SQUARE FINAL CONSONANT MARK..ZANABAZAR SQUARE SIGN ANUSVARA
|
||||
11A39;N # Mc ZANABAZAR SQUARE SIGN VISARGA
|
||||
@ -2078,6 +2102,7 @@ FFFD;A # So REPLACEMENT CHARACTER
|
||||
11A97;N # Mc SOYOMBO SIGN VISARGA
|
||||
11A98..11A99;N # Mn [2] SOYOMBO GEMINATION MARK..SOYOMBO SUBJOINER
|
||||
11A9A..11A9C;N # Po [3] SOYOMBO MARK TSHEG..SOYOMBO MARK DOUBLE SHAD
|
||||
11A9D;N # Lo SOYOMBO MARK PLUTA
|
||||
11A9E..11AA2;N # Po [5] SOYOMBO HEAD MARK WITH MOON AND SUN AND TRIPLE FLAME..SOYOMBO TERMINAL MARK-2
|
||||
11AC0..11AF8;N # Lo [57] PAU CIN HAU LETTER PA..PAU CIN HAU GLOTTAL STOP FINAL
|
||||
11C00..11C08;N # Lo [9] BHAIKSUKI LETTER A..BHAIKSUKI LETTER VOCALIC L
|
||||
@ -2110,6 +2135,21 @@ FFFD;A # So REPLACEMENT CHARACTER
|
||||
11D46;N # Lo MASARAM GONDI REPHA
|
||||
11D47;N # Mn MASARAM GONDI RA-KARA
|
||||
11D50..11D59;N # Nd [10] MASARAM GONDI DIGIT ZERO..MASARAM GONDI DIGIT NINE
|
||||
11D60..11D65;N # Lo [6] GUNJALA GONDI LETTER A..GUNJALA GONDI LETTER UU
|
||||
11D67..11D68;N # Lo [2] GUNJALA GONDI LETTER EE..GUNJALA GONDI LETTER AI
|
||||
11D6A..11D89;N # Lo [32] GUNJALA GONDI LETTER OO..GUNJALA GONDI LETTER SA
|
||||
11D8A..11D8E;N # Mc [5] GUNJALA GONDI VOWEL SIGN AA..GUNJALA GONDI VOWEL SIGN UU
|
||||
11D90..11D91;N # Mn [2] GUNJALA GONDI VOWEL SIGN EE..GUNJALA GONDI VOWEL SIGN AI
|
||||
11D93..11D94;N # Mc [2] GUNJALA GONDI VOWEL SIGN OO..GUNJALA GONDI VOWEL SIGN AU
|
||||
11D95;N # Mn GUNJALA GONDI SIGN ANUSVARA
|
||||
11D96;N # Mc GUNJALA GONDI SIGN VISARGA
|
||||
11D97;N # Mn GUNJALA GONDI VIRAMA
|
||||
11D98;N # Lo GUNJALA GONDI OM
|
||||
11DA0..11DA9;N # Nd [10] GUNJALA GONDI DIGIT ZERO..GUNJALA GONDI DIGIT NINE
|
||||
11EE0..11EF2;N # Lo [19] MAKASAR LETTER KA..MAKASAR ANGKA
|
||||
11EF3..11EF4;N # Mn [2] MAKASAR VOWEL SIGN I..MAKASAR VOWEL SIGN U
|
||||
11EF5..11EF6;N # Mc [2] MAKASAR VOWEL SIGN E..MAKASAR VOWEL SIGN O
|
||||
11EF7..11EF8;N # Po [2] MAKASAR PASSIMBANG..MAKASAR END OF SECTION
|
||||
12000..12399;N # Lo [922] CUNEIFORM SIGN A..CUNEIFORM SIGN U U
|
||||
12400..1246E;N # Nl [111] CUNEIFORM NUMERIC SIGN TWO ASH..CUNEIFORM NUMERIC SIGN NINE U VARIANT FORM
|
||||
12470..12474;N # Po [5] CUNEIFORM PUNCTUATION SIGN OLD ASSYRIAN WORD DIVIDER..CUNEIFORM PUNCTUATION SIGN DIAGONAL QUADCOLON
|
||||
@ -2134,13 +2174,16 @@ FFFD;A # So REPLACEMENT CHARACTER
|
||||
16B5B..16B61;N # No [7] PAHAWH HMONG NUMBER TENS..PAHAWH HMONG NUMBER TRILLIONS
|
||||
16B63..16B77;N # Lo [21] PAHAWH HMONG SIGN VOS LUB..PAHAWH HMONG SIGN CIM NRES TOS
|
||||
16B7D..16B8F;N # Lo [19] PAHAWH HMONG CLAN SIGN TSHEEJ..PAHAWH HMONG CLAN SIGN VWJ
|
||||
16E40..16E7F;N # L& [64] MEDEFAIDRIN CAPITAL LETTER M..MEDEFAIDRIN SMALL LETTER Y
|
||||
16E80..16E96;N # No [23] MEDEFAIDRIN DIGIT ZERO..MEDEFAIDRIN DIGIT THREE ALTERNATE FORM
|
||||
16E97..16E9A;N # Po [4] MEDEFAIDRIN COMMA..MEDEFAIDRIN EXCLAMATION OH
|
||||
16F00..16F44;N # Lo [69] MIAO LETTER PA..MIAO LETTER HHA
|
||||
16F50;N # Lo MIAO LETTER NASALIZATION
|
||||
16F51..16F7E;N # Mc [46] MIAO SIGN ASPIRATION..MIAO VOWEL SIGN NG
|
||||
16F8F..16F92;N # Mn [4] MIAO TONE RIGHT..MIAO TONE BELOW
|
||||
16F93..16F9F;N # Lm [13] MIAO LETTER TONE-2..MIAO LETTER REFORMED TONE-8
|
||||
16FE0..16FE1;W # Lm [2] TANGUT ITERATION MARK..NUSHU ITERATION MARK
|
||||
17000..187EC;W # Lo [6125] TANGUT IDEOGRAPH-17000..TANGUT IDEOGRAPH-187EC
|
||||
17000..187F1;W # Lo [6130] TANGUT IDEOGRAPH-17000..TANGUT IDEOGRAPH-187F1
|
||||
18800..18AF2;W # Lo [755] TANGUT COMPONENT-001..TANGUT COMPONENT-755
|
||||
1B000..1B0FF;W # Lo [256] KATAKANA LETTER ARCHAIC E..HENTAIGANA LETTER RE-2
|
||||
1B100..1B11E;W # Lo [31] HENTAIGANA LETTER RE-3..HENTAIGANA LETTER N-MU-MO-2
|
||||
@ -2170,8 +2213,9 @@ FFFD;A # So REPLACEMENT CHARACTER
|
||||
1D200..1D241;N # So [66] GREEK VOCAL NOTATION SYMBOL-1..GREEK INSTRUMENTAL NOTATION SYMBOL-54
|
||||
1D242..1D244;N # Mn [3] COMBINING GREEK MUSICAL TRISEME..COMBINING GREEK MUSICAL PENTASEME
|
||||
1D245;N # So GREEK MUSICAL LEIMMA
|
||||
1D2E0..1D2F3;N # No [20] MAYAN NUMERAL ZERO..MAYAN NUMERAL NINETEEN
|
||||
1D300..1D356;N # So [87] MONOGRAM FOR EARTH..TETRAGRAM FOR FOSTERING
|
||||
1D360..1D371;N # No [18] COUNTING ROD UNIT DIGIT ONE..COUNTING ROD TENS DIGIT NINE
|
||||
1D360..1D378;N # No [25] COUNTING ROD UNIT DIGIT ONE..TALLY MARK FIVE
|
||||
1D400..1D454;N # L& [85] MATHEMATICAL BOLD CAPITAL A..MATHEMATICAL ITALIC SMALL G
|
||||
1D456..1D49C;N # L& [71] MATHEMATICAL ITALIC SMALL I..MATHEMATICAL SCRIPT CAPITAL A
|
||||
1D49E..1D49F;N # Lu [2] MATHEMATICAL SCRIPT CAPITAL C..MATHEMATICAL SCRIPT CAPITAL D
|
||||
@ -2237,6 +2281,11 @@ FFFD;A # So REPLACEMENT CHARACTER
|
||||
1E944..1E94A;N # Mn [7] ADLAM ALIF LENGTHENER..ADLAM NUKTA
|
||||
1E950..1E959;N # Nd [10] ADLAM DIGIT ZERO..ADLAM DIGIT NINE
|
||||
1E95E..1E95F;N # Po [2] ADLAM INITIAL EXCLAMATION MARK..ADLAM INITIAL QUESTION MARK
|
||||
1EC71..1ECAB;N # No [59] INDIC SIYAQ NUMBER ONE..INDIC SIYAQ NUMBER PREFIXED NINE
|
||||
1ECAC;N # So INDIC SIYAQ PLACEHOLDER
|
||||
1ECAD..1ECAF;N # No [3] INDIC SIYAQ FRACTION ONE QUARTER..INDIC SIYAQ FRACTION THREE QUARTERS
|
||||
1ECB0;N # Sc INDIC SIYAQ RUPEE MARK
|
||||
1ECB1..1ECB4;N # No [4] INDIC SIYAQ NUMBER ALTERNATE ONE..INDIC SIYAQ ALTERNATE LAKH MARK
|
||||
1EE00..1EE03;N # Lo [4] ARABIC MATHEMATICAL ALEF..ARABIC MATHEMATICAL DAL
|
||||
1EE05..1EE1F;N # Lo [27] ARABIC MATHEMATICAL WAW..ARABIC MATHEMATICAL DOTLESS QAF
|
||||
1EE21..1EE22;N # Lo [2] ARABIC MATHEMATICAL INITIAL BEH..ARABIC MATHEMATICAL INITIAL JEEM
|
||||
@ -2283,7 +2332,7 @@ FFFD;A # So REPLACEMENT CHARACTER
|
||||
1F100..1F10A;A # No [11] DIGIT ZERO FULL STOP..DIGIT NINE COMMA
|
||||
1F10B..1F10C;N # No [2] DINGBAT CIRCLED SANS-SERIF DIGIT ZERO..DINGBAT NEGATIVE CIRCLED SANS-SERIF DIGIT ZERO
|
||||
1F110..1F12D;A # So [30] PARENTHESIZED LATIN CAPITAL LETTER A..CIRCLED CD
|
||||
1F12E;N # So CIRCLED WZ
|
||||
1F12E..1F12F;N # So [2] CIRCLED WZ..COPYLEFT SYMBOL
|
||||
1F130..1F169;A # So [58] SQUARED LATIN CAPITAL LETTER A..NEGATIVE CIRCLED LATIN CAPITAL LETTER Z
|
||||
1F16A..1F16B;N # So [2] RAISED MC SIGN..RAISED MD SIGN
|
||||
1F170..1F18D;A # So [30] NEGATIVE SQUARED LATIN CAPITAL LETTER A..NEGATIVE SQUARED SA
|
||||
@ -2345,9 +2394,9 @@ FFFD;A # So REPLACEMENT CHARACTER
|
||||
1F6E0..1F6EA;N # So [11] HAMMER AND WRENCH..NORTHEAST-POINTING AIRPLANE
|
||||
1F6EB..1F6EC;W # So [2] AIRPLANE DEPARTURE..AIRPLANE ARRIVING
|
||||
1F6F0..1F6F3;N # So [4] SATELLITE..PASSENGER SHIP
|
||||
1F6F4..1F6F8;W # So [5] SCOOTER..FLYING SAUCER
|
||||
1F6F4..1F6F9;W # So [6] SCOOTER..SKATEBOARD
|
||||
1F700..1F773;N # So [116] ALCHEMICAL SYMBOL FOR QUINTESSENCE..ALCHEMICAL SYMBOL FOR HALF OUNCE
|
||||
1F780..1F7D4;N # So [85] BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE..HEAVY TWELVE POINTED PINWHEEL STAR
|
||||
1F780..1F7D8;N # So [89] BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE..NEGATIVE CIRCLED SQUARE
|
||||
1F800..1F80B;N # So [12] LEFTWARDS ARROW WITH SMALL TRIANGLE ARROWHEAD..DOWNWARDS ARROW WITH LARGE TRIANGLE ARROWHEAD
|
||||
1F810..1F847;N # So [56] LEFTWARDS ARROW WITH SMALL EQUILATERAL ARROWHEAD..DOWNWARDS HEAVY ARROW
|
||||
1F850..1F859;N # So [10] LEFTWARDS SANS-SERIF ARROW..UP DOWN SANS-SERIF ARROW
|
||||
@ -2355,11 +2404,14 @@ FFFD;A # So REPLACEMENT CHARACTER
|
||||
1F890..1F8AD;N # So [30] LEFTWARDS TRIANGLE ARROWHEAD..WHITE ARROW SHAFT WIDTH TWO THIRDS
|
||||
1F900..1F90B;N # So [12] CIRCLED CROSS FORMEE WITH FOUR DOTS..DOWNWARD FACING NOTCHED HOOK WITH DOT
|
||||
1F910..1F93E;W # So [47] ZIPPER-MOUTH FACE..HANDBALL
|
||||
1F940..1F94C;W # So [13] WILTED FLOWER..CURLING STONE
|
||||
1F950..1F96B;W # So [28] CROISSANT..CANNED FOOD
|
||||
1F980..1F997;W # So [24] CRAB..CRICKET
|
||||
1F9C0;W # So CHEESE WEDGE
|
||||
1F9D0..1F9E6;W # So [23] FACE WITH MONOCLE..SOCKS
|
||||
1F940..1F970;W # So [49] WILTED FLOWER..SMILING FACE WITH SMILING EYES AND THREE HEARTS
|
||||
1F973..1F976;W # So [4] FACE WITH PARTY HORN AND PARTY HAT..FREEZING FACE
|
||||
1F97A;W # So FACE WITH PLEADING EYES
|
||||
1F97C..1F9A2;W # So [39] LAB COAT..SWAN
|
||||
1F9B0..1F9B9;W # So [10] EMOJI COMPONENT RED HAIR..SUPERVILLAIN
|
||||
1F9C0..1F9C2;W # So [3] CHEESE WEDGE..SALT SHAKER
|
||||
1F9D0..1F9FF;W # So [48] FACE WITH MONOCLE..NAZAR AMULET
|
||||
1FA60..1FA6D;N # So [14] XIANGQI RED GENERAL..XIANGQI BLACK SOLDIER
|
||||
20000..2A6D6;W # Lo [42711] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6D6
|
||||
2A6D7..2A6FF;W # Cn [41] <reserved-2A6D7>..<reserved-2A6FF>
|
||||
2A700..2B734;W # Lo [4149] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B734
|
||||
@ -2371,7 +2423,8 @@ FFFD;A # So REPLACEMENT CHARACTER
|
||||
2CEB0..2EBE0;W # Lo [7473] CJK UNIFIED IDEOGRAPH-2CEB0..CJK UNIFIED IDEOGRAPH-2EBE0
|
||||
2EBE1..2F7FF;W # Cn [3103] <reserved-2EBE1>..<reserved-2F7FF>
|
||||
2F800..2FA1D;W # Lo [542] CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPATIBILITY IDEOGRAPH-2FA1D
|
||||
2FA1E..2FFFD;W # Cn [1504] <reserved-2FA1E>..<reserved-2FFFD>
|
||||
2FA1E..2FA1F;W # Cn [2] <reserved-2FA1E>..<reserved-2FA1F>
|
||||
2FA20..2FFFD;W # Cn [1502] <reserved-2FA20>..<reserved-2FFFD>
|
||||
30000..3FFFD;W # Cn [65534] <reserved-30000>..<reserved-3FFFD>
|
||||
E0001;N # Cf LANGUAGE TAG
|
||||
E0020..E007F;N # Cf [96] TAG SPACE..CANCEL TAG
|
||||
|
@ -35,7 +35,7 @@
|
||||
# files for making modifications.
|
||||
|
||||
|
||||
UNICODE_VERSION = 10.0.0
|
||||
UNICODE_VERSION = 11.0.0
|
||||
|
||||
PYTHON3 = python3
|
||||
WGET = wget
|
||||
|
@ -1,6 +1,6 @@
|
||||
# PropList-10.0.0.txt
|
||||
# Date: 2017-03-10, 08:25:30 GMT
|
||||
# © 2017 Unicode®, Inc.
|
||||
# PropList-11.0.0.txt
|
||||
# Date: 2018-03-15, 04:28:35 GMT
|
||||
# © 2018 Unicode®, Inc.
|
||||
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
|
||||
# For terms of use, see http://www.unicode.org/terms_of_use.html
|
||||
#
|
||||
@ -125,7 +125,7 @@ FF63 ; Quotation_Mark # Pe HALFWIDTH RIGHT CORNER BRACKET
|
||||
05C3 ; Terminal_Punctuation # Po HEBREW PUNCTUATION SOF PASUQ
|
||||
060C ; Terminal_Punctuation # Po ARABIC COMMA
|
||||
061B ; Terminal_Punctuation # Po ARABIC SEMICOLON
|
||||
061F ; Terminal_Punctuation # Po ARABIC QUESTION MARK
|
||||
061E..061F ; Terminal_Punctuation # Po [2] ARABIC TRIPLE DOT PUNCTUATION MARK..ARABIC QUESTION MARK
|
||||
06D4 ; Terminal_Punctuation # Po ARABIC FULL STOP
|
||||
0700..070A ; Terminal_Punctuation # Po [11] SYRIAC END OF PARAGRAPH..SYRIAC CONTRACTION
|
||||
070C ; Terminal_Punctuation # Po SYRIAC HARKLEAN METOBELUS
|
||||
@ -156,6 +156,8 @@ FF63 ; Quotation_Mark # Pe HALFWIDTH RIGHT CORNER BRACKET
|
||||
2E2E ; Terminal_Punctuation # Po REVERSED QUESTION MARK
|
||||
2E3C ; Terminal_Punctuation # Po STENOGRAPHIC FULL STOP
|
||||
2E41 ; Terminal_Punctuation # Po REVERSED COMMA
|
||||
2E4C ; Terminal_Punctuation # Po MEDIEVAL COMMA
|
||||
2E4E ; Terminal_Punctuation # Po PUNCTUS ELEVATUS MARK
|
||||
3001..3002 ; Terminal_Punctuation # Po [2] IDEOGRAPHIC COMMA..IDEOGRAPHIC FULL STOP
|
||||
A4FE..A4FF ; Terminal_Punctuation # Po [2] LISU PUNCTUATION COMMA..LISU PUNCTUATION FULL STOP
|
||||
A60D..A60F ; Terminal_Punctuation # Po [3] VAI COMMA..VAI QUESTION MARK
|
||||
@ -185,6 +187,7 @@ FF64 ; Terminal_Punctuation # Po HALFWIDTH IDEOGRAPHIC COMMA
|
||||
10AF0..10AF5 ; Terminal_Punctuation # Po [6] MANICHAEAN PUNCTUATION STAR..MANICHAEAN PUNCTUATION TWO DOTS
|
||||
10B3A..10B3F ; Terminal_Punctuation # Po [6] TINY TWO DOTS OVER ONE DOT PUNCTUATION..LARGE ONE RING OVER TWO RINGS PUNCTUATION
|
||||
10B99..10B9C ; Terminal_Punctuation # Po [4] PSALTER PAHLAVI SECTION MARK..PSALTER PAHLAVI FOUR DOTS WITH DOT
|
||||
10F55..10F59 ; Terminal_Punctuation # Po [5] SOGDIAN PUNCTUATION TWO VERTICAL BARS..SOGDIAN PUNCTUATION HALF CIRCLE WITH DOT
|
||||
11047..1104D ; Terminal_Punctuation # Po [7] BRAHMI DANDA..BRAHMI PUNCTUATION LOTUS
|
||||
110BE..110C1 ; Terminal_Punctuation # Po [4] KAITHI SECTION MARK..KAITHI DOUBLE DANDA
|
||||
11141..11143 ; Terminal_Punctuation # Po [3] CHAKMA DANDA..CHAKMA QUESTION MARK
|
||||
@ -204,15 +207,17 @@ FF64 ; Terminal_Punctuation # Po HALFWIDTH IDEOGRAPHIC COMMA
|
||||
11AA1..11AA2 ; Terminal_Punctuation # Po [2] SOYOMBO TERMINAL MARK-1..SOYOMBO TERMINAL MARK-2
|
||||
11C41..11C43 ; Terminal_Punctuation # Po [3] BHAIKSUKI DANDA..BHAIKSUKI WORD SEPARATOR
|
||||
11C71 ; Terminal_Punctuation # Po MARCHEN MARK SHAD
|
||||
11EF7..11EF8 ; Terminal_Punctuation # Po [2] MAKASAR PASSIMBANG..MAKASAR END OF SECTION
|
||||
12470..12474 ; Terminal_Punctuation # Po [5] CUNEIFORM PUNCTUATION SIGN OLD ASSYRIAN WORD DIVIDER..CUNEIFORM PUNCTUATION SIGN DIAGONAL QUADCOLON
|
||||
16A6E..16A6F ; Terminal_Punctuation # Po [2] MRO DANDA..MRO DOUBLE DANDA
|
||||
16AF5 ; Terminal_Punctuation # Po BASSA VAH FULL STOP
|
||||
16B37..16B39 ; Terminal_Punctuation # Po [3] PAHAWH HMONG SIGN VOS THOM..PAHAWH HMONG SIGN CIM CHEEM
|
||||
16B44 ; Terminal_Punctuation # Po PAHAWH HMONG SIGN XAUS
|
||||
16E97..16E98 ; Terminal_Punctuation # Po [2] MEDEFAIDRIN COMMA..MEDEFAIDRIN FULL STOP
|
||||
1BC9F ; Terminal_Punctuation # Po DUPLOYAN PUNCTUATION CHINOOK FULL STOP
|
||||
1DA87..1DA8A ; Terminal_Punctuation # Po [4] SIGNWRITING COMMA..SIGNWRITING COLON
|
||||
|
||||
# Total code points: 252
|
||||
# Total code points: 264
|
||||
|
||||
# ================================================
|
||||
|
||||
@ -661,6 +666,7 @@ FB1E ; Other_Alphabetic # Mn HEBREW POINT JUDEO-SPANISH VARIKA
|
||||
10A01..10A03 ; Other_Alphabetic # Mn [3] KHAROSHTHI VOWEL SIGN I..KHAROSHTHI VOWEL SIGN VOCALIC R
|
||||
10A05..10A06 ; Other_Alphabetic # Mn [2] KHAROSHTHI VOWEL SIGN E..KHAROSHTHI VOWEL SIGN O
|
||||
10A0C..10A0F ; Other_Alphabetic # Mn [4] KHAROSHTHI VOWEL LENGTH MARK..KHAROSHTHI SIGN VISARGA
|
||||
10D24..10D27 ; Other_Alphabetic # Mn [4] HANIFI ROHINGYA SIGN HARBAHAY..HANIFI ROHINGYA SIGN TASSI
|
||||
11000 ; Other_Alphabetic # Mc BRAHMI SIGN CANDRABINDU
|
||||
11001 ; Other_Alphabetic # Mn BRAHMI SIGN ANUSVARA
|
||||
11002 ; Other_Alphabetic # Mc BRAHMI SIGN VISARGA
|
||||
@ -673,6 +679,7 @@ FB1E ; Other_Alphabetic # Mn HEBREW POINT JUDEO-SPANISH VARIKA
|
||||
11127..1112B ; Other_Alphabetic # Mn [5] CHAKMA VOWEL SIGN A..CHAKMA VOWEL SIGN UU
|
||||
1112C ; Other_Alphabetic # Mc CHAKMA VOWEL SIGN E
|
||||
1112D..11132 ; Other_Alphabetic # Mn [6] CHAKMA VOWEL SIGN AI..CHAKMA AU MARK
|
||||
11145..11146 ; Other_Alphabetic # Mc [2] CHAKMA VOWEL SIGN AA..CHAKMA VOWEL SIGN EI
|
||||
11180..11181 ; Other_Alphabetic # Mn [2] SHARADA SIGN CANDRABINDU..SHARADA SIGN ANUSVARA
|
||||
11182 ; Other_Alphabetic # Mc SHARADA SIGN VISARGA
|
||||
111B3..111B5 ; Other_Alphabetic # Mc [3] SHARADA VOWEL SIGN AA..SHARADA VOWEL SIGN II
|
||||
@ -730,9 +737,10 @@ FB1E ; Other_Alphabetic # Mn HEBREW POINT JUDEO-SPANISH VARIKA
|
||||
11722..11725 ; Other_Alphabetic # Mn [4] AHOM VOWEL SIGN I..AHOM VOWEL SIGN UU
|
||||
11726 ; Other_Alphabetic # Mc AHOM VOWEL SIGN E
|
||||
11727..1172A ; Other_Alphabetic # Mn [4] AHOM VOWEL SIGN AW..AHOM VOWEL SIGN AM
|
||||
11A01..11A06 ; Other_Alphabetic # Mn [6] ZANABAZAR SQUARE VOWEL SIGN I..ZANABAZAR SQUARE VOWEL SIGN O
|
||||
11A07..11A08 ; Other_Alphabetic # Mc [2] ZANABAZAR SQUARE VOWEL SIGN AI..ZANABAZAR SQUARE VOWEL SIGN AU
|
||||
11A09..11A0A ; Other_Alphabetic # Mn [2] ZANABAZAR SQUARE VOWEL SIGN REVERSED I..ZANABAZAR SQUARE VOWEL LENGTH MARK
|
||||
1182C..1182E ; Other_Alphabetic # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
|
||||
1182F..11837 ; Other_Alphabetic # Mn [9] DOGRA VOWEL SIGN U..DOGRA SIGN ANUSVARA
|
||||
11838 ; Other_Alphabetic # Mc DOGRA SIGN VISARGA
|
||||
11A01..11A0A ; Other_Alphabetic # Mn [10] ZANABAZAR SQUARE VOWEL SIGN I..ZANABAZAR SQUARE VOWEL LENGTH MARK
|
||||
11A35..11A38 ; Other_Alphabetic # Mn [4] ZANABAZAR SQUARE SIGN CANDRABINDU..ZANABAZAR SQUARE SIGN ANUSVARA
|
||||
11A39 ; Other_Alphabetic # Mc ZANABAZAR SQUARE SIGN VISARGA
|
||||
11A3B..11A3E ; Other_Alphabetic # Mn [4] ZANABAZAR SQUARE CLUSTER-FINAL LETTER YA..ZANABAZAR SQUARE CLUSTER-FINAL LETTER VA
|
||||
@ -758,6 +766,13 @@ FB1E ; Other_Alphabetic # Mn HEBREW POINT JUDEO-SPANISH VARIKA
|
||||
11D3F..11D41 ; Other_Alphabetic # Mn [3] MASARAM GONDI VOWEL SIGN AU..MASARAM GONDI SIGN VISARGA
|
||||
11D43 ; Other_Alphabetic # Mn MASARAM GONDI SIGN CANDRA
|
||||
11D47 ; Other_Alphabetic # Mn MASARAM GONDI RA-KARA
|
||||
11D8A..11D8E ; Other_Alphabetic # Mc [5] GUNJALA GONDI VOWEL SIGN AA..GUNJALA GONDI VOWEL SIGN UU
|
||||
11D90..11D91 ; Other_Alphabetic # Mn [2] GUNJALA GONDI VOWEL SIGN EE..GUNJALA GONDI VOWEL SIGN AI
|
||||
11D93..11D94 ; Other_Alphabetic # Mc [2] GUNJALA GONDI VOWEL SIGN OO..GUNJALA GONDI VOWEL SIGN AU
|
||||
11D95 ; Other_Alphabetic # Mn GUNJALA GONDI SIGN ANUSVARA
|
||||
11D96 ; Other_Alphabetic # Mc GUNJALA GONDI SIGN VISARGA
|
||||
11EF3..11EF4 ; Other_Alphabetic # Mn [2] MAKASAR VOWEL SIGN I..MAKASAR VOWEL SIGN U
|
||||
11EF5..11EF6 ; Other_Alphabetic # Mc [2] MAKASAR VOWEL SIGN E..MAKASAR VOWEL SIGN O
|
||||
16B30..16B36 ; Other_Alphabetic # Mn [7] PAHAWH HMONG MARK CIM TUB..PAHAWH HMONG MARK CIM TAUM
|
||||
16F51..16F7E ; Other_Alphabetic # Mc [46] MIAO SIGN ASPIRATION..MIAO VOWEL SIGN NG
|
||||
1BC9E ; Other_Alphabetic # Mn DUPLOYAN DOUBLE MARK
|
||||
@ -771,7 +786,7 @@ FB1E ; Other_Alphabetic # Mn HEBREW POINT JUDEO-SPANISH VARIKA
|
||||
1F150..1F169 ; Other_Alphabetic # So [26] NEGATIVE CIRCLED LATIN CAPITAL LETTER A..NEGATIVE CIRCLED LATIN CAPITAL LETTER Z
|
||||
1F170..1F189 ; Other_Alphabetic # So [26] NEGATIVE SQUARED LATIN CAPITAL LETTER A..NEGATIVE SQUARED LATIN CAPITAL LETTER Z
|
||||
|
||||
# Total code points: 1300
|
||||
# Total code points: 1334
|
||||
|
||||
# ================================================
|
||||
|
||||
@ -780,10 +795,10 @@ FB1E ; Other_Alphabetic # Mn HEBREW POINT JUDEO-SPANISH VARIKA
|
||||
3021..3029 ; Ideographic # Nl [9] HANGZHOU NUMERAL ONE..HANGZHOU NUMERAL NINE
|
||||
3038..303A ; Ideographic # Nl [3] HANGZHOU NUMERAL TEN..HANGZHOU NUMERAL THIRTY
|
||||
3400..4DB5 ; Ideographic # Lo [6582] CJK UNIFIED IDEOGRAPH-3400..CJK UNIFIED IDEOGRAPH-4DB5
|
||||
4E00..9FEA ; Ideographic # Lo [20971] CJK UNIFIED IDEOGRAPH-4E00..CJK UNIFIED IDEOGRAPH-9FEA
|
||||
4E00..9FEF ; Ideographic # Lo [20976] CJK UNIFIED IDEOGRAPH-4E00..CJK UNIFIED IDEOGRAPH-9FEF
|
||||
F900..FA6D ; Ideographic # Lo [366] CJK COMPATIBILITY IDEOGRAPH-F900..CJK COMPATIBILITY IDEOGRAPH-FA6D
|
||||
FA70..FAD9 ; Ideographic # Lo [106] CJK COMPATIBILITY IDEOGRAPH-FA70..CJK COMPATIBILITY IDEOGRAPH-FAD9
|
||||
17000..187EC ; Ideographic # Lo [6125] TANGUT IDEOGRAPH-17000..TANGUT IDEOGRAPH-187EC
|
||||
17000..187F1 ; Ideographic # Lo [6130] TANGUT IDEOGRAPH-17000..TANGUT IDEOGRAPH-187F1
|
||||
18800..18AF2 ; Ideographic # Lo [755] TANGUT COMPONENT-001..TANGUT COMPONENT-755
|
||||
1B170..1B2FB ; Ideographic # Lo [396] NUSHU CHARACTER-1B170..NUSHU CHARACTER-1B2FB
|
||||
20000..2A6D6 ; Ideographic # Lo [42711] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6D6
|
||||
@ -793,7 +808,7 @@ FA70..FAD9 ; Ideographic # Lo [106] CJK COMPATIBILITY IDEOGRAPH-FA70..CJK COM
|
||||
2CEB0..2EBE0 ; Ideographic # Lo [7473] CJK UNIFIED IDEOGRAPH-2CEB0..CJK UNIFIED IDEOGRAPH-2EBE0
|
||||
2F800..2FA1D ; Ideographic # Lo [542] CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPATIBILITY IDEOGRAPH-2FA1D
|
||||
|
||||
# Total code points: 96174
|
||||
# Total code points: 96184
|
||||
|
||||
# ================================================
|
||||
|
||||
@ -953,6 +968,9 @@ FF9E..FF9F ; Diacritic # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFW
|
||||
FFE3 ; Diacritic # Sk FULLWIDTH MACRON
|
||||
102E0 ; Diacritic # Mn COPTIC EPACT THOUSANDS MARK
|
||||
10AE5..10AE6 ; Diacritic # Mn [2] MANICHAEAN ABBREVIATION MARK ABOVE..MANICHAEAN ABBREVIATION MARK BELOW
|
||||
10D22..10D23 ; Diacritic # Lo [2] HANIFI ROHINGYA MARK SAKIN..HANIFI ROHINGYA MARK NA KHONNA
|
||||
10D24..10D27 ; Diacritic # Mn [4] HANIFI ROHINGYA SIGN HARBAHAY..HANIFI ROHINGYA SIGN TASSI
|
||||
10F46..10F50 ; Diacritic # Mn [11] SOGDIAN COMBINING DOT BELOW..SOGDIAN COMBINING STROKE BELOW
|
||||
110B9..110BA ; Diacritic # Mn [2] KAITHI SIGN VIRAMA..KAITHI SIGN NUKTA
|
||||
11133..11134 ; Diacritic # Mn [2] CHAKMA VIRAMA..CHAKMA MAAYYAA
|
||||
11173 ; Diacritic # Mn MAHAJANI SIGN NUKTA
|
||||
@ -973,12 +991,14 @@ FFE3 ; Diacritic # Sk FULLWIDTH MACRON
|
||||
116B6 ; Diacritic # Mc TAKRI SIGN VIRAMA
|
||||
116B7 ; Diacritic # Mn TAKRI SIGN NUKTA
|
||||
1172B ; Diacritic # Mn AHOM SIGN KILLER
|
||||
11839..1183A ; Diacritic # Mn [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
|
||||
11A34 ; Diacritic # Mn ZANABAZAR SQUARE SIGN VIRAMA
|
||||
11A47 ; Diacritic # Mn ZANABAZAR SQUARE SUBJOINER
|
||||
11A99 ; Diacritic # Mn SOYOMBO SUBJOINER
|
||||
11C3F ; Diacritic # Mn BHAIKSUKI SIGN VIRAMA
|
||||
11D42 ; Diacritic # Mn MASARAM GONDI SIGN NUKTA
|
||||
11D44..11D45 ; Diacritic # Mn [2] MASARAM GONDI SIGN HALANTA..MASARAM GONDI VIRAMA
|
||||
11D97 ; Diacritic # Mn GUNJALA GONDI VIRAMA
|
||||
16AF0..16AF4 ; Diacritic # Mn [5] BASSA VAH COMBINING HIGH TONE..BASSA VAH COMBINING HIGH-LOW TONE
|
||||
16F8F..16F92 ; Diacritic # Mn [4] MIAO TONE RIGHT..MIAO TONE BELOW
|
||||
16F93..16F9F ; Diacritic # Lm [13] MIAO LETTER TONE-2..MIAO LETTER REFORMED TONE-8
|
||||
@ -991,7 +1011,7 @@ FFE3 ; Diacritic # Sk FULLWIDTH MACRON
|
||||
1E944..1E946 ; Diacritic # Mn [3] ADLAM ALIF LENGTHENER..ADLAM GEMINATION MARK
|
||||
1E948..1E94A ; Diacritic # Mn [3] ADLAM CONSONANT MODIFIER..ADLAM NUKTA
|
||||
|
||||
# Total code points: 798
|
||||
# Total code points: 818
|
||||
|
||||
# ================================================
|
||||
|
||||
@ -1137,7 +1157,7 @@ E0020..E007F ; Other_Grapheme_Extend # Cf [96] TAG SPACE..CANCEL TAG
|
||||
# ================================================
|
||||
|
||||
3400..4DB5 ; Unified_Ideograph # Lo [6582] CJK UNIFIED IDEOGRAPH-3400..CJK UNIFIED IDEOGRAPH-4DB5
|
||||
4E00..9FEA ; Unified_Ideograph # Lo [20971] CJK UNIFIED IDEOGRAPH-4E00..CJK UNIFIED IDEOGRAPH-9FEA
|
||||
4E00..9FEF ; Unified_Ideograph # Lo [20976] CJK UNIFIED IDEOGRAPH-4E00..CJK UNIFIED IDEOGRAPH-9FEF
|
||||
FA0E..FA0F ; Unified_Ideograph # Lo [2] CJK COMPATIBILITY IDEOGRAPH-FA0E..CJK COMPATIBILITY IDEOGRAPH-FA0F
|
||||
FA11 ; Unified_Ideograph # Lo CJK COMPATIBILITY IDEOGRAPH-FA11
|
||||
FA13..FA14 ; Unified_Ideograph # Lo [2] CJK COMPATIBILITY IDEOGRAPH-FA13..CJK COMPATIBILITY IDEOGRAPH-FA14
|
||||
@ -1151,7 +1171,7 @@ FA27..FA29 ; Unified_Ideograph # Lo [3] CJK COMPATIBILITY IDEOGRAPH-FA27..C
|
||||
2B820..2CEA1 ; Unified_Ideograph # Lo [5762] CJK UNIFIED IDEOGRAPH-2B820..CJK UNIFIED IDEOGRAPH-2CEA1
|
||||
2CEB0..2EBE0 ; Unified_Ideograph # Lo [7473] CJK UNIFIED IDEOGRAPH-2CEB0..CJK UNIFIED IDEOGRAPH-2EBE0
|
||||
|
||||
# Total code points: 87882
|
||||
# Total code points: 87887
|
||||
|
||||
# ================================================
|
||||
|
||||
@ -1255,10 +1275,13 @@ AABB..AABC ; Logical_Order_Exception # Lo [2] TAI VIET VOWEL AUE..TAI VIET
|
||||
002E ; Sentence_Terminal # Po FULL STOP
|
||||
003F ; Sentence_Terminal # Po QUESTION MARK
|
||||
0589 ; Sentence_Terminal # Po ARMENIAN FULL STOP
|
||||
061F ; Sentence_Terminal # Po ARABIC QUESTION MARK
|
||||
061E..061F ; Sentence_Terminal # Po [2] ARABIC TRIPLE DOT PUNCTUATION MARK..ARABIC QUESTION MARK
|
||||
06D4 ; Sentence_Terminal # Po ARABIC FULL STOP
|
||||
0700..0702 ; Sentence_Terminal # Po [3] SYRIAC END OF PARAGRAPH..SYRIAC SUBLINEAR FULL STOP
|
||||
07F9 ; Sentence_Terminal # Po NKO EXCLAMATION MARK
|
||||
0837 ; Sentence_Terminal # Po SAMARITAN PUNCTUATION MELODIC QITSA
|
||||
0839 ; Sentence_Terminal # Po SAMARITAN PUNCTUATION QITSA
|
||||
083D..083E ; Sentence_Terminal # Po [2] SAMARITAN PUNCTUATION SOF MASHFAAT..SAMARITAN PUNCTUATION ANNAAU
|
||||
0964..0965 ; Sentence_Terminal # Po [2] DEVANAGARI DANDA..DEVANAGARI DOUBLE DANDA
|
||||
104A..104B ; Sentence_Terminal # Po [2] MYANMAR SIGN LITTLE SECTION..MYANMAR SIGN SECTION
|
||||
1362 ; Sentence_Terminal # Po ETHIOPIC FULL STOP
|
||||
@ -1296,6 +1319,7 @@ FF0E ; Sentence_Terminal # Po FULLWIDTH FULL STOP
|
||||
FF1F ; Sentence_Terminal # Po FULLWIDTH QUESTION MARK
|
||||
FF61 ; Sentence_Terminal # Po HALFWIDTH IDEOGRAPHIC FULL STOP
|
||||
10A56..10A57 ; Sentence_Terminal # Po [2] KHAROSHTHI PUNCTUATION DANDA..KHAROSHTHI PUNCTUATION DOUBLE DANDA
|
||||
10F55..10F59 ; Sentence_Terminal # Po [5] SOGDIAN PUNCTUATION TWO VERTICAL BARS..SOGDIAN PUNCTUATION HALF CIRCLE WITH DOT
|
||||
11047..11048 ; Sentence_Terminal # Po [2] BRAHMI DANDA..BRAHMI DOUBLE DANDA
|
||||
110BE..110C1 ; Sentence_Terminal # Po [4] KAITHI SECTION MARK..KAITHI DOUBLE DANDA
|
||||
11141..11143 ; Sentence_Terminal # Po [3] CHAKMA DANDA..CHAKMA QUESTION MARK
|
||||
@ -1313,14 +1337,16 @@ FF61 ; Sentence_Terminal # Po HALFWIDTH IDEOGRAPHIC FULL STOP
|
||||
11A42..11A43 ; Sentence_Terminal # Po [2] ZANABAZAR SQUARE MARK SHAD..ZANABAZAR SQUARE MARK DOUBLE SHAD
|
||||
11A9B..11A9C ; Sentence_Terminal # Po [2] SOYOMBO MARK SHAD..SOYOMBO MARK DOUBLE SHAD
|
||||
11C41..11C42 ; Sentence_Terminal # Po [2] BHAIKSUKI DANDA..BHAIKSUKI DOUBLE DANDA
|
||||
11EF7..11EF8 ; Sentence_Terminal # Po [2] MAKASAR PASSIMBANG..MAKASAR END OF SECTION
|
||||
16A6E..16A6F ; Sentence_Terminal # Po [2] MRO DANDA..MRO DOUBLE DANDA
|
||||
16AF5 ; Sentence_Terminal # Po BASSA VAH FULL STOP
|
||||
16B37..16B38 ; Sentence_Terminal # Po [2] PAHAWH HMONG SIGN VOS THOM..PAHAWH HMONG SIGN VOS TSHAB CEEB
|
||||
16B44 ; Sentence_Terminal # Po PAHAWH HMONG SIGN XAUS
|
||||
16E98 ; Sentence_Terminal # Po MEDEFAIDRIN FULL STOP
|
||||
1BC9F ; Sentence_Terminal # Po DUPLOYAN PUNCTUATION CHINOOK FULL STOP
|
||||
1DA88 ; Sentence_Terminal # Po SIGNWRITING FULL STOP
|
||||
|
||||
# Total code points: 128
|
||||
# Total code points: 141
|
||||
|
||||
# ================================================
|
||||
|
||||
@ -1521,14 +1547,10 @@ E0100..E01EF ; Variation_Selector # Mn [240] VARIATION SELECTOR-17..VARIATION S
|
||||
2B74..2B75 ; Pattern_Syntax # Cn [2] <reserved-2B74>..<reserved-2B75>
|
||||
2B76..2B95 ; Pattern_Syntax # So [32] NORTH WEST TRIANGLE-HEADED ARROW TO BAR..RIGHTWARDS BLACK ARROW
|
||||
2B96..2B97 ; Pattern_Syntax # Cn [2] <reserved-2B96>..<reserved-2B97>
|
||||
2B98..2BB9 ; Pattern_Syntax # So [34] THREE-D TOP-LIGHTED LEFTWARDS EQUILATERAL ARROWHEAD..UP ARROWHEAD IN A RECTANGLE BOX
|
||||
2BBA..2BBC ; Pattern_Syntax # Cn [3] <reserved-2BBA>..<reserved-2BBC>
|
||||
2BBD..2BC8 ; Pattern_Syntax # So [12] BALLOT BOX WITH LIGHT X..BLACK MEDIUM RIGHT-POINTING TRIANGLE CENTRED
|
||||
2B98..2BC8 ; Pattern_Syntax # So [49] THREE-D TOP-LIGHTED LEFTWARDS EQUILATERAL ARROWHEAD..BLACK MEDIUM RIGHT-POINTING TRIANGLE CENTRED
|
||||
2BC9 ; Pattern_Syntax # Cn <reserved-2BC9>
|
||||
2BCA..2BD2 ; Pattern_Syntax # So [9] TOP HALF BLACK CIRCLE..GROUP MARK
|
||||
2BD3..2BEB ; Pattern_Syntax # Cn [25] <reserved-2BD3>..<reserved-2BEB>
|
||||
2BEC..2BEF ; Pattern_Syntax # So [4] LEFTWARDS TWO-HEADED ARROW WITH TRIANGLE ARROWHEADS..DOWNWARDS TWO-HEADED ARROW WITH TRIANGLE ARROWHEADS
|
||||
2BF0..2BFF ; Pattern_Syntax # Cn [16] <reserved-2BF0>..<reserved-2BFF>
|
||||
2BCA..2BFE ; Pattern_Syntax # So [53] TOP HALF BLACK CIRCLE..REVERSED RIGHT ANGLE
|
||||
2BFF ; Pattern_Syntax # Cn <reserved-2BFF>
|
||||
2E00..2E01 ; Pattern_Syntax # Po [2] RIGHT ANGLE SUBSTITUTION MARKER..RIGHT ANGLE DOTTED SUBSTITUTION MARKER
|
||||
2E02 ; Pattern_Syntax # Pi LEFT SUBSTITUTION BRACKET
|
||||
2E03 ; Pattern_Syntax # Pf RIGHT SUBSTITUTION BRACKET
|
||||
@ -1566,8 +1588,8 @@ E0100..E01EF ; Variation_Selector # Mn [240] VARIATION SELECTOR-17..VARIATION S
|
||||
2E40 ; Pattern_Syntax # Pd DOUBLE HYPHEN
|
||||
2E41 ; Pattern_Syntax # Po REVERSED COMMA
|
||||
2E42 ; Pattern_Syntax # Ps DOUBLE LOW-REVERSED-9 QUOTATION MARK
|
||||
2E43..2E49 ; Pattern_Syntax # Po [7] DASH WITH LEFT UPTURN..DOUBLE STACKED COMMA
|
||||
2E4A..2E7F ; Pattern_Syntax # Cn [54] <reserved-2E4A>..<reserved-2E7F>
|
||||
2E43..2E4E ; Pattern_Syntax # Po [12] DASH WITH LEFT UPTURN..PUNCTUS ELEVATUS MARK
|
||||
2E4F..2E7F ; Pattern_Syntax # Cn [49] <reserved-2E4F>..<reserved-2E7F>
|
||||
3001..3003 ; Pattern_Syntax # Po [3] IDEOGRAPHIC COMMA..DITTO MARK
|
||||
3008 ; Pattern_Syntax # Ps LEFT ANGLE BRACKET
|
||||
3009 ; Pattern_Syntax # Pe RIGHT ANGLE BRACKET
|
||||
@ -1606,8 +1628,9 @@ FE45..FE46 ; Pattern_Syntax # Po [2] SESAME DOT..WHITE SESAME DOT
|
||||
070F ; Prepended_Concatenation_Mark # Cf SYRIAC ABBREVIATION MARK
|
||||
08E2 ; Prepended_Concatenation_Mark # Cf ARABIC DISPUTED END OF AYAH
|
||||
110BD ; Prepended_Concatenation_Mark # Cf KAITHI NUMBER SIGN
|
||||
110CD ; Prepended_Concatenation_Mark # Cf KAITHI NUMBER SIGN ABOVE
|
||||
|
||||
# Total code points: 10
|
||||
# Total code points: 11
|
||||
|
||||
# ================================================
|
||||
|
||||
|
File diff suppressed because it is too large
Load Diff
Loading…
Reference in New Issue
Block a user