Update the Unicode data files up to v6.1.0
Change-Id: I20b94634b1f4ebff10757c2348cfdbbd906e8797
Reviewed-by: Lars Knoll <lars.knoll@nokia.com>
parent 60e1892d83
commit c9100bcce7
@@ -1,26 +1,33 @@
# ArabicShaping-5.0.0.txt
# Date: 2006-07-14, 11:23:00 PST [KW]
# ArabicShaping-6.1.0.txt
# Date: 2011-04-15, 23:16:00 GMT [KW]
#
# This file is a normative contributory data file in the
# Unicode Character Database.
#
# Copyright (c) 1991-2006 Unicode, Inc.
# Copyright (c) 1991-2011 Unicode, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# This file defines the shaping classes for Arabic and Syriac
# This file defines the Joining_Type and Joining_Group
# property values for Arabic, Syriac, N'Ko, and Mandaic
# positional shaping, repeating in machine readable form the
# information printed in Tables 8-3, 8-7, 8-8, 8-11, 8-12, and
# 8-13 of The Unicode Standard, Version 4.0.
# information exemplified in Tables 8-3, 8-8, 8-9, 8-10, 8-13, 8-14,
# 8-15, 13-5, 14-5, and 14-6 of The Unicode Standard, Version 6.1.
#
# See sections 8.2 and 8.3 of The Unicode Standard, Version 4.0
# for more information.
# See sections 8.2, 8.3, 13.5, and 14.12 of The Unicode Standard,
# Version 6.1 for more information.
#
# Each line contains four fields, separated by a semicolon.
#
# Field 0: the code point, in 4-digit hexadecimal
# form, of an Arabic or Syriac character.
# Field 1: gives a short schematic name for that character,
# abbreviated from the normative Unicode character name.
# form, of an Arabic, Syriac, N'Ko, or Mandaic character.
#
# Field 1: gives a short schematic name for that character.
# The schematic name is descriptive of the shape, based as
# consistently as possible on a name for the skeleton and
# then the diacritic marks applied to the skeleton, if any.
# Note that this schematic name is considered a comment,
# and does not constitute a formal property value.
#
# Field 2: defines the joining type (property name: Joining_Type)
# R Right_Joining
# L Left_Joining
@@ -28,7 +35,8 @@
# C Join_Causing
# U Non_Joining
# T Transparent
# See the Arabic block description for more information on these types.
# See Section 8.2, Arabic for more information on these types.
#
# Field 3: defines the joining group (property name: Joining_Group)
#
# The values of the joining group are based schematically on character
@@ -41,8 +49,8 @@
#
# corresponds to [Joining_Group = Teh_Marbuta].
#
# Note: For historical reasons, the property value [Joining_Group = Hamza_On_Heh_Goal]
# is anachronistically named. It used to apply to both of the following characters
# Note: The property value now designated [Joining_Group = Teh_Marbuta_Goal]
# used to apply to both of the following characters
# in earlier versions of the standard:
#
# U+06C2 ARABIC LETTER HEH GOAL WITH HAMZA ABOVE
@@ -50,58 +58,76 @@
#
# However, it currently applies only to U+06C3, and *not* to U+06C2.
# To avoid destabilizing existing Joining_Group property aliases, the
# value Hamza_On_Heh_Goal has not been changed, despite the fact that it
# no longer applies to Hamza On Heh Goal, but only to Teh Marbuta Goal.
# prior Joining_Group value for U+06C3 (Hamza_On_Heh_Goal) has been
# retained as a property value alias, despite the fact that it
# no longer applies to its namesake character, U+06C2.
# See PropertyValueAliases.txt.
#
# When other cursive scripts are added to the Unicode Standard in
# the future, the joining group value of all its letters will default
# to jg=No_Joining_Group in this data file. Other, more specific
# joining group values will be defined only if an explicit proposal
# to define those values exactly has been approved by the UTC. This
# is the convention exemplified by the N'Ko and Mandaic scripts. Only the Arabic
# and Syriac scripts currently have explicit joining group values defined.
#
# Note: Code points that are not explicitly listed in this file are
# either of joining type T or U:
#
# - Those that not explicitly listed that are of General Category Mn, Me, or Cf
# have joining type T.
# - All others not explicitly listed have type U.
# - All others not explicitly listed have joining type U.
#
# For an explicit listing of characters of joining type T, see
# the derived property file DerivedJoiningType.txt.
#
# There are currently no characters of type L defined in Unicode.
# There are currently no characters of joining type L defined in Unicode.
#
# #############################################################

# Unicode; Schematic Name; Joining Type; Joining Group

# Arabic characters
# Arabic Characters

0600; ARABIC NUMBER SIGN; U; No_Joining_Group
0601; ARABIC SIGN SANAH; U; No_Joining_Group
0602; ARABIC FOOTNOTE MARKER; U; No_Joining_Group
0603; ARABIC SIGN SAFHA; U; No_Joining_Group
0604; ARABIC SIGN SAMVAT; U; No_Joining_Group
0608; ARABIC RAY; U; No_Joining_Group
060B; AFGHANI SIGN; U; No_Joining_Group
0620; DOTLESS YEH WITH SEPARATE RING BELOW; D; YEH
0621; HAMZA; U; No_Joining_Group
0622; MADDA ON ALEF; R; ALEF
0623; HAMZA ON ALEF; R; ALEF
0624; HAMZA ON WAW; R; WAW
0625; HAMZA UNDER ALEF; R; ALEF
0626; HAMZA ON YEH; D; YEH
0622; ALEF WITH MADDA ABOVE; R; ALEF
0623; ALEF WITH HAMZA ABOVE; R; ALEF
0624; WAW WITH HAMZA ABOVE; R; WAW
0625; ALEF WITH HAMZA BELOW; R; ALEF
0626; DOTLESS YEH WITH HAMZA ABOVE; D; YEH
0627; ALEF; R; ALEF
0628; BEH; D; BEH
0629; TEH MARBUTA; R; TEH MARBUTA
062A; TEH; D; BEH
062B; THEH; D; BEH
062C; JEEM; D; HAH
062A; DOTLESS BEH WITH 2 DOTS ABOVE; D; BEH
062B; DOTLESS BEH WITH 3 DOTS ABOVE; D; BEH
062C; HAH WITH DOT BELOW; D; HAH
062D; HAH; D; HAH
062E; KHAH; D; HAH
062E; HAH WITH DOT ABOVE; D; HAH
062F; DAL; R; DAL
0630; THAL; R; DAL
0630; DAL WITH DOT ABOVE; R; DAL
0631; REH; R; REH
0632; ZAIN; R; REH
0632; REH WITH DOT ABOVE; R; REH
0633; SEEN; D; SEEN
0634; SHEEN; D; SEEN
0634; SEEN WITH 3 DOTS ABOVE; D; SEEN
0635; SAD; D; SAD
0636; DAD; D; SAD
0636; SAD WITH DOT ABOVE; D; SAD
0637; TAH; D; TAH
0638; ZAH; D; TAH
0638; TAH WITH DOT ABOVE; D; TAH
0639; AIN; D; AIN
063A; GHAIN; D; AIN
063A; AIN WITH DOT ABOVE; D; AIN
063B; KEHEH WITH 2 DOTS ABOVE; D; GAF
063C; KEHEH WITH 3 DOTS BELOW; D; GAF
063D; FARSI YEH WITH INVERTED V ABOVE; D; FARSI YEH
063E; FARSI YEH WITH 2 DOTS ABOVE; D; FARSI YEH
063F; FARSI YEH WITH 3 DOTS ABOVE; D; FARSI YEH
0640; TATWEEL; C; No_Joining_Group
0641; FEH; D; FEH
0642; QAF; D; QAF
@@ -111,48 +137,48 @@
0646; NOON; D; NOON
0647; HEH; D; HEH
0648; WAW; R; WAW
0649; ALEF MAKSURA; D; YEH
0649; DOTLESS YEH; D; YEH
064A; YEH; D; YEH
066E; DOTLESS BEH; D; BEH
066F; DOTLESS QAF; D; QAF
0671; HAMZAT WASL ON ALEF; R; ALEF
0672; WAVY HAMZA ON ALEF; R; ALEF
0673; WAVY HAMZA UNDER ALEF; R; ALEF
0671; ALEF WITH WASLA ABOVE; R; ALEF
0672; ALEF WITH WAVY HAMZA ABOVE; R; ALEF
0673; ALEF WITH WAVY HAMZA BELOW; R; ALEF
0674; HIGH HAMZA; U; No_Joining_Group
0675; HIGH HAMZA ALEF; R; ALEF
0676; HIGH HAMZA WAW; R; WAW
0677; HIGH HAMZA WAW WITH DAMMA; R; WAW
0678; HIGH HAMZA YEH; D; YEH
0679; TEH WITH SMALL TAH; D; BEH
067A; TEH WITH 2 DOTS VERTICAL ABOVE; D; BEH
067B; BEH WITH 2 DOTS VERTICAL BELOW; D; BEH
067C; TEH WITH RING; D; BEH
067D; TEH WITH 3 DOTS ABOVE DOWNWARD; D; BEH
067E; TEH WITH 3 DOTS BELOW; D; BEH
067F; TEH WITH 4 DOTS ABOVE; D; BEH
0680; BEH WITH 4 DOTS BELOW; D; BEH
0681; HAMZA ON HAH; D; HAH
0682; HAH WITH 2 DOTS VERTICAL ABOVE; D; HAH
0683; HAH WITH MIDDLE 2 DOTS; D; HAH
0684; HAH WITH MIDDLE 2 DOTS VERTICAL; D; HAH
0677; HIGH HAMZA WAW WITH DAMMA ABOVE; R; WAW
0678; HIGH HAMZA DOTLESS YEH; D; YEH
0679; DOTLESS BEH WITH TAH ABOVE; D; BEH
067A; DOTLESS BEH WITH VERTICAL 2 DOTS ABOVE; D; BEH
067B; DOTLESS BEH WITH VERTICAL 2 DOTS BELOW; D; BEH
067C; DOTLESS BEH WITH ATTACHED RING BELOW AND 2 DOTS ABOVE; D; BEH
067D; DOTLESS BEH WITH INVERTED 3 DOTS ABOVE; D; BEH
067E; DOTLESS BEH WITH 3 DOTS BELOW; D; BEH
067F; DOTLESS BEH WITH 4 DOTS ABOVE; D; BEH
0680; DOTLESS BEH WITH 4 DOTS BELOW; D; BEH
0681; HAH WITH HAMZA ABOVE; D; HAH
0682; HAH WITH VERTICAL 2 DOTS ABOVE; D; HAH
0683; HAH WITH 2 DOTS BELOW; D; HAH
0684; HAH WITH VERTICAL 2 DOTS BELOW; D; HAH
0685; HAH WITH 3 DOTS ABOVE; D; HAH
0686; HAH WITH MIDDLE 3 DOTS DOWNWARD; D; HAH
0687; HAH WITH MIDDLE 4 DOTS; D; HAH
0688; DAL WITH SMALL TAH; R; DAL
0689; DAL WITH RING; R; DAL
0686; HAH WITH 3 DOTS BELOW; D; HAH
0687; HAH WITH 4 DOTS BELOW; D; HAH
0688; DAL WITH TAH ABOVE; R; DAL
0689; DAL WITH ATTACHED RING BELOW; R; DAL
068A; DAL WITH DOT BELOW; R; DAL
068B; DAL WITH DOT BELOW AND SMALL TAH; R; DAL
068B; DAL WITH DOT BELOW AND TAH ABOVE; R; DAL
068C; DAL WITH 2 DOTS ABOVE; R; DAL
068D; DAL WITH 2 DOTS BELOW; R; DAL
068E; DAL WITH 3 DOTS ABOVE; R; DAL
068F; DAL WITH 3 DOTS ABOVE DOWNWARD; R; DAL
068F; DAL WITH INVERTED 3 DOTS ABOVE; R; DAL
0690; DAL WITH 4 DOTS ABOVE; R; DAL
0691; REH WITH SMALL TAH; R; REH
0692; REH WITH SMALL V; R; REH
0693; REH WITH RING; R; REH
0691; REH WITH TAH ABOVE; R; REH
0692; REH WITH V ABOVE; R; REH
0693; REH WITH ATTACHED RING BELOW; R; REH
0694; REH WITH DOT BELOW; R; REH
0695; REH WITH SMALL V BELOW; R; REH
0696; REH WITH DOT BELOW AND DOT ABOVE; R; REH
0695; REH WITH V BELOW; R; REH
0696; REH WITH DOT BELOW AND DOT WITHIN; R; REH
0697; REH WITH 2 DOTS ABOVE; R; REH
0698; REH WITH 3 DOTS ABOVE; R; REH
0699; REH WITH 4 DOTS ABOVE; R; REH
@@ -164,66 +190,66 @@
069F; TAH WITH 3 DOTS ABOVE; D; TAH
06A0; AIN WITH 3 DOTS ABOVE; D; AIN
06A1; DOTLESS FEH; D; FEH
06A2; FEH WITH DOT MOVED BELOW; D; FEH
06A2; DOTLESS FEH WITH DOT BELOW; D; FEH
06A3; FEH WITH DOT BELOW; D; FEH
06A4; FEH WITH 3 DOTS ABOVE; D; FEH
06A5; FEH WITH 3 DOTS BELOW; D; FEH
06A6; FEH WITH 4 DOTS ABOVE; D; FEH
06A7; QAF WITH DOT ABOVE; D; QAF
06A8; QAF WITH 3 DOTS ABOVE; D; QAF
06A4; DOTLESS FEH WITH 3 DOTS ABOVE; D; FEH
06A5; DOTLESS FEH WITH 3 DOTS BELOW; D; FEH
06A6; DOTLESS FEH WITH 4 DOTS ABOVE; D; FEH
06A7; DOTLESS QAF WITH DOT ABOVE; D; QAF
06A8; DOTLESS QAF WITH 3 DOTS ABOVE; D; QAF
06A9; KEHEH; D; GAF
06AA; SWASH KAF; D; SWASH KAF
06AB; KAF WITH RING; D; GAF
06AB; KEHEH WITH ATTACHED RING BELOW; D; GAF
06AC; KAF WITH DOT ABOVE; D; KAF
06AD; KAF WITH 3 DOTS ABOVE; D; KAF
06AE; KAF WITH 3 DOTS BELOW; D; KAF
06AF; GAF; D; GAF
06B0; GAF WITH RING; D; GAF
06B0; GAF WITH ATTACHED RING BELOW; D; GAF
06B1; GAF WITH 2 DOTS ABOVE; D; GAF
06B2; GAF WITH 2 DOTS BELOW; D; GAF
06B3; GAF WITH 2 DOTS VERTICAL BELOW; D; GAF
06B3; GAF WITH VERTICAL 2 DOTS BELOW; D; GAF
06B4; GAF WITH 3 DOTS ABOVE; D; GAF
06B5; LAM WITH SMALL V; D; LAM
06B5; LAM WITH V ABOVE; D; LAM
06B6; LAM WITH DOT ABOVE; D; LAM
06B7; LAM WITH 3 DOTS ABOVE; D; LAM
06B8; LAM WITH 3 DOTS BELOW; D; LAM
06B9; NOON WITH DOT BELOW; D; NOON
06BA; DOTLESS NOON; D; NOON
06BB; DOTLESS NOON WITH SMALL TAH; D; NOON
06BC; NOON WITH RING; D; NOON
06BD; NOON WITH 3 DOTS ABOVE; D; NOON
06BB; DOTLESS NOON WITH TAH ABOVE; D; NOON
06BC; NOON WITH ATTACHED RING BELOW; D; NOON
06BD; NYA; D; NYA
06BE; KNOTTED HEH; D; KNOTTED HEH
06BF; HAH WITH MIDDLE 3 DOTS DOWNWARD AND DOT ABOVE; D; HAH
06C0; HAMZA ON HEH; R; TEH MARBUTA
06BF; HAH WITH 3 DOTS BELOW AND DOT ABOVE; D; HAH
06C0; DOTLESS TEH MARBUTA WITH HAMZA ABOVE; R; TEH MARBUTA
06C1; HEH GOAL; D; HEH GOAL
06C2; HAMZA ON HEH GOAL; D; HEH GOAL
06C3; TEH MARBUTA GOAL; R; HAMZA ON HEH GOAL
06C4; WAW WITH RING; R; WAW
06C2; HEH GOAL WITH HAMZA ABOVE; D; HEH GOAL
06C3; TEH MARBUTA GOAL; R; TEH MARBUTA GOAL
06C4; WAW WITH ATTACHED RING WITHIN; R; WAW
06C5; WAW WITH BAR; R; WAW
06C6; WAW WITH SMALL V; R; WAW
06C7; WAW WITH DAMMA; R; WAW
06C6; WAW WITH V ABOVE; R; WAW
06C7; WAW WITH DAMMA ABOVE; R; WAW
06C8; WAW WITH ALEF ABOVE; R; WAW
06C9; WAW WITH INVERTED SMALL V; R; WAW
06C9; WAW WITH INVERTED V ABOVE; R; WAW
06CA; WAW WITH 2 DOTS ABOVE; R; WAW
06CB; WAW WITH 3 DOTS ABOVE; R; WAW
06CC; DOTLESS YEH; D; YEH
06CC; FARSI YEH; D; FARSI YEH
06CD; YEH WITH TAIL; R; YEH WITH TAIL
06CE; YEH WITH SMALL V; D; YEH
06CE; FARSI YEH WITH V ABOVE; D; FARSI YEH
06CF; WAW WITH DOT ABOVE; R; WAW
06D0; YEH WITH 2 DOTS VERTICAL BELOW; D; YEH
06D1; YEH WITH 3 DOTS BELOW; D; YEH
06D0; DOTLESS YEH WITH VERTICAL 2 DOTS BELOW; D; YEH
06D1; DOTLESS YEH WITH 3 DOTS BELOW; D; YEH
06D2; YEH BARREE; R; YEH BARREE
06D3; HAMZA ON YEH BARREE; R; YEH BARREE
06D5; AE; R; TEH MARBUTA
06D3; YEH BARREE WITH HAMZA ABOVE; R; YEH BARREE
06D5; DOTLESS TEH MARBUTA; R; TEH MARBUTA
06DD; ARABIC END OF AYAH; U; No_Joining_Group
06EE; DAL WITH INVERTED V; R; DAL
06EF; REH WITH INVERTED V; R; REH
06EE; DAL WITH INVERTED V ABOVE; R; DAL
06EF; REH WITH INVERTED V ABOVE; R; REH
06FA; SEEN WITH DOT BELOW AND 3 DOTS ABOVE; D; SEEN
06FB; DAD WITH DOT BELOW; D; SAD
06FC; GHAIN WITH DOT BELOW; D; AIN
06FF; HEH WITH INVERTED V; D; KNOTTED HEH
06FB; SAD WITH DOT BELOW AND DOT ABOVE; D; SAD
06FC; AIN WITH DOT BELOW AND DOT ABOVE; D; AIN
06FF; KNOTTED HEH WITH INVERTED V ABOVE; D; KNOTTED HEH

# Syriac characters
# Syriac Characters

0710; ALAPH; R; ALAPH
0712; BETH; D; BETH
@@ -260,38 +286,56 @@
074E; SOGDIAN KHAPH; D; KHAPH
074F; SOGDIAN FE; D; FE

# Arabic supplement characters
# Arabic Supplement Characters

0750; BEH WITH 3 DOTS HORIZONTALLY BELOW; D; BEH
0751; BEH WITH DOT BELOW AND 3 DOTS ABOVE; D; BEH
0752; BEH WITH 3 DOTS POINTING UPWARDS BELOW; D; BEH
0753; BEH WITH 3 DOTS POINTING UPWARDS BELOW AND 2 DOTS ABOVE; D; BEH
0754; BEH WITH 2 DOTS BELOW AND DOT ABOVE; D; BEH
0755; BEH WITH INVERTED SMALL V BELOW; D; BEH
0756; BEH WITH SMALL V; D; BEH
0750; DOTLESS BEH WITH HORIZONTAL 3 DOTS BELOW; D; BEH
0751; BEH WITH 3 DOTS ABOVE; D; BEH
0752; DOTLESS BEH WITH INVERTED 3 DOTS BELOW; D; BEH
0753; DOTLESS BEH WITH INVERTED 3 DOTS BELOW AND 2 DOTS ABOVE; D; BEH
0754; DOTLESS BEH WITH 2 DOTS BELOW AND DOT ABOVE; D; BEH
0755; DOTLESS BEH WITH INVERTED V BELOW; D; BEH
0756; DOTLESS BEH WITH V ABOVE; D; BEH
0757; HAH WITH 2 DOTS ABOVE; D; HAH
0758; HAH WITH 3 DOTS POINTING UPWARDS BELOW; D; HAH
0759; DAL WITH 2 DOTS VERTICALLY BELOW AND SMALL TAH; R; DAL
075A; DAL WITH INVERTED SMALL V BELOW; R; DAL
075B; REH WITH STROKE; R; REH
0758; HAH WITH INVERTED 3 DOTS BELOW; D; HAH
0759; DAL WITH VERTICAL 2 DOTS BELOW AND TAH ABOVE; R; DAL
075A; DAL WITH INVERTED V BELOW; R; DAL
075B; REH WITH BAR; R; REH
075C; SEEN WITH 4 DOTS ABOVE; D; SEEN
075D; AIN WITH 2 DOTS ABOVE; D; AIN
075E; AIN WITH 3 DOTS POINTING DOWNWARDS ABOVE; D; AIN
075F; AIN WITH 2 DOTS VERTICALLY ABOVE; D; AIN
0760; FEH WITH 2 DOTS BELOW; D; FEH
0761; FEH WITH 3 DOTS POINTING UPWARDS BELOW; D; FEH
075E; AIN WITH INVERTED 3 DOTS ABOVE; D; AIN
075F; AIN WITH VERTICAL 2 DOTS ABOVE; D; AIN
0760; DOTLESS FEH WITH 2 DOTS BELOW; D; FEH
0761; DOTLESS FEH WITH INVERTED 3 DOTS BELOW; D; FEH
0762; KEHEH WITH DOT ABOVE; D; GAF
0763; KEHEH WITH 3 DOTS ABOVE; D; GAF
0764; KEHEH WITH 3 DOTS POINTING UPWARDS BELOW; D; GAF
0764; KEHEH WITH INVERTED 3 DOTS BELOW; D; GAF
0765; MEEM WITH DOT ABOVE; D; MEEM
0766; MEEM WITH DOT BELOW; D; MEEM
0767; NOON WITH 2 DOTS BELOW; D; NOON
0768; NOON WITH SMALL TAH; D; NOON
0769; NOON WITH SMALL V; D; NOON
0768; NOON WITH TAH ABOVE; D; NOON
0769; NOON WITH V ABOVE; D; NOON
076A; LAM WITH BAR; D; LAM
076B; REH WITH 2 DOTS VERTICALLY ABOVE; R; REH
076B; REH WITH VERTICAL 2 DOTS ABOVE; R; REH
076C; REH WITH HAMZA ABOVE; R; REH
076D; SEEN WITH 2 DOTS VERTICALLY ABOVE; D; SEEN
076D; SEEN WITH VERTICAL 2 DOTS ABOVE; D; SEEN
076E; HAH WITH TAH BELOW; D; HAH
076F; HAH WITH TAH AND 2 DOTS BELOW; D; HAH
0770; SEEN WITH 2 DOTS AND TAH ABOVE; D; SEEN
0771; REH WITH 2 DOTS AND TAH ABOVE; R; REH
0772; HAH WITH TAH ABOVE; D; HAH
0773; ALEF WITH DIGIT TWO ABOVE; R; ALEF
0774; ALEF WITH DIGIT THREE ABOVE; R; ALEF
0775; FARSI YEH WITH DIGIT TWO ABOVE; D; FARSI YEH
0776; FARSI YEH WITH DIGIT THREE ABOVE; D; FARSI YEH
0777; DOTLESS YEH WITH DIGIT FOUR BELOW; D; YEH
0778; WAW WITH DIGIT TWO ABOVE; R; WAW
0779; WAW WITH DIGIT THREE ABOVE; R; WAW
077A; BURUSHASKI YEH BARREE WITH DIGIT TWO ABOVE; D; BURUSHASKI YEH BARREE
077B; BURUSHASKI YEH BARREE WITH DIGIT THREE ABOVE; D; BURUSHASKI YEH BARREE
077C; HAH WITH DIGIT FOUR BELOW; D; HAH
077D; SEEN WITH DIGIT FOUR ABOVE; D; SEEN
077E; SEEN WITH INVERTED V ABOVE; D; SEEN
077F; KAF WITH 2 DOTS ABOVE; D; KAF

# N'Ko Characters

@@ -330,9 +374,52 @@
07EA; NKO JONA RA; D; No_Joining_Group
07FA; NKO LAJANYALAN; C; No_Joining_Group

# Mandaic Characters

0840; MANDAIC HALQA; R; No_Joining_Group
0841; MANDAIC AB; D; No_Joining_Group
0842; MANDAIC AG; D; No_Joining_Group
0843; MANDAIC AD; D; No_Joining_Group
0844; MANDAIC AH; D; No_Joining_Group
0845; MANDAIC USHENNA; D; No_Joining_Group
0846; MANDAIC AZ; R; No_Joining_Group
0847; MANDAIC IT; D; No_Joining_Group
0848; MANDAIC ATT; D; No_Joining_Group
0849; MANDAIC AKSA; R; No_Joining_Group
084A; MANDAIC AK; D; No_Joining_Group
084B; MANDAIC AL; D; No_Joining_Group
084C; MANDAIC AM; D; No_Joining_Group
084D; MANDAIC AN; D; No_Joining_Group
084E; MANDAIC AS; D; No_Joining_Group
084F; MANDAIC IN; R; No_Joining_Group
0850; MANDAIC AP; D; No_Joining_Group
0851; MANDAIC ASZ; D; No_Joining_Group
0852; MANDAIC AQ; D; No_Joining_Group
0853; MANDAIC AR; D; No_Joining_Group
0854; MANDAIC ASH; R; No_Joining_Group
0855; MANDAIC AT; D; No_Joining_Group
0856; MANDAIC DUSHENNA; U; No_Joining_Group
0857; MANDAIC KAD; U; No_Joining_Group
0858; MANDAIC AIN; U; No_Joining_Group

# Arabic Extended-A Characters

08A0; DOTLESS BEH WITH V BELOW; D; BEH
08A2; HAH WITH DOT BELOW AND 2 DOTS ABOVE; D; HAH
08A3; TAH WITH 2 DOTS ABOVE; D; TAH
08A4; DOTLESS FEH WITH DOT BELOW AND 3 DOTS ABOVE; D; FEH
08A5; QAF WITH DOT BELOW; D; QAF
08A6; LAM WITH DOUBLE BAR; D; LAM
08A7; MEEM WITH 3 DOTS ABOVE; D; MEEM
08A8; YEH WITH HAMZA ABOVE; D; YEH
08A9; YEH WITH DOT ABOVE; D; YEH
08AA; REH WITH LOOP; R; REH
08AB; WAW WITH DOT WITHIN; R; WAW
08AC; ROHINGYA YEH; R; ROHINGYA YEH

# Other

200D; ZERO WIDTH JOINER; C; No_Joining_Group
200C; ZERO WIDTH NON-JOINER; U; No_Joining_Group
200D; ZERO WIDTH JOINER; C; No_Joining_Group

# EOF
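The four-field record layout and the default rule for unlisted code points described in the header comments above can be illustrated with a minimal Python sketch. This is not part of the commit: the function names `parse_arabic_shaping` and `joining_type` and the `SAMPLE` text are hypothetical, and the General Category lookup uses Python's `unicodedata` module, whose Unicode version will generally differ from 6.1.

```python
import unicodedata

# A few records in the ArabicShaping.txt layout (illustrative sample only).
SAMPLE = """\
# Unicode; Schematic Name; Joining Type; Joining Group
0627; ALEF; R; ALEF
0628; BEH; D; BEH
0640; TATWEEL; C; No_Joining_Group
0840; MANDAIC HALQA; R; No_Joining_Group
"""


def parse_arabic_shaping(text):
    """Map code point -> (schematic name, joining type, joining group)."""
    table = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        # Four fields, separated by semicolons.
        code, name, jtype, jgroup = (field.strip() for field in line.split(";"))
        table[int(code, 16)] = (name, jtype, jgroup)
    return table


def joining_type(table, cp):
    """Listed value if present; otherwise T for Mn/Me/Cf, else U."""
    if cp in table:
        return table[cp][1]
    # Assumption: the host Python's Unicode tables stand in for UCD 6.1.
    if unicodedata.category(chr(cp)) in ("Mn", "Me", "Cf"):
        return "T"
    return "U"
```

With the sample loaded via `table = parse_arabic_shaping(SAMPLE)`, a listed letter such as U+0628 reports its listed type, while an unlisted combining mark falls through to the General Category rule.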
@@ -1,18 +1,19 @@
# BidiMirroring-5.0.0.txt
# Date: 2007-08-22, 14:40:00 PDT [KW]
# BidiMirroring-6.1.0.txt
# Date: 2011-12-20, 19:31:00 GMT [KW, LI]
#
# Bidi_Mirroring_Glyph Property (Corrigendum #6 applied)
# Bidi_Mirroring_Glyph Property
#
# This file is an informative contributory data file in the
# Unicode Character Database.
#
# Copyright (c) 1991-2007 Unicode, Inc.
# Copyright (c) 1991-2011 Unicode, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# This data file lists characters that have the mirrored property
# where there is another Unicode character that typically has a glyph
# This data file lists characters that have the Bidi_Mirrored=Yes property
# value, for which there is another Unicode character that typically has a glyph
# that is the mirror image of the original character's glyph.
# The repertoire covered by the file is Unicode 5.0.0.
#
# The repertoire covered by the file is Unicode 6.1.0.
#
# The file contains a list of lines with mappings from one code point
# to another one for character-based mirroring.
@@ -25,16 +26,36 @@
# variable-length hexadecimal value with 4 to 6 digits.
# A comment indicates where the characters are "BEST FIT" mirroring.
#
# Code points with the "mirrored" property but no appropriate mirrors are
# Code points for which Bidi_Mirrored=Yes, but for which no appropriate
# characters exist with mirrored glyphs, are
# listed as comments at the end of the file.
#
# Note: (2011-12-19) There is an inconsistency between the
# following statement about the default value
# of the Bidi_Mirroring_Glyph property and the
# value of the @missing line for Bidi_Mirroring_Glyph in
# PropertyValueAliases.txt. This inconsistency was discovered too
# late in the release process to be resolved by
# the UTC. The inconsistency will be resolved in a future revision.
#
# Formally, the default value of the Bidi_Mirroring_Glyph property
# for each code point is the code point itself, unless a mapping to
# some other character is specified in this data file. When a code
# point has the default value for the Bidi_Mirroring_Glyph property,
# that means that no other character exists whose glyph is suitable
# for character-based mirroring.
#
# For information on bidi mirroring, see UAX #9: Bidirectional Algorithm,
# at http://www.unicode.org/unicode/reports/tr9/
#
# This file was originally created by Markus Scherer.
# Extended for Unicode 3.2, 4.0, 4.1, and 5.0 by Ken Whistler.
# Extended for Unicode 3.2, 4.0, 4.1, 5.0, 5.1, 5.2, and 6.0 by Ken Whistler,
# and for Unicode 6.1 by Ken Whistler and Laurentiu Iancu.
#
# ############################################################
#
# Property: Bidi_Mirroring_Glyph
#

0028; 0029 # LEFT PARENTHESIS
0029; 0028 # RIGHT PARENTHESIS
@@ -92,8 +113,8 @@
2275; 2274 # [BEST FIT] NEITHER GREATER-THAN NOR EQUIVALENT TO
2276; 2277 # LESS-THAN OR GREATER-THAN
2277; 2276 # GREATER-THAN OR LESS-THAN
2278; 2279 # NEITHER LESS-THAN NOR GREATER-THAN
2279; 2278 # NEITHER GREATER-THAN NOR LESS-THAN
2278; 2279 # [BEST FIT] NEITHER LESS-THAN NOR GREATER-THAN
2279; 2278 # [BEST FIT] NEITHER GREATER-THAN NOR LESS-THAN
227A; 227B # PRECEDES
227B; 227A # SUCCEEDS
227C; 227D # PRECEDES OR EQUAL TO
@@ -198,6 +219,10 @@
27C4; 27C3 # OPEN SUPERSET
27C5; 27C6 # LEFT S-SHAPED BAG DELIMITER
27C6; 27C5 # RIGHT S-SHAPED BAG DELIMITER
27C8; 27C9 # REVERSE SOLIDUS PRECEDING SUBSET
27C9; 27C8 # SUPERSET PRECEDING SOLIDUS
27CB; 27CD # MATHEMATICAL RISING DIAGONAL
27CD; 27CB # MATHEMATICAL FALLING DIAGONAL
27D5; 27D6 # LEFT OUTER JOIN
27D6; 27D5 # RIGHT OUTER JOIN
27DD; 27DE # LONG RIGHT TACK
@@ -212,6 +237,10 @@
27E9; 27E8 # MATHEMATICAL RIGHT ANGLE BRACKET
27EA; 27EB # MATHEMATICAL LEFT DOUBLE ANGLE BRACKET
27EB; 27EA # MATHEMATICAL RIGHT DOUBLE ANGLE BRACKET
27EC; 27ED # MATHEMATICAL LEFT WHITE TORTOISE SHELL BRACKET
27ED; 27EC # MATHEMATICAL RIGHT WHITE TORTOISE SHELL BRACKET
27EE; 27EF # MATHEMATICAL LEFT FLATTENED PARENTHESIS
27EF; 27EE # MATHEMATICAL RIGHT FLATTENED PARENTHESIS
2983; 2984 # LEFT WHITE CURLY BRACKET
2984; 2983 # RIGHT WHITE CURLY BRACKET
2985; 2986 # LEFT WHITE PARENTHESIS
@@ -344,6 +373,16 @@
2E0D; 2E0C # RIGHT RAISED OMISSION BRACKET
2E1C; 2E1D # LEFT LOW PARAPHRASE BRACKET
2E1D; 2E1C # RIGHT LOW PARAPHRASE BRACKET
2E20; 2E21 # LEFT VERTICAL BAR WITH QUILL
2E21; 2E20 # RIGHT VERTICAL BAR WITH QUILL
2E22; 2E23 # TOP LEFT HALF BRACKET
2E23; 2E22 # TOP RIGHT HALF BRACKET
2E24; 2E25 # BOTTOM LEFT HALF BRACKET
2E25; 2E24 # BOTTOM RIGHT HALF BRACKET
2E26; 2E27 # LEFT SIDEWAYS U BRACKET
2E27; 2E26 # RIGHT SIDEWAYS U BRACKET
2E28; 2E29 # LEFT DOUBLE PARENTHESIS
2E29; 2E28 # RIGHT DOUBLE PARENTHESIS
3008; 3009 # LEFT ANGLE BRACKET
3009; 3008 # RIGHT ANGLE BRACKET
300A; 300B # LEFT DOUBLE ANGLE BRACKET
@@ -449,6 +488,7 @@ FF63; FF62 # [BEST FIT] HALFWIDTH RIGHT CORNER BRACKET
# 2320; TOP HALF INTEGRAL
# 2321; BOTTOM HALF INTEGRAL
# 27C0; THREE DIMENSIONAL ANGLE
# 27CC; LONG DIVISION
# 27D3; LOWER RIGHT CORNER WITH DOT
# 27D4; UPPER LEFT CORNER WITH DOT
# 27DC; LEFT MULTIMAP
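As a rough illustration of the mapping format and of the "default is the code point itself" statement in the header above, here is a small Python sketch. The names `parse_bidi_mirroring` and `mirroring_glyph` and the `SAMPLE` text are hypothetical and not taken from this commit. The sketch follows the header's formal statement about the default; the header itself notes that this statement is inconsistent with the @missing line in PropertyValueAliases.txt.

```python
# A few records in the BidiMirroring.txt layout (illustrative sample only).
SAMPLE = """\
0028; 0029 # LEFT PARENTHESIS
0029; 0028 # RIGHT PARENTHESIS
2278; 2279 # [BEST FIT] NEITHER LESS-THAN NOR GREATER-THAN
# 2320; TOP HALF INTEGRAL
"""


def parse_bidi_mirroring(text):
    """Return (mirrors, best_fit): the mapping and the set of best-fit sources."""
    mirrors = {}
    best_fit = set()
    for line in text.splitlines():
        data, _, comment = line.partition("#")
        data = data.strip()
        if not data:
            continue  # comment-only lines, e.g. characters without a mirror
        source, target = (int(field.strip(), 16) for field in data.split(";"))
        mirrors[source] = target
        if "[BEST FIT]" in comment:
            best_fit.add(source)
    return mirrors, best_fit


def mirroring_glyph(mirrors, cp):
    """Mapped code point, or cp itself when the file lists no mapping."""
    return mirrors.get(cp, cp)
```

Keeping the best-fit sources in a separate set lets a caller decide whether an approximate mirror is acceptable without reparsing the comments.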
@@ -1,10 +1,10 @@
# Blocks-5.0.0.txt
# Date: 2006-02-15, 15:40:00 [KW]
# Blocks-6.1.0.txt
# Date: 2011-06-14, 18:26:00 GMT [KW, LI]
#
# Unicode Character Database
# Copyright (c) 1991-2006 Unicode, Inc.
# Copyright (c) 1991-2011 Unicode, Inc.
# For terms of use, see http://www.unicode.org/terms_of_use.html
# For documentation, see UCD.html
# For documentation, see http://www.unicode.org/reports/tr44/
#
# Note: The casing of block names is not normative.
# For example, "Basic Latin" and "BASIC LATIN" are equivalent.
@@ -18,7 +18,7 @@
# and underbars are ignored.
# For example, "Latin Extended-A" and "latin extended a" are equivalent.
# For more information on the comparison of property values,
# see UCD.html.
# see UAX #44: http://www.unicode.org/reports/tr44/
#
# All code points not explicitly listed for Block
# have the value No_Block.
@@ -44,6 +44,9 @@
0750..077F; Arabic Supplement
0780..07BF; Thaana
07C0..07FF; NKo
0800..083F; Samaritan
0840..085F; Mandaic
08A0..08FF; Arabic Extended-A
0900..097F; Devanagari
0980..09FF; Bengali
0A00..0A7F; Gurmukhi
@@ -72,12 +75,20 @@
1760..177F; Tagbanwa
1780..17FF; Khmer
1800..18AF; Mongolian
18B0..18FF; Unified Canadian Aboriginal Syllabics Extended
1900..194F; Limbu
1950..197F; Tai Le
1980..19DF; New Tai Lue
19E0..19FF; Khmer Symbols
1A00..1A1F; Buginese
1A20..1AAF; Tai Tham
1B00..1B7F; Balinese
1B80..1BBF; Sundanese
1BC0..1BFF; Batak
1C00..1C4F; Lepcha
1C50..1C7F; Ol Chiki
1CC0..1CCF; Sundanese Supplement
1CD0..1CFF; Vedic Extensions
1D00..1D7F; Phonetic Extensions
1D80..1DBF; Phonetic Extensions Supplement
1DC0..1DFF; Combining Diacritical Marks Supplement
@@ -113,6 +124,7 @@
2D00..2D2F; Georgian Supplement
2D30..2D7F; Tifinagh
2D80..2DDF; Ethiopic Extended
2DE0..2DFF; Cyrillic Extended-A
2E00..2E7F; Supplemental Punctuation
2E80..2EFF; CJK Radicals Supplement
2F00..2FDF; Kangxi Radicals
@@ -133,11 +145,29 @@
4E00..9FFF; CJK Unified Ideographs
A000..A48F; Yi Syllables
A490..A4CF; Yi Radicals
A4D0..A4FF; Lisu
A500..A63F; Vai
A640..A69F; Cyrillic Extended-B
A6A0..A6FF; Bamum
A700..A71F; Modifier Tone Letters
A720..A7FF; Latin Extended-D
A800..A82F; Syloti Nagri
A830..A83F; Common Indic Number Forms
A840..A87F; Phags-pa
A880..A8DF; Saurashtra
A8E0..A8FF; Devanagari Extended
A900..A92F; Kayah Li
A930..A95F; Rejang
A960..A97F; Hangul Jamo Extended-A
A980..A9DF; Javanese
AA00..AA5F; Cham
AA60..AA7F; Myanmar Extended-A
AA80..AADF; Tai Viet
AAE0..AAFF; Meetei Mayek Extensions
AB00..AB2F; Ethiopic Extended-A
ABC0..ABFF; Meetei Mayek
AC00..D7AF; Hangul Syllables
D7B0..D7FF; Hangul Jamo Extended-B
D800..DB7F; High Surrogates
DB80..DBFF; High Private Use Surrogates
DC00..DFFF; Low Surrogates
@@ -157,6 +187,10 @@ FFF0..FFFF; Specials
10080..100FF; Linear B Ideograms
10100..1013F; Aegean Numbers
10140..1018F; Ancient Greek Numbers
10190..101CF; Ancient Symbols
101D0..101FF; Phaistos Disc
10280..1029F; Lycian
102A0..102DF; Carian
10300..1032F; Old Italic
10330..1034F; Gothic
10380..1039F; Ugaritic
@@ -165,17 +199,49 @@ FFF0..FFFF; Specials
10450..1047F; Shavian
10480..104AF; Osmanya
10800..1083F; Cypriot Syllabary
10840..1085F; Imperial Aramaic
10900..1091F; Phoenician
10920..1093F; Lydian
10980..1099F; Meroitic Hieroglyphs
109A0..109FF; Meroitic Cursive
10A00..10A5F; Kharoshthi
10A60..10A7F; Old South Arabian
10B00..10B3F; Avestan
10B40..10B5F; Inscriptional Parthian
10B60..10B7F; Inscriptional Pahlavi
10C00..10C4F; Old Turkic
10E60..10E7F; Rumi Numeral Symbols
11000..1107F; Brahmi
11080..110CF; Kaithi
110D0..110FF; Sora Sompeng
11100..1114F; Chakma
11180..111DF; Sharada
11680..116CF; Takri
12000..123FF; Cuneiform
12400..1247F; Cuneiform Numbers and Punctuation
13000..1342F; Egyptian Hieroglyphs
16800..16A3F; Bamum Supplement
16F00..16F9F; Miao
1B000..1B0FF; Kana Supplement
1D000..1D0FF; Byzantine Musical Symbols
1D100..1D1FF; Musical Symbols
1D200..1D24F; Ancient Greek Musical Notation
1D300..1D35F; Tai Xuan Jing Symbols
1D360..1D37F; Counting Rod Numerals
1D400..1D7FF; Mathematical Alphanumeric Symbols
1EE00..1EEFF; Arabic Mathematical Alphabetic Symbols
1F000..1F02F; Mahjong Tiles
1F030..1F09F; Domino Tiles
1F0A0..1F0FF; Playing Cards
1F100..1F1FF; Enclosed Alphanumeric Supplement
1F200..1F2FF; Enclosed Ideographic Supplement
1F300..1F5FF; Miscellaneous Symbols And Pictographs
1F600..1F64F; Emoticons
1F680..1F6FF; Transport And Map Symbols
1F700..1F77F; Alchemical Symbols
20000..2A6DF; CJK Unified Ideographs Extension B
2A700..2B73F; CJK Unified Ideographs Extension C
2B740..2B81F; CJK Unified Ideographs Extension D
2F800..2FA1F; CJK Compatibility Ideographs Supplement
E0000..E007F; Tags
E0100..E01EF; Variation Selectors Supplement
@@ -1,10 +1,10 @@
|
||||
# CaseFolding-5.0.0.txt
|
||||
# Date: 2006-03-03, 08:22:43 GMT [MD]
|
||||
# CaseFolding-6.1.0.txt
|
||||
# Date: 2011-07-25, 21:21:56 GMT [MD]
|
||||
#
|
||||
# Unicode Character Database
|
||||
# Copyright (c) 1991-2006 Unicode, Inc.
|
||||
# Copyright (c) 1991-2011 Unicode, Inc.
|
||||
# For terms of use, see http://www.unicode.org/terms_of_use.html
|
||||
# For documentation, see UCD.html
|
||||
# For documentation, see http://www.unicode.org/reports/tr44/
|
||||
#
|
||||
# Case Folding Properties
|
||||
#
|
||||
@@ -23,8 +23,9 @@
|
||||
#
|
||||
# NOTE: case folding does not preserve normalization formats!
|
||||
#
|
||||
# For information on case folding, see
|
||||
# UTR #21 Case Mappings, at http://www.unicode.org/unicode/reports/tr21/
|
||||
# For information on case folding, including how to have case folding
|
||||
# preserve normalization formats, see Section 3.13 Default Case Algorithms in
|
||||
# The Unicode Standard, Version 5.0.
|
||||
#
|
||||
# ================================================================================
|
||||
# Format
|
||||
@@ -52,6 +53,14 @@
|
||||
#
|
||||
# =================================================================
|
||||
|
||||
# Property: Case_Folding
|
||||
|
||||
# All code points not explicitly listed for Case_Folding
|
||||
# have the value C for the status field, and the code point itself for the mapping field.
|
||||
|
||||
# @missing: 0000..10FFFF; C; <code point>
|
||||
|
||||
# =================================================================
|
||||
0041; C; 0061; # LATIN CAPITAL LETTER A
|
||||
0042; C; 0062; # LATIN CAPITAL LETTER B
|
||||
0043; C; 0063; # LATIN CAPITAL LETTER C
|
||||
@@ -286,6 +295,9 @@
|
||||
024C; C; 024D; # LATIN CAPITAL LETTER R WITH STROKE
|
||||
024E; C; 024F; # LATIN CAPITAL LETTER Y WITH STROKE
|
||||
0345; C; 03B9; # COMBINING GREEK YPOGEGRAMMENI
|
||||
0370; C; 0371; # GREEK CAPITAL LETTER HETA
|
||||
0372; C; 0373; # GREEK CAPITAL LETTER ARCHAIC SAMPI
|
||||
0376; C; 0377; # GREEK CAPITAL LETTER PAMPHYLIAN DIGAMMA
|
||||
0386; C; 03AC; # GREEK CAPITAL LETTER ALPHA WITH TONOS
|
||||
0388; C; 03AD; # GREEK CAPITAL LETTER EPSILON WITH TONOS
|
||||
0389; C; 03AE; # GREEK CAPITAL LETTER ETA WITH TONOS
|
||||
@@ -322,6 +334,7 @@
|
||||
03AB; C; 03CB; # GREEK CAPITAL LETTER UPSILON WITH DIALYTIKA
|
||||
03B0; F; 03C5 0308 0301; # GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND TONOS
|
||||
03C2; C; 03C3; # GREEK SMALL LETTER FINAL SIGMA
|
||||
03CF; C; 03D7; # GREEK CAPITAL KAI SYMBOL
|
||||
03D0; C; 03B2; # GREEK BETA SYMBOL
|
||||
03D1; C; 03B8; # GREEK THETA SYMBOL
|
||||
03D5; C; 03C6; # GREEK PHI SYMBOL
|
||||
@@ -482,6 +495,16 @@
|
||||
050E; C; 050F; # CYRILLIC CAPITAL LETTER KOMI TJE
|
||||
0510; C; 0511; # CYRILLIC CAPITAL LETTER REVERSED ZE
|
||||
0512; C; 0513; # CYRILLIC CAPITAL LETTER EL WITH HOOK
|
||||
0514; C; 0515; # CYRILLIC CAPITAL LETTER LHA
|
||||
0516; C; 0517; # CYRILLIC CAPITAL LETTER RHA
|
||||
0518; C; 0519; # CYRILLIC CAPITAL LETTER YAE
|
||||
051A; C; 051B; # CYRILLIC CAPITAL LETTER QA
|
||||
051C; C; 051D; # CYRILLIC CAPITAL LETTER WE
|
||||
051E; C; 051F; # CYRILLIC CAPITAL LETTER ALEUT KA
|
||||
0520; C; 0521; # CYRILLIC CAPITAL LETTER EL WITH MIDDLE HOOK
|
||||
0522; C; 0523; # CYRILLIC CAPITAL LETTER EN WITH MIDDLE HOOK
|
||||
0524; C; 0525; # CYRILLIC CAPITAL LETTER PE WITH DESCENDER
|
||||
0526; C; 0527; # CYRILLIC CAPITAL LETTER SHHA WITH DESCENDER
|
||||
0531; C; 0561; # ARMENIAN CAPITAL LETTER AYB
|
||||
0532; C; 0562; # ARMENIAN CAPITAL LETTER BEN
|
||||
0533; C; 0563; # ARMENIAN CAPITAL LETTER GIM
|
||||
@@ -559,6 +582,8 @@
|
||||
10C3; C; 2D23; # GEORGIAN CAPITAL LETTER WE
|
||||
10C4; C; 2D24; # GEORGIAN CAPITAL LETTER HAR
|
||||
10C5; C; 2D25; # GEORGIAN CAPITAL LETTER HOE
|
||||
10C7; C; 2D27; # GEORGIAN CAPITAL LETTER YN
|
||||
10CD; C; 2D2D; # GEORGIAN CAPITAL LETTER AEN
|
||||
1E00; C; 1E01; # LATIN CAPITAL LETTER A WITH RING BELOW
|
||||
1E02; C; 1E03; # LATIN CAPITAL LETTER B WITH DOT ABOVE
|
||||
1E04; C; 1E05; # LATIN CAPITAL LETTER B WITH DOT BELOW
|
||||
@@ -640,6 +665,8 @@
|
||||
1E99; F; 0079 030A; # LATIN SMALL LETTER Y WITH RING ABOVE
|
||||
1E9A; F; 0061 02BE; # LATIN SMALL LETTER A WITH RIGHT HALF RING
|
||||
1E9B; C; 1E61; # LATIN SMALL LETTER LONG S WITH DOT ABOVE
|
||||
1E9E; F; 0073 0073; # LATIN CAPITAL LETTER SHARP S
|
||||
1E9E; S; 00DF; # LATIN CAPITAL LETTER SHARP S
|
||||
1EA0; C; 1EA1; # LATIN CAPITAL LETTER A WITH DOT BELOW
|
||||
1EA2; C; 1EA3; # LATIN CAPITAL LETTER A WITH HOOK ABOVE
|
||||
1EA4; C; 1EA5; # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND ACUTE
|
||||
@@ -685,6 +712,9 @@
|
||||
1EF4; C; 1EF5; # LATIN CAPITAL LETTER Y WITH DOT BELOW
|
||||
1EF6; C; 1EF7; # LATIN CAPITAL LETTER Y WITH HOOK ABOVE
|
||||
1EF8; C; 1EF9; # LATIN CAPITAL LETTER Y WITH TILDE
|
||||
1EFA; C; 1EFB; # LATIN CAPITAL LETTER MIDDLE-WELSH LL
|
||||
1EFC; C; 1EFD; # LATIN CAPITAL LETTER MIDDLE-WELSH V
|
||||
1EFE; C; 1EFF; # LATIN CAPITAL LETTER Y WITH LOOP
|
||||
1F08; C; 1F00; # GREEK CAPITAL LETTER ALPHA WITH PSILI
|
||||
1F09; C; 1F01; # GREEK CAPITAL LETTER ALPHA WITH DASIA
|
||||
1F0A; C; 1F02; # GREEK CAPITAL LETTER ALPHA WITH PSILI AND VARIA
|
||||
@@ -962,7 +992,14 @@
|
||||
2C67; C; 2C68; # LATIN CAPITAL LETTER H WITH DESCENDER
|
||||
2C69; C; 2C6A; # LATIN CAPITAL LETTER K WITH DESCENDER
|
||||
2C6B; C; 2C6C; # LATIN CAPITAL LETTER Z WITH DESCENDER
|
||||
2C6D; C; 0251; # LATIN CAPITAL LETTER ALPHA
|
||||
2C6E; C; 0271; # LATIN CAPITAL LETTER M WITH HOOK
|
||||
2C6F; C; 0250; # LATIN CAPITAL LETTER TURNED A
|
||||
2C70; C; 0252; # LATIN CAPITAL LETTER TURNED ALPHA
|
||||
2C72; C; 2C73; # LATIN CAPITAL LETTER W WITH HOOK
|
||||
2C75; C; 2C76; # LATIN CAPITAL LETTER HALF H
|
||||
2C7E; C; 023F; # LATIN CAPITAL LETTER S WITH SWASH TAIL
|
||||
2C7F; C; 0240; # LATIN CAPITAL LETTER Z WITH SWASH TAIL
|
||||
2C80; C; 2C81; # COPTIC CAPITAL LETTER ALFA
|
||||
2C82; C; 2C83; # COPTIC CAPITAL LETTER VIDA
|
||||
2C84; C; 2C85; # COPTIC CAPITAL LETTER GAMMA
|
||||
@@ -1013,6 +1050,100 @@
|
||||
2CDE; C; 2CDF; # COPTIC CAPITAL LETTER OLD NUBIAN NGI
|
||||
2CE0; C; 2CE1; # COPTIC CAPITAL LETTER OLD NUBIAN NYI
|
||||
2CE2; C; 2CE3; # COPTIC CAPITAL LETTER OLD NUBIAN WAU
|
||||
2CEB; C; 2CEC; # COPTIC CAPITAL LETTER CRYPTOGRAMMIC SHEI
|
||||
2CED; C; 2CEE; # COPTIC CAPITAL LETTER CRYPTOGRAMMIC GANGIA
|
||||
2CF2; C; 2CF3; # COPTIC CAPITAL LETTER BOHAIRIC KHEI
|
||||
A640; C; A641; # CYRILLIC CAPITAL LETTER ZEMLYA
|
||||
A642; C; A643; # CYRILLIC CAPITAL LETTER DZELO
|
||||
A644; C; A645; # CYRILLIC CAPITAL LETTER REVERSED DZE
|
||||
A646; C; A647; # CYRILLIC CAPITAL LETTER IOTA
|
||||
A648; C; A649; # CYRILLIC CAPITAL LETTER DJERV
|
||||
A64A; C; A64B; # CYRILLIC CAPITAL LETTER MONOGRAPH UK
|
||||
A64C; C; A64D; # CYRILLIC CAPITAL LETTER BROAD OMEGA
|
||||
A64E; C; A64F; # CYRILLIC CAPITAL LETTER NEUTRAL YER
|
||||
A650; C; A651; # CYRILLIC CAPITAL LETTER YERU WITH BACK YER
|
||||
A652; C; A653; # CYRILLIC CAPITAL LETTER IOTIFIED YAT
|
||||
A654; C; A655; # CYRILLIC CAPITAL LETTER REVERSED YU
|
||||
A656; C; A657; # CYRILLIC CAPITAL LETTER IOTIFIED A
|
||||
A658; C; A659; # CYRILLIC CAPITAL LETTER CLOSED LITTLE YUS
|
||||
A65A; C; A65B; # CYRILLIC CAPITAL LETTER BLENDED YUS
|
||||
A65C; C; A65D; # CYRILLIC CAPITAL LETTER IOTIFIED CLOSED LITTLE YUS
|
||||
A65E; C; A65F; # CYRILLIC CAPITAL LETTER YN
|
||||
A660; C; A661; # CYRILLIC CAPITAL LETTER REVERSED TSE
|
||||
A662; C; A663; # CYRILLIC CAPITAL LETTER SOFT DE
|
||||
A664; C; A665; # CYRILLIC CAPITAL LETTER SOFT EL
|
||||
A666; C; A667; # CYRILLIC CAPITAL LETTER SOFT EM
|
||||
A668; C; A669; # CYRILLIC CAPITAL LETTER MONOCULAR O
|
||||
A66A; C; A66B; # CYRILLIC CAPITAL LETTER BINOCULAR O
|
||||
A66C; C; A66D; # CYRILLIC CAPITAL LETTER DOUBLE MONOCULAR O
|
||||
A680; C; A681; # CYRILLIC CAPITAL LETTER DWE
|
||||
A682; C; A683; # CYRILLIC CAPITAL LETTER DZWE
|
||||
A684; C; A685; # CYRILLIC CAPITAL LETTER ZHWE
|
||||
A686; C; A687; # CYRILLIC CAPITAL LETTER CCHE
|
||||
A688; C; A689; # CYRILLIC CAPITAL LETTER DZZE
|
||||
A68A; C; A68B; # CYRILLIC CAPITAL LETTER TE WITH MIDDLE HOOK
|
||||
A68C; C; A68D; # CYRILLIC CAPITAL LETTER TWE
|
||||
A68E; C; A68F; # CYRILLIC CAPITAL LETTER TSWE
|
||||
A690; C; A691; # CYRILLIC CAPITAL LETTER TSSE
|
||||
A692; C; A693; # CYRILLIC CAPITAL LETTER TCHE
|
||||
A694; C; A695; # CYRILLIC CAPITAL LETTER HWE
|
||||
A696; C; A697; # CYRILLIC CAPITAL LETTER SHWE
|
||||
A722; C; A723; # LATIN CAPITAL LETTER EGYPTOLOGICAL ALEF
|
||||
A724; C; A725; # LATIN CAPITAL LETTER EGYPTOLOGICAL AIN
|
||||
A726; C; A727; # LATIN CAPITAL LETTER HENG
|
||||
A728; C; A729; # LATIN CAPITAL LETTER TZ
|
||||
A72A; C; A72B; # LATIN CAPITAL LETTER TRESILLO
|
||||
A72C; C; A72D; # LATIN CAPITAL LETTER CUATRILLO
|
||||
A72E; C; A72F; # LATIN CAPITAL LETTER CUATRILLO WITH COMMA
|
||||
A732; C; A733; # LATIN CAPITAL LETTER AA
|
||||
A734; C; A735; # LATIN CAPITAL LETTER AO
|
||||
A736; C; A737; # LATIN CAPITAL LETTER AU
|
||||
A738; C; A739; # LATIN CAPITAL LETTER AV
|
||||
A73A; C; A73B; # LATIN CAPITAL LETTER AV WITH HORIZONTAL BAR
|
||||
A73C; C; A73D; # LATIN CAPITAL LETTER AY
|
||||
A73E; C; A73F; # LATIN CAPITAL LETTER REVERSED C WITH DOT
|
||||
A740; C; A741; # LATIN CAPITAL LETTER K WITH STROKE
|
||||
A742; C; A743; # LATIN CAPITAL LETTER K WITH DIAGONAL STROKE
|
||||
A744; C; A745; # LATIN CAPITAL LETTER K WITH STROKE AND DIAGONAL STROKE
|
||||
A746; C; A747; # LATIN CAPITAL LETTER BROKEN L
|
||||
A748; C; A749; # LATIN CAPITAL LETTER L WITH HIGH STROKE
|
||||
A74A; C; A74B; # LATIN CAPITAL LETTER O WITH LONG STROKE OVERLAY
|
||||
A74C; C; A74D; # LATIN CAPITAL LETTER O WITH LOOP
|
||||
A74E; C; A74F; # LATIN CAPITAL LETTER OO
|
||||
A750; C; A751; # LATIN CAPITAL LETTER P WITH STROKE THROUGH DESCENDER
|
||||
A752; C; A753; # LATIN CAPITAL LETTER P WITH FLOURISH
|
||||
A754; C; A755; # LATIN CAPITAL LETTER P WITH SQUIRREL TAIL
|
||||
A756; C; A757; # LATIN CAPITAL LETTER Q WITH STROKE THROUGH DESCENDER
|
||||
A758; C; A759; # LATIN CAPITAL LETTER Q WITH DIAGONAL STROKE
|
||||
A75A; C; A75B; # LATIN CAPITAL LETTER R ROTUNDA
|
||||
A75C; C; A75D; # LATIN CAPITAL LETTER RUM ROTUNDA
|
||||
A75E; C; A75F; # LATIN CAPITAL LETTER V WITH DIAGONAL STROKE
|
||||
A760; C; A761; # LATIN CAPITAL LETTER VY
|
||||
A762; C; A763; # LATIN CAPITAL LETTER VISIGOTHIC Z
|
||||
A764; C; A765; # LATIN CAPITAL LETTER THORN WITH STROKE
|
||||
A766; C; A767; # LATIN CAPITAL LETTER THORN WITH STROKE THROUGH DESCENDER
|
||||
A768; C; A769; # LATIN CAPITAL LETTER VEND
|
||||
A76A; C; A76B; # LATIN CAPITAL LETTER ET
|
||||
A76C; C; A76D; # LATIN CAPITAL LETTER IS
|
||||
A76E; C; A76F; # LATIN CAPITAL LETTER CON
|
||||
A779; C; A77A; # LATIN CAPITAL LETTER INSULAR D
|
||||
A77B; C; A77C; # LATIN CAPITAL LETTER INSULAR F
|
||||
A77D; C; 1D79; # LATIN CAPITAL LETTER INSULAR G
|
||||
A77E; C; A77F; # LATIN CAPITAL LETTER TURNED INSULAR G
|
||||
A780; C; A781; # LATIN CAPITAL LETTER TURNED L
|
||||
A782; C; A783; # LATIN CAPITAL LETTER INSULAR R
|
||||
A784; C; A785; # LATIN CAPITAL LETTER INSULAR S
|
||||
A786; C; A787; # LATIN CAPITAL LETTER INSULAR T
|
||||
A78B; C; A78C; # LATIN CAPITAL LETTER SALTILLO
|
||||
A78D; C; 0265; # LATIN CAPITAL LETTER TURNED H
|
||||
A790; C; A791; # LATIN CAPITAL LETTER N WITH DESCENDER
|
||||
A792; C; A793; # LATIN CAPITAL LETTER C WITH BAR
|
||||
A7A0; C; A7A1; # LATIN CAPITAL LETTER G WITH OBLIQUE STROKE
|
||||
A7A2; C; A7A3; # LATIN CAPITAL LETTER K WITH OBLIQUE STROKE
|
||||
A7A4; C; A7A5; # LATIN CAPITAL LETTER N WITH OBLIQUE STROKE
|
||||
A7A6; C; A7A7; # LATIN CAPITAL LETTER R WITH OBLIQUE STROKE
|
||||
A7A8; C; A7A9; # LATIN CAPITAL LETTER S WITH OBLIQUE STROKE
|
||||
A7AA; C; 0266; # LATIN CAPITAL LETTER H WITH HOOK
|
||||
FB00; F; 0066 0066; # LATIN SMALL LIGATURE FF
|
||||
FB01; F; 0066 0069; # LATIN SMALL LIGATURE FI
|
||||
FB02; F; 0066 006C; # LATIN SMALL LIGATURE FL
|
||||
|
@@ -1,18 +1,14 @@
|
||||
# DerivedAge-5.0.0.txt
|
||||
# Date: 2006-07-14, 17:25:00 PST [MD/KW]
|
||||
# DerivedAge-6.1.0.txt
|
||||
# Date: 2012-01-20, 21:47:00 GMT [MD, KW]
|
||||
#
|
||||
# Unicode Character Database
|
||||
# Copyright (c) 1991-2006 Unicode, Inc.
|
||||
# Copyright (c) 1991-2012 Unicode, Inc.
|
||||
# For terms of use, see http://www.unicode.org/terms_of_use.html
|
||||
# For documentation, see UCD.html
|
||||
# For documentation, see http://www.unicode.org/reports/tr44/
|
||||
#
|
||||
# Unicode Character Database: Derived Property Data
|
||||
# This file shows when various code points were first assigned in Unicode.
|
||||
#
|
||||
# Caution: When using the Age *property*, all assigned code points
|
||||
# in each version are included, not just the newly assigned code points.
|
||||
# For more information, see http://www.unicode.org/reports/tr18/
|
||||
#
|
||||
# Notes:
|
||||
#
|
||||
# - The term 'assigned' means that a previously reserved code point was assigned
|
||||
@@ -37,15 +33,23 @@
|
||||
|
||||
# ================================================
|
||||
|
||||
# Property: Age
|
||||
# Property: Age
|
||||
#
|
||||
# Note: When using the Age property in regular expressions,
|
||||
# an expression such as "\p{age=3.0}" matches all of the code points
|
||||
# assigned in Version 3.0--that is, all the code points with a value
|
||||
# less than or equal to 3.0 for the Age property.
|
||||
# For more information, see [http://www.unicode.org/reports/tr18/].
|
||||
|
||||
# All code points not explicitly listed for Age
|
||||
# have the value unassigned.
|
||||
# have the value Unassigned (NA).
|
||||
|
||||
# @missing: 0000..10FFFF; unassigned
|
||||
# @missing: 0000..10FFFF; Unassigned
|
||||
|
||||
# ================================================
|
||||
|
||||
# Age=V1_1
|
||||
|
||||
# Assigned as of Unicode 1.1.0 (June, 1993)
|
||||
# [excluding removed Hangul Syllables]
|
||||
|
||||
@@ -355,6 +359,8 @@ FFFE..FFFF ; 1.1 # [2] <noncharacter-FFFE>..<noncharacter-FFFF>
|
||||
|
||||
# ================================================
|
||||
|
||||
# Age=V2_0
|
||||
|
||||
# Newly assigned in Unicode 2.0.0 (July, 1996)
|
||||
|
||||
0591..05A1 ; 2.0 # [17] HEBREW ACCENT ETNAHTA..HEBREW ACCENT PAZER
|
||||
@@ -395,6 +401,8 @@ FFFFE..FFFFF ; 2.0 # [2] <noncharacter-FFFFE>..<noncharacter-FFFFF>
|
||||
|
||||
# ================================================
|
||||
|
||||
# Age=V2_1
|
||||
|
||||
# Newly assigned in Unicode 2.1.2 (May, 1998)
|
||||
|
||||
20AC ; 2.1 # EURO SIGN
|
||||
@@ -404,6 +412,8 @@ FFFC ; 2.1 # OBJECT REPLACEMENT CHARACTER
|
||||
|
||||
# ================================================
|
||||
|
||||
# Age=V3_0
|
||||
|
||||
# Newly assigned in Unicode 3.0.0 (September, 1999)
|
||||
|
||||
01F6..01F9 ; 3.0 # [4] LATIN CAPITAL LETTER HWAIR..LATIN SMALL LETTER N WITH GRAVE
|
||||
@@ -491,9 +501,7 @@ FFFC ; 2.1 # OBJECT REPLACEMENT CHARACTER
|
||||
1401..1676 ; 3.0 # [630] CANADIAN SYLLABICS E..CANADIAN SYLLABICS NNGAA
|
||||
1680..169C ; 3.0 # [29] OGHAM SPACE MARK..OGHAM REVERSED FEATHER MARK
|
||||
16A0..16F0 ; 3.0 # [81] RUNIC LETTER FEHU FEOH FE F..RUNIC BELGTHOR SYMBOL
|
||||
1780..17B3 ; 3.0 # [52] KHMER LETTER KA..KHMER INDEPENDENT VOWEL QAU
|
||||
17B4..17B5 ; 3.0 # [2] KHMER VOWEL INHERENT AQ..KHMER VOWEL INHERENT AA
|
||||
17B6..17DC ; 3.0 # [39] KHMER VOWEL SIGN AA..KHMER SIGN AVAKRAHASANYA
|
||||
1780..17DC ; 3.0 # [93] KHMER LETTER KA..KHMER SIGN AVAKRAHASANYA
|
||||
17E0..17E9 ; 3.0 # [10] KHMER DIGIT ZERO..KHMER DIGIT NINE
|
||||
1800..180E ; 3.0 # [15] MONGOLIAN BIRGA..MONGOLIAN VOWEL SEPARATOR
|
||||
1810..1819 ; 3.0 # [10] MONGOLIAN DIGIT ZERO..MONGOLIAN DIGIT NINE
|
||||
@@ -535,6 +543,8 @@ FFF9..FFFB ; 3.0 # [3] INTERLINEAR ANNOTATION ANCHOR..INTERLINEAR ANNOTATIO
|
||||
|
||||
# ================================================
|
||||
|
||||
# Age=V3_1
|
||||
|
||||
# Newly assigned in Unicode 3.1.0 (March, 2001)
|
||||
|
||||
03F4..03F5 ; 3.1 # [2] GREEK CAPITAL THETA SYMBOL..GREEK LUNATE EPSILON SYMBOL
|
||||
@@ -580,6 +590,8 @@ E0020..E007F ; 3.1 # [96] TAG SPACE..CANCEL TAG
|
||||
|
||||
# ================================================
|
||||
|
||||
# Age=V3_2
|
||||
|
||||
# Newly assigned in Unicode 3.2.0 (March, 2002)
|
||||
|
||||
0220 ; 3.2 # LATIN CAPITAL LETTER N WITH LONG RIGHT LEG
|
||||
@@ -647,6 +659,8 @@ FF5F..FF60 ; 3.2 # [2] FULLWIDTH LEFT WHITE PARENTHESIS..FULLWIDTH RIGHT WH
|
||||
|
||||
# ================================================
|
||||
|
||||
# Age=V4_0
|
||||
|
||||
# Newly assigned in Unicode 4.0.0 (April, 2003)
|
||||
|
||||
0221 ; 4.0 # LATIN SMALL LETTER D WITH CURL
|
||||
@@ -731,6 +745,8 @@ E0100..E01EF ; 4.0 # [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
|
||||
|
||||
# ================================================
|
||||
|
||||
# Age=V4_1
|
||||
|
||||
# Newly assigned in Unicode 4.1.0 (March, 2005)
|
||||
|
||||
0237..0241 ; 4.1 # [11] LATIN SMALL LETTER DOTLESS J..LATIN CAPITAL LETTER GLOTTAL STOP
|
||||
@@ -824,6 +840,8 @@ FE10..FE19 ; 4.1 # [10] PRESENTATION FORM FOR VERTICAL COMMA..PRESENTATION F
|
||||
|
||||
# ================================================
|
||||
|
||||
# Age=V5_0
|
||||
|
||||
# Newly assigned in Unicode 5.0.0 (July, 2006)
|
||||
|
||||
0242..024F ; 5.0 # [14] LATIN SMALL LETTER GLOTTAL STOP..LATIN SMALL LETTER Y WITH STROKE
|
||||
@@ -864,4 +882,416 @@ A840..A877 ; 5.0 # [56] PHAGS-PA LETTER KA..PHAGS-PA MARK DOUBLE SHAD
|
||||
|
||||
# Total code points: 1369
|
||||
|
||||
# ================================================
|
||||
|
||||
# Age=V5_1
|
||||
|
||||
# Newly assigned in Unicode 5.1.0 (March, 2008)
|
||||
|
||||
0370..0373 ; 5.1 # [4] GREEK CAPITAL LETTER HETA..GREEK SMALL LETTER ARCHAIC SAMPI
|
||||
0376..0377 ; 5.1 # [2] GREEK CAPITAL LETTER PAMPHYLIAN DIGAMMA..GREEK SMALL LETTER PAMPHYLIAN DIGAMMA
|
||||
03CF ; 5.1 # GREEK CAPITAL KAI SYMBOL
|
||||
0487 ; 5.1 # COMBINING CYRILLIC POKRYTIE
|
||||
0514..0523 ; 5.1 # [16] CYRILLIC CAPITAL LETTER LHA..CYRILLIC SMALL LETTER EN WITH MIDDLE HOOK
|
||||
0606..060A ; 5.1 # [5] ARABIC-INDIC CUBE ROOT..ARABIC-INDIC PER TEN THOUSAND SIGN
|
||||
0616..061A ; 5.1 # [5] ARABIC SMALL HIGH LIGATURE ALEF WITH LAM WITH YEH..ARABIC SMALL KASRA
|
||||
063B..063F ; 5.1 # [5] ARABIC LETTER KEHEH WITH TWO DOTS ABOVE..ARABIC LETTER FARSI YEH WITH THREE DOTS ABOVE
|
||||
076E..077F ; 5.1 # [18] ARABIC LETTER HAH WITH SMALL ARABIC LETTER TAH BELOW..ARABIC LETTER KAF WITH TWO DOTS ABOVE
|
||||
0971..0972 ; 5.1 # [2] DEVANAGARI SIGN HIGH SPACING DOT..DEVANAGARI LETTER CANDRA A
|
||||
0A51 ; 5.1 # GURMUKHI SIGN UDAAT
|
||||
0A75 ; 5.1 # GURMUKHI SIGN YAKASH
|
||||
0B44 ; 5.1 # ORIYA VOWEL SIGN VOCALIC RR
|
||||
0B62..0B63 ; 5.1 # [2] ORIYA VOWEL SIGN VOCALIC L..ORIYA VOWEL SIGN VOCALIC LL
|
||||
0BD0 ; 5.1 # TAMIL OM
|
||||
0C3D ; 5.1 # TELUGU SIGN AVAGRAHA
|
||||
0C58..0C59 ; 5.1 # [2] TELUGU LETTER TSA..TELUGU LETTER DZA
|
||||
0C62..0C63 ; 5.1 # [2] TELUGU VOWEL SIGN VOCALIC L..TELUGU VOWEL SIGN VOCALIC LL
|
||||
0C78..0C7F ; 5.1 # [8] TELUGU FRACTION DIGIT ZERO FOR ODD POWERS OF FOUR..TELUGU SIGN TUUMU
|
||||
0D3D ; 5.1 # MALAYALAM SIGN AVAGRAHA
|
||||
0D44 ; 5.1 # MALAYALAM VOWEL SIGN VOCALIC RR
|
||||
0D62..0D63 ; 5.1 # [2] MALAYALAM VOWEL SIGN VOCALIC L..MALAYALAM VOWEL SIGN VOCALIC LL
|
||||
0D70..0D75 ; 5.1 # [6] MALAYALAM NUMBER TEN..MALAYALAM FRACTION THREE QUARTERS
|
||||
0D79..0D7F ; 5.1 # [7] MALAYALAM DATE MARK..MALAYALAM LETTER CHILLU K
|
||||
0F6B..0F6C ; 5.1 # [2] TIBETAN LETTER KKA..TIBETAN LETTER RRA
|
||||
0FCE ; 5.1 # TIBETAN SIGN RDEL NAG RDEL DKAR
|
||||
0FD2..0FD4 ; 5.1 # [3] TIBETAN MARK NYIS TSHEG..TIBETAN MARK CLOSING BRDA RNYING YIG MGO SGAB MA
|
||||
1022 ; 5.1 # MYANMAR LETTER SHAN A
|
||||
1028 ; 5.1 # MYANMAR LETTER MON E
|
||||
102B ; 5.1 # MYANMAR VOWEL SIGN TALL AA
|
||||
1033..1035 ; 5.1 # [3] MYANMAR VOWEL SIGN MON II..MYANMAR VOWEL SIGN E ABOVE
|
||||
103A..103F ; 5.1 # [6] MYANMAR SIGN ASAT..MYANMAR LETTER GREAT SA
|
||||
105A..1099 ; 5.1 # [64] MYANMAR LETTER MON NGA..MYANMAR SHAN DIGIT NINE
|
||||
109E..109F ; 5.1 # [2] MYANMAR SYMBOL SHAN ONE..MYANMAR SYMBOL SHAN EXCLAMATION
|
||||
18AA ; 5.1 # MONGOLIAN LETTER MANCHU ALI GALI LHA
|
||||
1B80..1BAA ; 5.1 # [43] SUNDANESE SIGN PANYECEK..SUNDANESE SIGN PAMAAEH
|
||||
1BAE..1BB9 ; 5.1 # [12] SUNDANESE LETTER KHA..SUNDANESE DIGIT NINE
|
||||
1C00..1C37 ; 5.1 # [56] LEPCHA LETTER KA..LEPCHA SIGN NUKTA
|
||||
1C3B..1C49 ; 5.1 # [15] LEPCHA PUNCTUATION TA-ROL..LEPCHA DIGIT NINE
|
||||
1C4D..1C7F ; 5.1 # [51] LEPCHA LETTER TTA..OL CHIKI PUNCTUATION DOUBLE MUCAAD
|
||||
1DCB..1DE6 ; 5.1 # [28] COMBINING BREVE-MACRON..COMBINING LATIN SMALL LETTER Z
|
||||
1E9C..1E9F ; 5.1 # [4] LATIN SMALL LETTER LONG S WITH DIAGONAL STROKE..LATIN SMALL LETTER DELTA
|
||||
1EFA..1EFF ; 5.1 # [6] LATIN CAPITAL LETTER MIDDLE-WELSH LL..LATIN SMALL LETTER Y WITH LOOP
|
||||
2064 ; 5.1 # INVISIBLE PLUS
|
||||
20F0 ; 5.1 # COMBINING ASTERISK ABOVE
|
||||
214F ; 5.1 # SYMBOL FOR SAMARITAN SOURCE
|
||||
2185..2188 ; 5.1 # [4] ROMAN NUMERAL SIX LATE FORM..ROMAN NUMERAL ONE HUNDRED THOUSAND
|
||||
269D ; 5.1 # OUTLINED WHITE STAR
|
||||
26B3..26BC ; 5.1 # [10] CERES..SESQUIQUADRATE
|
||||
26C0..26C3 ; 5.1 # [4] WHITE DRAUGHTS MAN..BLACK DRAUGHTS KING
|
||||
27CC ; 5.1 # LONG DIVISION
|
||||
27EC..27EF ; 5.1 # [4] MATHEMATICAL LEFT WHITE TORTOISE SHELL BRACKET..MATHEMATICAL RIGHT FLATTENED PARENTHESIS
|
||||
2B1B..2B1F ; 5.1 # [5] BLACK LARGE SQUARE..BLACK PENTAGON
|
||||
2B24..2B4C ; 5.1 # [41] BLACK LARGE CIRCLE..RIGHTWARDS ARROW ABOVE REVERSE TILDE OPERATOR
|
||||
2B50..2B54 ; 5.1 # [5] WHITE MEDIUM STAR..WHITE RIGHT-POINTING PENTAGON
|
||||
2C6D..2C6F ; 5.1 # [3] LATIN CAPITAL LETTER ALPHA..LATIN CAPITAL LETTER TURNED A
|
||||
2C71..2C73 ; 5.1 # [3] LATIN SMALL LETTER V WITH RIGHT HOOK..LATIN SMALL LETTER W WITH HOOK
|
||||
2C78..2C7D ; 5.1 # [6] LATIN SMALL LETTER E WITH NOTCH..MODIFIER LETTER CAPITAL V
|
||||
2DE0..2DFF ; 5.1 # [32] COMBINING CYRILLIC LETTER BE..COMBINING CYRILLIC LETTER IOTIFIED BIG YUS
|
||||
2E18..2E1B ; 5.1 # [4] INVERTED INTERROBANG..TILDE WITH RING ABOVE
|
||||
2E1E..2E30 ; 5.1 # [19] TILDE WITH DOT ABOVE..RING POINT
|
||||
312D ; 5.1 # BOPOMOFO LETTER IH
|
||||
31D0..31E3 ; 5.1 # [20] CJK STROKE H..CJK STROKE Q
|
||||
9FBC..9FC3 ; 5.1 # [8] CJK UNIFIED IDEOGRAPH-9FBC..CJK UNIFIED IDEOGRAPH-9FC3
|
||||
A500..A62B ; 5.1 # [300] VAI SYLLABLE EE..VAI SYLLABLE NDOLE DO
|
||||
A640..A65F ; 5.1 # [32] CYRILLIC CAPITAL LETTER ZEMLYA..CYRILLIC SMALL LETTER YN
|
||||
A662..A673 ; 5.1 # [18] CYRILLIC CAPITAL LETTER SOFT DE..SLAVONIC ASTERISK
|
||||
A67C..A697 ; 5.1 # [28] COMBINING CYRILLIC KAVYKA..CYRILLIC SMALL LETTER SHWE
|
||||
A71B..A71F ; 5.1 # [5] MODIFIER LETTER RAISED UP ARROW..MODIFIER LETTER LOW INVERTED EXCLAMATION MARK
|
||||
A722..A78C ; 5.1 # [107] LATIN CAPITAL LETTER EGYPTOLOGICAL ALEF..LATIN SMALL LETTER SALTILLO
|
||||
A7FB..A7FF ; 5.1 # [5] LATIN EPIGRAPHIC LETTER REVERSED F..LATIN EPIGRAPHIC LETTER ARCHAIC M
|
||||
A880..A8C4 ; 5.1 # [69] SAURASHTRA SIGN ANUSVARA..SAURASHTRA SIGN VIRAMA
|
||||
A8CE..A8D9 ; 5.1 # [12] SAURASHTRA DANDA..SAURASHTRA DIGIT NINE
|
||||
A900..A953 ; 5.1 # [84] KAYAH LI DIGIT ZERO..REJANG VIRAMA
|
||||
A95F ; 5.1 # REJANG SECTION MARK
|
||||
AA00..AA36 ; 5.1 # [55] CHAM LETTER A..CHAM CONSONANT SIGN WA
|
||||
AA40..AA4D ; 5.1 # [14] CHAM LETTER FINAL K..CHAM CONSONANT SIGN FINAL H
|
||||
AA50..AA59 ; 5.1 # [10] CHAM DIGIT ZERO..CHAM DIGIT NINE
|
||||
AA5C..AA5F ; 5.1 # [4] CHAM PUNCTUATION SPIRAL..CHAM PUNCTUATION TRIPLE DANDA
|
||||
FE24..FE26 ; 5.1 # [3] COMBINING MACRON LEFT HALF..COMBINING CONJOINING MACRON
|
||||
10190..1019B ; 5.1 # [12] ROMAN SEXTANS SIGN..ROMAN CENTURIAL SIGN
|
||||
101D0..101FD ; 5.1 # [46] PHAISTOS DISC SIGN PEDESTRIAN..PHAISTOS DISC SIGN COMBINING OBLIQUE STROKE
|
||||
10280..1029C ; 5.1 # [29] LYCIAN LETTER A..LYCIAN LETTER X
|
||||
102A0..102D0 ; 5.1 # [49] CARIAN LETTER A..CARIAN LETTER UUU3
|
||||
10920..10939 ; 5.1 # [26] LYDIAN LETTER A..LYDIAN LETTER C
|
||||
1093F ; 5.1 # LYDIAN TRIANGULAR MARK
|
||||
1D129 ; 5.1 # MUSICAL SYMBOL MULTIPLE MEASURE REST
|
||||
1F000..1F02B ; 5.1 # [44] MAHJONG TILE EAST WIND..MAHJONG TILE BACK
|
||||
1F030..1F093 ; 5.1 # [100] DOMINO TILE HORIZONTAL BACK..DOMINO TILE VERTICAL-06-06
|
||||
|
||||
# Total code points: 1624
|
||||
|
||||
# ================================================
|
||||
|
||||
# Age=V5_2
|
||||
|
||||
# Newly assigned in Unicode 5.2.0 (October, 2009)
|
||||
|
||||
0524..0525 ; 5.2 # [2] CYRILLIC CAPITAL LETTER PE WITH DESCENDER..CYRILLIC SMALL LETTER PE WITH DESCENDER
|
||||
0800..082D ; 5.2 # [46] SAMARITAN LETTER ALAF..SAMARITAN MARK NEQUDAA
|
||||
0830..083E ; 5.2 # [15] SAMARITAN PUNCTUATION NEQUDAA..SAMARITAN PUNCTUATION ANNAAU
|
||||
0900 ; 5.2 # DEVANAGARI SIGN INVERTED CANDRABINDU
|
||||
094E ; 5.2 # DEVANAGARI VOWEL SIGN PRISHTHAMATRA E
|
||||
0955 ; 5.2 # DEVANAGARI VOWEL SIGN CANDRA LONG E
|
||||
0979..097A ; 5.2 # [2] DEVANAGARI LETTER ZHA..DEVANAGARI LETTER HEAVY YA
|
||||
09FB ; 5.2 # BENGALI GANDA MARK
|
||||
0FD5..0FD8 ; 5.2 # [4] RIGHT-FACING SVASTI SIGN..LEFT-FACING SVASTI SIGN WITH DOTS
|
||||
109A..109D ; 5.2 # [4] MYANMAR SIGN KHAMTI TONE-1..MYANMAR VOWEL SIGN AITON AI
|
||||
115A..115E ; 5.2 # [5] HANGUL CHOSEONG KIYEOK-TIKEUT..HANGUL CHOSEONG TIKEUT-RIEUL
|
||||
11A3..11A7 ; 5.2 # [5] HANGUL JUNGSEONG A-EU..HANGUL JUNGSEONG O-YAE
|
||||
11FA..11FF ; 5.2 # [6] HANGUL JONGSEONG KIYEOK-NIEUN..HANGUL JONGSEONG SSANGNIEUN
|
||||
1400 ; 5.2 # CANADIAN SYLLABICS HYPHEN
|
||||
1677..167F ; 5.2 # [9] CANADIAN SYLLABICS WOODS-CREE THWEE..CANADIAN SYLLABICS BLACKFOOT W
|
||||
18B0..18F5 ; 5.2 # [70] CANADIAN SYLLABICS OY..CANADIAN SYLLABICS CARRIER DENTAL S
|
||||
19AA..19AB ; 5.2 # [2] NEW TAI LUE LETTER HIGH SUA..NEW TAI LUE LETTER LOW SUA
|
||||
19DA ; 5.2 # NEW TAI LUE THAM DIGIT ONE
|
||||
1A20..1A5E ; 5.2 # [63] TAI THAM LETTER HIGH KA..TAI THAM CONSONANT SIGN SA
|
||||
1A60..1A7C ; 5.2 # [29] TAI THAM SIGN SAKOT..TAI THAM SIGN KHUEN-LUE KARAN
|
||||
1A7F..1A89 ; 5.2 # [11] TAI THAM COMBINING CRYPTOGRAMMIC DOT..TAI THAM HORA DIGIT NINE
|
||||
1A90..1A99 ; 5.2 # [10] TAI THAM THAM DIGIT ZERO..TAI THAM THAM DIGIT NINE
|
||||
1AA0..1AAD ; 5.2 # [14] TAI THAM SIGN WIANG..TAI THAM SIGN CAANG
|
||||
1CD0..1CF2 ; 5.2 # [35] VEDIC TONE KARSHANA..VEDIC SIGN ARDHAVISARGA
|
||||
1DFD ; 5.2 # COMBINING ALMOST EQUAL TO BELOW
|
||||
20B6..20B8 ; 5.2 # [3] LIVRE TOURNOIS SIGN..TENGE SIGN
|
||||
2150..2152 ; 5.2 # [3] VULGAR FRACTION ONE SEVENTH..VULGAR FRACTION ONE TENTH
|
||||
2189 ; 5.2 # VULGAR FRACTION ZERO THIRDS
|
||||
23E8 ; 5.2 # DECIMAL EXPONENT SYMBOL
|
||||
269E..269F ; 5.2 # [2] THREE LINES CONVERGING RIGHT..THREE LINES CONVERGING LEFT
|
||||
26BD..26BF ; 5.2 # [3] SOCCER BALL..SQUARED KEY
|
||||
26C4..26CD ; 5.2 # [10] SNOWMAN WITHOUT SNOW..DISABLED CAR
|
||||
26CF..26E1 ; 5.2 # [19] PICK..RESTRICTED LEFT ENTRY-2
|
||||
26E3 ; 5.2 # HEAVY CIRCLE WITH STROKE AND TWO DOTS ABOVE
|
||||
26E8..26FF ; 5.2 # [24] BLACK CROSS ON SHIELD..WHITE FLAG WITH HORIZONTAL MIDDLE BLACK STRIPE
|
||||
2757 ; 5.2 # HEAVY EXCLAMATION MARK SYMBOL
|
||||
2B55..2B59 ; 5.2 # [5] HEAVY LARGE CIRCLE..HEAVY CIRCLED SALTIRE
|
||||
2C70 ; 5.2 # LATIN CAPITAL LETTER TURNED ALPHA
|
||||
2C7E..2C7F ; 5.2 # [2] LATIN CAPITAL LETTER S WITH SWASH TAIL..LATIN CAPITAL LETTER Z WITH SWASH TAIL
|
||||
2CEB..2CF1 ; 5.2 # [7] COPTIC CAPITAL LETTER CRYPTOGRAMMIC SHEI..COPTIC COMBINING SPIRITUS LENIS
|
||||
2E31 ; 5.2 # WORD SEPARATOR MIDDLE DOT
|
||||
3244..324F ; 5.2 # [12] CIRCLED IDEOGRAPH QUESTION..CIRCLED NUMBER EIGHTY ON BLACK SQUARE
|
||||
9FC4..9FCB ; 5.2 # [8] CJK UNIFIED IDEOGRAPH-9FC4..CJK UNIFIED IDEOGRAPH-9FCB
|
||||
A4D0..A4FF ; 5.2 # [48] LISU LETTER BA..LISU PUNCTUATION FULL STOP
|
||||
A6A0..A6F7 ; 5.2 # [88] BAMUM LETTER A..BAMUM QUESTION MARK
|
||||
A830..A839 ; 5.2 # [10] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC QUANTITY MARK
|
||||
A8E0..A8FB ; 5.2 # [28] COMBINING DEVANAGARI DIGIT ZERO..DEVANAGARI HEADSTROKE
|
||||
A960..A97C ; 5.2 # [29] HANGUL CHOSEONG TIKEUT-MIEUM..HANGUL CHOSEONG SSANGYEORINHIEUH
|
||||
A980..A9CD ; 5.2 # [78] JAVANESE SIGN PANYANGGA..JAVANESE TURNED PADA PISELEH
|
||||
A9CF..A9D9 ; 5.2 # [11] JAVANESE PANGRANGKEP..JAVANESE DIGIT NINE
|
||||
A9DE..A9DF ; 5.2 # [2] JAVANESE PADA TIRTA TUMETES..JAVANESE PADA ISEN-ISEN
|
||||
AA60..AA7B ; 5.2 # [28] MYANMAR LETTER KHAMTI GA..MYANMAR SIGN PAO KAREN TONE
|
||||
AA80..AAC2 ; 5.2 # [67] TAI VIET LETTER LOW KO..TAI VIET TONE MAI SONG
|
||||
AADB..AADF ; 5.2 # [5] TAI VIET SYMBOL KON..TAI VIET SYMBOL KOI KOI
|
||||
ABC0..ABED ; 5.2 # [46] MEETEI MAYEK LETTER KOK..MEETEI MAYEK APUN IYEK
|
||||
ABF0..ABF9 ; 5.2 # [10] MEETEI MAYEK DIGIT ZERO..MEETEI MAYEK DIGIT NINE
|
||||
D7B0..D7C6 ; 5.2 # [23] HANGUL JUNGSEONG O-YEO..HANGUL JUNGSEONG ARAEA-E
|
||||
D7CB..D7FB ; 5.2 # [49] HANGUL JONGSEONG NIEUN-RIEUL..HANGUL JONGSEONG PHIEUPH-THIEUTH
|
||||
FA6B..FA6D ; 5.2 # [3] CJK COMPATIBILITY IDEOGRAPH-FA6B..CJK COMPATIBILITY IDEOGRAPH-FA6D
|
||||
10840..10855 ; 5.2 # [22] IMPERIAL ARAMAIC LETTER ALEPH..IMPERIAL ARAMAIC LETTER TAW
|
||||
10857..1085F ; 5.2 # [9] IMPERIAL ARAMAIC SECTION SIGN..IMPERIAL ARAMAIC NUMBER TEN THOUSAND
|
||||
1091A..1091B ; 5.2 # [2] PHOENICIAN NUMBER TWO..PHOENICIAN NUMBER THREE
|
||||
10A60..10A7F ; 5.2 # [32] OLD SOUTH ARABIAN LETTER HE..OLD SOUTH ARABIAN NUMERIC INDICATOR
|
||||
10B00..10B35 ; 5.2 # [54] AVESTAN LETTER A..AVESTAN LETTER HE
|
||||
10B39..10B55 ; 5.2 # [29] AVESTAN ABBREVIATION MARK..INSCRIPTIONAL PARTHIAN LETTER TAW
|
||||
10B58..10B72 ; 5.2 # [27] INSCRIPTIONAL PARTHIAN NUMBER ONE..INSCRIPTIONAL PAHLAVI LETTER TAW
|
||||
10B78..10B7F ; 5.2 # [8] INSCRIPTIONAL PAHLAVI NUMBER ONE..INSCRIPTIONAL PAHLAVI NUMBER ONE THOUSAND
|
||||
10C00..10C48 ; 5.2 # [73] OLD TURKIC LETTER ORKHON A..OLD TURKIC LETTER ORKHON BASH
|
||||
10E60..10E7E ; 5.2 # [31] RUMI DIGIT ONE..RUMI FRACTION TWO THIRDS
|
||||
11080..110BC ; 5.2 # [61] KAITHI SIGN CANDRABINDU..KAITHI ENUMERATION SIGN
|
||||
110BD ; 5.2 # KAITHI NUMBER SIGN
|
||||
110BE..110C1 ; 5.2 # [4] KAITHI SECTION MARK..KAITHI DOUBLE DANDA
|
||||
13000..1342E ; 5.2 # [1071] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH AA032
|
||||
1F100..1F10A ; 5.2 # [11] DIGIT ZERO FULL STOP..DIGIT NINE COMMA
|
||||
1F110..1F12E ; 5.2 # [31] PARENTHESIZED LATIN CAPITAL LETTER A..CIRCLED WZ
|
||||
1F131 ; 5.2 # SQUARED LATIN CAPITAL LETTER B
|
||||
1F13D ; 5.2 # SQUARED LATIN CAPITAL LETTER N
|
||||
1F13F ; 5.2 # SQUARED LATIN CAPITAL LETTER P
|
||||
1F142 ; 5.2 # SQUARED LATIN CAPITAL LETTER S
|
||||
1F146 ; 5.2 # SQUARED LATIN CAPITAL LETTER W
|
||||
1F14A..1F14E ; 5.2 # [5] SQUARED HV..SQUARED PPV
|
||||
1F157 ; 5.2 # NEGATIVE CIRCLED LATIN CAPITAL LETTER H
|
||||
1F15F ; 5.2 # NEGATIVE CIRCLED LATIN CAPITAL LETTER P
|
||||
1F179 ; 5.2 # NEGATIVE SQUARED LATIN CAPITAL LETTER J
|
||||
1F17B..1F17C ; 5.2 # [2] NEGATIVE SQUARED LATIN CAPITAL LETTER L..NEGATIVE SQUARED LATIN CAPITAL LETTER M
|
||||
1F17F ; 5.2 # NEGATIVE SQUARED LATIN CAPITAL LETTER P
|
||||
1F18A..1F18D ; 5.2 # [4] CROSSED NEGATIVE SQUARED LATIN CAPITAL LETTER P..NEGATIVE SQUARED SA
|
||||
1F190 ; 5.2 # SQUARE DJ
|
||||
1F200 ; 5.2 # SQUARE HIRAGANA HOKA
|
||||
1F210..1F231 ; 5.2 # [34] SQUARED CJK UNIFIED IDEOGRAPH-624B..SQUARED CJK UNIFIED IDEOGRAPH-6253
|
||||
1F240..1F248 ; 5.2 # [9] TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-672C..TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-6557
|
||||
2A700..2B734 ; 5.2 # [4149] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B734
|
||||
|
||||
# Total code points: 6648
|
||||
|
||||
# ================================================
|
||||
|
||||
# Age=V6_0
|
||||
|
||||
# Newly assigned in Unicode 6.0.0 (October, 2010)
|
||||
|
||||
0526..0527 ; 6.0 # [2] CYRILLIC CAPITAL LETTER SHHA WITH DESCENDER..CYRILLIC SMALL LETTER SHHA WITH DESCENDER
|
||||
0620 ; 6.0 # ARABIC LETTER KASHMIRI YEH
|
||||
065F ; 6.0 # ARABIC WAVY HAMZA BELOW
|
||||
0840..085B ; 6.0 # [28] MANDAIC LETTER HALQA..MANDAIC GEMINATION MARK
|
||||
085E ; 6.0 # MANDAIC PUNCTUATION
|
||||
093A..093B ; 6.0 # [2] DEVANAGARI VOWEL SIGN OE..DEVANAGARI VOWEL SIGN OOE
|
||||
094F ; 6.0 # DEVANAGARI VOWEL SIGN AW
|
||||
0956..0957 ; 6.0 # [2] DEVANAGARI VOWEL SIGN UE..DEVANAGARI VOWEL SIGN UUE
|
||||
0973..0977 ; 6.0 # [5] DEVANAGARI LETTER OE..DEVANAGARI LETTER UUE
|
||||
0B72..0B77 ; 6.0 # [6] ORIYA FRACTION ONE QUARTER..ORIYA FRACTION THREE SIXTEENTHS
|
||||
0D29 ; 6.0 # MALAYALAM LETTER NNNA
|
||||
0D3A ; 6.0 # MALAYALAM LETTER TTTA
|
||||
0D4E ; 6.0 # MALAYALAM LETTER DOT REPH
|
||||
0F8C..0F8F ; 6.0 # [4] TIBETAN SIGN INVERTED MCHU CAN..TIBETAN SUBJOINED SIGN INVERTED MCHU CAN
|
||||
0FD9..0FDA ; 6.0 # [2] TIBETAN MARK LEADING MCHAN RTAGS..TIBETAN MARK TRAILING MCHAN RTAGS
|
||||
135D..135E ; 6.0 # [2] ETHIOPIC COMBINING GEMINATION AND VOWEL LENGTH MARK..ETHIOPIC COMBINING VOWEL LENGTH MARK
|
||||
1BC0..1BF3 ; 6.0 # [52] BATAK LETTER A..BATAK PANONGONAN
|
||||
1BFC..1BFF ; 6.0 # [4] BATAK SYMBOL BINDU NA METEK..BATAK SYMBOL BINDU PANGOLAT
|
||||
1DFC ; 6.0 # COMBINING DOUBLE INVERTED BREVE BELOW
|
||||
2095..209C ; 6.0 # [8] LATIN SUBSCRIPT SMALL LETTER H..LATIN SUBSCRIPT SMALL LETTER T
|
||||
20B9 ; 6.0 # INDIAN RUPEE SIGN
|
||||
23E9..23F3 ; 6.0 # [11] BLACK RIGHT-POINTING DOUBLE TRIANGLE..HOURGLASS WITH FLOWING SAND
|
||||
26CE ; 6.0 # OPHIUCHUS
|
||||
26E2 ; 6.0 # ASTRONOMICAL SYMBOL FOR URANUS
|
||||
26E4..26E7 ; 6.0 # [4] PENTAGRAM..INVERTED PENTAGRAM
|
||||
2705 ; 6.0 # WHITE HEAVY CHECK MARK
|
||||
270A..270B ; 6.0 # [2] RAISED FIST..RAISED HAND
|
||||
2728 ; 6.0 # SPARKLES
|
||||
274C ; 6.0 # CROSS MARK
|
||||
274E ; 6.0 # NEGATIVE SQUARED CROSS MARK
|
||||
2753..2755 ; 6.0 # [3] BLACK QUESTION MARK ORNAMENT..WHITE EXCLAMATION MARK ORNAMENT
|
||||
275F..2760 ; 6.0 # [2] HEAVY LOW SINGLE COMMA QUOTATION MARK ORNAMENT..HEAVY LOW DOUBLE COMMA QUOTATION MARK ORNAMENT
|
||||
2795..2797 ; 6.0 # [3] HEAVY PLUS SIGN..HEAVY DIVISION SIGN
|
||||
27B0 ; 6.0 # CURLY LOOP
|
||||
27BF ; 6.0 # DOUBLE CURLY LOOP
|
||||
27CE..27CF ; 6.0 # [2] SQUARED LOGICAL AND..SQUARED LOGICAL OR
|
||||
2D70 ; 6.0 # TIFINAGH SEPARATOR MARK
|
||||
2D7F ; 6.0 # TIFINAGH CONSONANT JOINER
|
||||
31B8..31BA ; 6.0 # [3] BOPOMOFO LETTER GH..BOPOMOFO LETTER ZY
|
||||
A660..A661 ; 6.0 # [2] CYRILLIC CAPITAL LETTER REVERSED TSE..CYRILLIC SMALL LETTER REVERSED TSE
|
||||
A78D..A78E ; 6.0 # [2] LATIN CAPITAL LETTER TURNED H..LATIN SMALL LETTER L WITH RETROFLEX HOOK AND BELT
|
||||
A790..A791 ; 6.0 # [2] LATIN CAPITAL LETTER N WITH DESCENDER..LATIN SMALL LETTER N WITH DESCENDER
|
||||
A7A0..A7A9 ; 6.0 # [10] LATIN CAPITAL LETTER G WITH OBLIQUE STROKE..LATIN SMALL LETTER S WITH OBLIQUE STROKE
|
||||
A7FA ; 6.0 # LATIN LETTER SMALL CAPITAL TURNED M
|
||||
AB01..AB06 ; 6.0 # [6] ETHIOPIC SYLLABLE TTHU..ETHIOPIC SYLLABLE TTHO
|
||||
AB09..AB0E ; 6.0 # [6] ETHIOPIC SYLLABLE DDHU..ETHIOPIC SYLLABLE DDHO
|
||||
AB11..AB16 ; 6.0 # [6] ETHIOPIC SYLLABLE DZU..ETHIOPIC SYLLABLE DZO
|
||||
AB20..AB26 ; 6.0 # [7] ETHIOPIC SYLLABLE CCHHA..ETHIOPIC SYLLABLE CCHHO
|
||||
AB28..AB2E ; 6.0 # [7] ETHIOPIC SYLLABLE BBA..ETHIOPIC SYLLABLE BBO
|
||||
FBB2..FBC1 ; 6.0 # [16] ARABIC SYMBOL DOT ABOVE..ARABIC SYMBOL SMALL TAH BELOW
|
||||
11000..1104D ; 6.0 # [78] BRAHMI SIGN CANDRABINDU..BRAHMI PUNCTUATION LOTUS
|
||||
11052..1106F ; 6.0 # [30] BRAHMI NUMBER ONE..BRAHMI DIGIT NINE
|
||||
16800..16A38 ; 6.0 # [569] BAMUM LETTER PHASE-A NGKUE MFON..BAMUM LETTER PHASE-F VUEQ
|
||||
1B000..1B001 ; 6.0 # [2] KATAKANA LETTER ARCHAIC E..HIRAGANA LETTER ARCHAIC YE
|
||||
1F0A0..1F0AE ; 6.0 # [15] PLAYING CARD BACK..PLAYING CARD KING OF SPADES
|
||||
1F0B1..1F0BE ; 6.0 # [14] PLAYING CARD ACE OF HEARTS..PLAYING CARD KING OF HEARTS
|
||||
1F0C1..1F0CF ; 6.0 # [15] PLAYING CARD ACE OF DIAMONDS..PLAYING CARD BLACK JOKER
|
||||
1F0D1..1F0DF ; 6.0 # [15] PLAYING CARD ACE OF CLUBS..PLAYING CARD WHITE JOKER
|
||||
1F130 ; 6.0 # SQUARED LATIN CAPITAL LETTER A
|
||||
1F132..1F13C ; 6.0 # [11] SQUARED LATIN CAPITAL LETTER C..SQUARED LATIN CAPITAL LETTER M
|
||||
1F13E ; 6.0 # SQUARED LATIN CAPITAL LETTER O
|
||||
1F140..1F141 ; 6.0 # [2] SQUARED LATIN CAPITAL LETTER Q..SQUARED LATIN CAPITAL LETTER R
|
||||
1F143..1F145 ; 6.0 # [3] SQUARED LATIN CAPITAL LETTER T..SQUARED LATIN CAPITAL LETTER V
|
||||
1F147..1F149 ; 6.0 # [3] SQUARED LATIN CAPITAL LETTER X..SQUARED LATIN CAPITAL LETTER Z
|
||||
1F14F..1F156 ; 6.0 # [8] SQUARED WC..NEGATIVE CIRCLED LATIN CAPITAL LETTER G
|
||||
1F158..1F15E ; 6.0 # [7] NEGATIVE CIRCLED LATIN CAPITAL LETTER I..NEGATIVE CIRCLED LATIN CAPITAL LETTER O
|
||||
1F160..1F169 ; 6.0 # [10] NEGATIVE CIRCLED LATIN CAPITAL LETTER Q..NEGATIVE CIRCLED LATIN CAPITAL LETTER Z
|
||||
1F170..1F178 ; 6.0 # [9] NEGATIVE SQUARED LATIN CAPITAL LETTER A..NEGATIVE SQUARED LATIN CAPITAL LETTER I
|
||||
1F17A ; 6.0 # NEGATIVE SQUARED LATIN CAPITAL LETTER K
|
||||
1F17D..1F17E ; 6.0 # [2] NEGATIVE SQUARED LATIN CAPITAL LETTER N..NEGATIVE SQUARED LATIN CAPITAL LETTER O
|
||||
1F180..1F189 ; 6.0 # [10] NEGATIVE SQUARED LATIN CAPITAL LETTER Q..NEGATIVE SQUARED LATIN CAPITAL LETTER Z
|
||||
1F18E..1F18F ; 6.0 # [2] NEGATIVE SQUARED AB..NEGATIVE SQUARED WC
|
||||
1F191..1F19A ; 6.0 # [10] SQUARED CL..SQUARED VS
|
||||
1F1E6..1F1FF ; 6.0 # [26] REGIONAL INDICATOR SYMBOL LETTER A..REGIONAL INDICATOR SYMBOL LETTER Z
|
||||
1F201..1F202 ; 6.0 # [2] SQUARED KATAKANA KOKO..SQUARED KATAKANA SA
|
||||
1F232..1F23A ; 6.0 # [9] SQUARED CJK UNIFIED IDEOGRAPH-7981..SQUARED CJK UNIFIED IDEOGRAPH-55B6
|
||||
1F250..1F251 ; 6.0 # [2] CIRCLED IDEOGRAPH ADVANTAGE..CIRCLED IDEOGRAPH ACCEPT
|
||||
1F300..1F320 ; 6.0 # [33] CYCLONE..SHOOTING STAR
|
||||
1F330..1F335 ; 6.0 # [6] CHESTNUT..CACTUS
|
||||
1F337..1F37C ; 6.0 # [70] TULIP..BABY BOTTLE
|
||||
1F380..1F393 ; 6.0 # [20] RIBBON..GRADUATION CAP
|
||||
1F3A0..1F3C4 ; 6.0 # [37] CAROUSEL HORSE..SURFER
|
||||
1F3C6..1F3CA ; 6.0 # [5] TROPHY..SWIMMER
|
||||
1F3E0..1F3F0 ; 6.0 # [17] HOUSE BUILDING..EUROPEAN CASTLE
|
||||
1F400..1F43E ; 6.0 # [63] RAT..PAW PRINTS
|
||||
1F440 ; 6.0 # EYES
|
||||
1F442..1F4F7 ; 6.0 # [182] EAR..CAMERA
|
||||
1F4F9..1F4FC ; 6.0 # [4] VIDEO CAMERA..VIDEOCASSETTE
|
||||
1F500..1F53D ; 6.0 # [62] TWISTED RIGHTWARDS ARROWS..DOWN-POINTING SMALL RED TRIANGLE
|
||||
1F550..1F567 ; 6.0 # [24] CLOCK FACE ONE OCLOCK..CLOCK FACE TWELVE-THIRTY
|
||||
1F5FB..1F5FF ; 6.0 # [5] MOUNT FUJI..MOYAI
|
||||
1F601..1F610 ; 6.0 # [16] GRINNING FACE WITH SMILING EYES..NEUTRAL FACE
|
||||
1F612..1F614 ; 6.0 # [3] UNAMUSED FACE..PENSIVE FACE
|
||||
1F616 ; 6.0 # CONFOUNDED FACE
|
||||
1F618 ; 6.0 # FACE THROWING A KISS
|
||||
1F61A ; 6.0 # KISSING FACE WITH CLOSED EYES
|
||||
1F61C..1F61E ; 6.0 # [3] FACE WITH STUCK-OUT TONGUE AND WINKING EYE..DISAPPOINTED FACE
|
||||
1F620..1F625 ; 6.0 # [6] ANGRY FACE..DISAPPOINTED BUT RELIEVED FACE
|
||||
1F628..1F62B ; 6.0 # [4] FEARFUL FACE..TIRED FACE
|
||||
1F62D ; 6.0 # LOUDLY CRYING FACE
|
||||
1F630..1F633 ; 6.0 # [4] FACE WITH OPEN MOUTH AND COLD SWEAT..FLUSHED FACE
|
||||
1F635..1F640 ; 6.0 # [12] DIZZY FACE..WEARY CAT FACE
|
||||
1F645..1F64F ; 6.0 # [11] FACE WITH NO GOOD GESTURE..PERSON WITH FOLDED HANDS
|
||||
1F680..1F6C5 ; 6.0 # [70] ROCKET..LEFT LUGGAGE
|
||||
1F700..1F773 ; 6.0 # [116] ALCHEMICAL SYMBOL FOR QUINTESSENCE..ALCHEMICAL SYMBOL FOR HALF OUNCE
|
||||
2B740..2B81D ; 6.0 # [222] CJK UNIFIED IDEOGRAPH-2B740..CJK UNIFIED IDEOGRAPH-2B81D
|
||||
|
||||
# Total code points: 2088
|
||||
|
||||
# ================================================
|
||||
|
||||
# Age=V6_1
|
||||
|
||||
# Newly assigned in Unicode 6.1.0 (January, 2012)
|
||||
|
||||
058F ; 6.1 # ARMENIAN DRAM SIGN
|
||||
0604 ; 6.1 # ARABIC SIGN SAMVAT
|
||||
08A0 ; 6.1 # ARABIC LETTER BEH WITH SMALL V BELOW
|
||||
08A2..08AC ; 6.1 # [11] ARABIC LETTER JEEM WITH TWO DOTS ABOVE..ARABIC LETTER ROHINGYA YEH
|
||||
08E4..08FE ; 6.1 # [27] ARABIC CURLY FATHA..ARABIC DAMMA WITH DOT
|
||||
0AF0 ; 6.1 # GUJARATI ABBREVIATION SIGN
|
||||
0EDE..0EDF ; 6.1 # [2] LAO LETTER KHMU GO..LAO LETTER KHMU NYO
|
||||
10C7 ; 6.1 # GEORGIAN CAPITAL LETTER YN
|
||||
10CD ; 6.1 # GEORGIAN CAPITAL LETTER AEN
|
||||
10FD..10FF ; 6.1 # [3] GEORGIAN LETTER AEN..GEORGIAN LETTER LABIAL SIGN
|
||||
1BAB..1BAD ; 6.1 # [3] SUNDANESE SIGN VIRAMA..SUNDANESE CONSONANT SIGN PASANGAN WA
|
||||
1BBA..1BBF ; 6.1 # [6] SUNDANESE AVAGRAHA..SUNDANESE LETTER FINAL M
|
||||
1CC0..1CC7 ; 6.1 # [8] SUNDANESE PUNCTUATION BINDU SURYA..SUNDANESE PUNCTUATION BINDU BA SATANGA
|
||||
1CF3..1CF6 ; 6.1 # [4] VEDIC SIGN ROTATED ARDHAVISARGA..VEDIC SIGN UPADHMANIYA
|
||||
27CB ; 6.1 # MATHEMATICAL RISING DIAGONAL
|
||||
27CD ; 6.1 # MATHEMATICAL FALLING DIAGONAL
|
||||
2CF2..2CF3 ; 6.1 # [2] COPTIC CAPITAL LETTER BOHAIRIC KHEI..COPTIC SMALL LETTER BOHAIRIC KHEI
|
||||
2D27 ; 6.1 # GEORGIAN SMALL LETTER YN
|
||||
2D2D ; 6.1 # GEORGIAN SMALL LETTER AEN
|
||||
2D66..2D67 ; 6.1 # [2] TIFINAGH LETTER YE..TIFINAGH LETTER YO
|
||||
2E32..2E3B ; 6.1 # [10] TURNED COMMA..THREE-EM DASH
|
||||
9FCC ; 6.1 # CJK UNIFIED IDEOGRAPH-9FCC
|
||||
A674..A67B ; 6.1 # [8] COMBINING CYRILLIC LETTER UKRAINIAN IE..COMBINING CYRILLIC LETTER OMEGA
|
||||
A69F ; 6.1 # COMBINING CYRILLIC LETTER IOTIFIED E
|
||||
A792..A793 ; 6.1 # [2] LATIN CAPITAL LETTER C WITH BAR..LATIN SMALL LETTER C WITH BAR
|
||||
A7AA ; 6.1 # LATIN CAPITAL LETTER H WITH HOOK
|
||||
A7F8..A7F9 ; 6.1 # [2] MODIFIER LETTER CAPITAL H WITH STROKE..MODIFIER LETTER SMALL LIGATURE OE
|
||||
AAE0..AAF6 ; 6.1 # [23] MEETEI MAYEK LETTER E..MEETEI MAYEK VIRAMA
|
||||
FA2E..FA2F ; 6.1 # [2] CJK COMPATIBILITY IDEOGRAPH-FA2E..CJK COMPATIBILITY IDEOGRAPH-FA2F
|
||||
10980..109B7 ; 6.1 # [56] MEROITIC HIEROGLYPHIC LETTER A..MEROITIC CURSIVE LETTER DA
|
||||
109BE..109BF ; 6.1 # [2] MEROITIC CURSIVE LOGOGRAM RMT..MEROITIC CURSIVE LOGOGRAM IMN
|
||||
110D0..110E8 ; 6.1 # [25] SORA SOMPENG LETTER SAH..SORA SOMPENG LETTER MAE
|
||||
110F0..110F9 ; 6.1 # [10] SORA SOMPENG DIGIT ZERO..SORA SOMPENG DIGIT NINE
|
||||
11100..11134 ; 6.1 # [53] CHAKMA SIGN CANDRABINDU..CHAKMA MAAYYAA
|
||||
11136..11143 ; 6.1 # [14] CHAKMA DIGIT ZERO..CHAKMA QUESTION MARK
|
||||
11180..111C8 ; 6.1 # [73] SHARADA SIGN CANDRABINDU..SHARADA SEPARATOR
|
||||
111D0..111D9 ; 6.1 # [10] SHARADA DIGIT ZERO..SHARADA DIGIT NINE
|
||||
11680..116B7 ; 6.1 # [56] TAKRI LETTER A..TAKRI SIGN NUKTA
|
||||
116C0..116C9 ; 6.1 # [10] TAKRI DIGIT ZERO..TAKRI DIGIT NINE
|
||||
16F00..16F44 ; 6.1 # [69] MIAO LETTER PA..MIAO LETTER HHA
|
||||
16F50..16F7E ; 6.1 # [47] MIAO LETTER NASALIZATION..MIAO VOWEL SIGN NG
|
||||
16F8F..16F9F ; 6.1 # [17] MIAO TONE RIGHT..MIAO LETTER REFORMED TONE-8
|
||||
1EE00..1EE03 ; 6.1 # [4] ARABIC MATHEMATICAL ALEF..ARABIC MATHEMATICAL DAL
|
||||
1EE05..1EE1F ; 6.1 # [27] ARABIC MATHEMATICAL WAW..ARABIC MATHEMATICAL DOTLESS QAF
|
||||
1EE21..1EE22 ; 6.1 # [2] ARABIC MATHEMATICAL INITIAL BEH..ARABIC MATHEMATICAL INITIAL JEEM
|
||||
1EE24 ; 6.1 # ARABIC MATHEMATICAL INITIAL HEH
|
||||
1EE27 ; 6.1 # ARABIC MATHEMATICAL INITIAL HAH
|
||||
1EE29..1EE32 ; 6.1 # [10] ARABIC MATHEMATICAL INITIAL YEH..ARABIC MATHEMATICAL INITIAL QAF
|
||||
1EE34..1EE37 ; 6.1 # [4] ARABIC MATHEMATICAL INITIAL SHEEN..ARABIC MATHEMATICAL INITIAL KHAH
|
||||
1EE39 ; 6.1 # ARABIC MATHEMATICAL INITIAL DAD
|
||||
1EE3B ; 6.1 # ARABIC MATHEMATICAL INITIAL GHAIN
|
||||
1EE42 ; 6.1 # ARABIC MATHEMATICAL TAILED JEEM
|
||||
1EE47 ; 6.1 # ARABIC MATHEMATICAL TAILED HAH
|
||||
1EE49 ; 6.1 # ARABIC MATHEMATICAL TAILED YEH
|
||||
1EE4B ; 6.1 # ARABIC MATHEMATICAL TAILED LAM
|
||||
1EE4D..1EE4F ; 6.1 # [3] ARABIC MATHEMATICAL TAILED NOON..ARABIC MATHEMATICAL TAILED AIN
|
||||
1EE51..1EE52 ; 6.1 # [2] ARABIC MATHEMATICAL TAILED SAD..ARABIC MATHEMATICAL TAILED QAF
|
||||
1EE54 ; 6.1 # ARABIC MATHEMATICAL TAILED SHEEN
|
||||
1EE57 ; 6.1 # ARABIC MATHEMATICAL TAILED KHAH
|
||||
1EE59 ; 6.1 # ARABIC MATHEMATICAL TAILED DAD
|
||||
1EE5B ; 6.1 # ARABIC MATHEMATICAL TAILED GHAIN
|
||||
1EE5D ; 6.1 # ARABIC MATHEMATICAL TAILED DOTLESS NOON
|
||||
1EE5F ; 6.1 # ARABIC MATHEMATICAL TAILED DOTLESS QAF
|
||||
1EE61..1EE62 ; 6.1 # [2] ARABIC MATHEMATICAL STRETCHED BEH..ARABIC MATHEMATICAL STRETCHED JEEM
|
||||
1EE64 ; 6.1 # ARABIC MATHEMATICAL STRETCHED HEH
|
||||
1EE67..1EE6A ; 6.1 # [4] ARABIC MATHEMATICAL STRETCHED HAH..ARABIC MATHEMATICAL STRETCHED KAF
|
||||
1EE6C..1EE72 ; 6.1 # [7] ARABIC MATHEMATICAL STRETCHED MEEM..ARABIC MATHEMATICAL STRETCHED QAF
|
||||
1EE74..1EE77 ; 6.1 # [4] ARABIC MATHEMATICAL STRETCHED SHEEN..ARABIC MATHEMATICAL STRETCHED KHAH
|
||||
1EE79..1EE7C ; 6.1 # [4] ARABIC MATHEMATICAL STRETCHED DAD..ARABIC MATHEMATICAL STRETCHED DOTLESS BEH
|
||||
1EE7E ; 6.1 # ARABIC MATHEMATICAL STRETCHED DOTLESS FEH
|
||||
1EE80..1EE89 ; 6.1 # [10] ARABIC MATHEMATICAL LOOPED ALEF..ARABIC MATHEMATICAL LOOPED YEH
|
||||
1EE8B..1EE9B ; 6.1 # [17] ARABIC MATHEMATICAL LOOPED LAM..ARABIC MATHEMATICAL LOOPED GHAIN
|
||||
1EEA1..1EEA3 ; 6.1 # [3] ARABIC MATHEMATICAL DOUBLE-STRUCK BEH..ARABIC MATHEMATICAL DOUBLE-STRUCK DAL
|
||||
1EEA5..1EEA9 ; 6.1 # [5] ARABIC MATHEMATICAL DOUBLE-STRUCK WAW..ARABIC MATHEMATICAL DOUBLE-STRUCK YEH
|
||||
1EEAB..1EEBB ; 6.1 # [17] ARABIC MATHEMATICAL DOUBLE-STRUCK LAM..ARABIC MATHEMATICAL DOUBLE-STRUCK GHAIN
|
||||
1EEF0..1EEF1 ; 6.1 # [2] ARABIC MATHEMATICAL OPERATOR MEEM WITH HAH WITH TATWEEL..ARABIC MATHEMATICAL OPERATOR HAH WITH DAL
|
||||
1F16A..1F16B ; 6.1 # [2] RAISED MC SIGN..RAISED MD SIGN
|
||||
1F540..1F543 ; 6.1 # [4] CIRCLED CROSS POMMEE..NOTCHED LEFT SEMICIRCLE WITH THREE DOTS
|
||||
1F600 ; 6.1 # GRINNING FACE
|
||||
1F611 ; 6.1 # EXPRESSIONLESS FACE
|
||||
1F615 ; 6.1 # CONFUSED FACE
|
||||
1F617 ; 6.1 # KISSING FACE
|
||||
1F619 ; 6.1 # KISSING FACE WITH SMILING EYES
|
||||
1F61B ; 6.1 # FACE WITH STUCK-OUT TONGUE
|
||||
1F61F ; 6.1 # WORRIED FACE
|
||||
1F626..1F627 ; 6.1 # [2] FROWNING FACE WITH OPEN MOUTH..ANGUISHED FACE
|
||||
1F62C ; 6.1 # GRIMACING FACE
|
||||
1F62E..1F62F ; 6.1 # [2] FACE WITH OPEN MOUTH..HUSHED FACE
|
||||
1F634 ; 6.1 # SLEEPING FACE
|
||||
|
||||
# Total code points: 732
|
||||
|
||||
# EOF
|
||||
|
File diff suppressed because it is too large
@ -1,10 +1,10 @@
|
||||
# GraphemeBreakProperty-5.0.0.txt
|
||||
# Date: 2006-03-09, 23:14:04 GMT [MD]
|
||||
# GraphemeBreakProperty-6.1.0.txt
|
||||
# Date: 2011-12-05, 16:44:15 GMT [MD]
|
||||
#
|
||||
# Unicode Character Database
|
||||
# Copyright (c) 1991-2006 Unicode, Inc.
|
||||
# Copyright (c) 1991-2011 Unicode, Inc.
|
||||
# For terms of use, see http://www.unicode.org/terms_of_use.html
|
||||
# For documentation, see UCD.html
|
||||
# For documentation, see http://www.unicode.org/reports/tr44/
|
||||
|
||||
# ================================================
|
||||
|
||||
@ -34,40 +34,46 @@
|
||||
000E..001F ; Control # Cc [18] <control-000E>..<control-001F>
|
||||
007F..009F ; Control # Cc [33] <control-007F>..<control-009F>
|
||||
00AD ; Control # Cf SOFT HYPHEN
|
||||
0600..0603 ; Control # Cf [4] ARABIC NUMBER SIGN..ARABIC SIGN SAFHA
|
||||
0600..0604 ; Control # Cf [5] ARABIC NUMBER SIGN..ARABIC SIGN SAMVAT
|
||||
06DD ; Control # Cf ARABIC END OF AYAH
|
||||
070F ; Control # Cf SYRIAC ABBREVIATION MARK
|
||||
17B4..17B5 ; Control # Cf [2] KHMER VOWEL INHERENT AQ..KHMER VOWEL INHERENT AA
|
||||
200B ; Control # Cf ZERO WIDTH SPACE
|
||||
200E..200F ; Control # Cf [2] LEFT-TO-RIGHT MARK..RIGHT-TO-LEFT MARK
|
||||
2028 ; Control # Zl LINE SEPARATOR
|
||||
2029 ; Control # Zp PARAGRAPH SEPARATOR
|
||||
202A..202E ; Control # Cf [5] LEFT-TO-RIGHT EMBEDDING..RIGHT-TO-LEFT OVERRIDE
|
||||
2060..2063 ; Control # Cf [4] WORD JOINER..INVISIBLE SEPARATOR
|
||||
2060..2064 ; Control # Cf [5] WORD JOINER..INVISIBLE PLUS
|
||||
2065..2069 ; Control # Cn [5] <reserved-2065>..<reserved-2069>
|
||||
206A..206F ; Control # Cf [6] INHIBIT SYMMETRIC SWAPPING..NOMINAL DIGIT SHAPES
|
||||
D800..DFFF ; Control # Cs [2048] <surrogate-D800>..<surrogate-DFFF>
|
||||
FEFF ; Control # Cf ZERO WIDTH NO-BREAK SPACE
|
||||
FFF0..FFF8 ; Control # Cn [9] <reserved-FFF0>..<reserved-FFF8>
|
||||
FFF9..FFFB ; Control # Cf [3] INTERLINEAR ANNOTATION ANCHOR..INTERLINEAR ANNOTATION TERMINATOR
|
||||
110BD ; Control # Cf KAITHI NUMBER SIGN
|
||||
1D173..1D17A ; Control # Cf [8] MUSICAL SYMBOL BEGIN BEAM..MUSICAL SYMBOL END PHRASE
|
||||
E0000 ; Control # Cn <reserved-E0000>
|
||||
E0001 ; Control # Cf LANGUAGE TAG
|
||||
E0002..E001F ; Control # Cn [30] <reserved-E0002>..<reserved-E001F>
|
||||
E0020..E007F ; Control # Cf [96] TAG SPACE..CANCEL TAG
|
||||
E0080..E00FF ; Control # Cn [128] <reserved-E0080>..<reserved-E00FF>
|
||||
E01F0..E0FFF ; Control # Cn [3600] <reserved-E01F0>..<reserved-E0FFF>
|
||||
|
||||
# Total code points: 201
|
||||
# Total code points: 6023
|
||||
|
||||
# ================================================
|
||||
|
||||
0300..036F ; Extend # Mn [112] COMBINING GRAVE ACCENT..COMBINING LATIN SMALL LETTER X
|
||||
0483..0486 ; Extend # Mn [4] COMBINING CYRILLIC TITLO..COMBINING CYRILLIC PSILI PNEUMATA
|
||||
0483..0487 ; Extend # Mn [5] COMBINING CYRILLIC TITLO..COMBINING CYRILLIC POKRYTIE
|
||||
0488..0489 ; Extend # Me [2] COMBINING CYRILLIC HUNDRED THOUSANDS SIGN..COMBINING CYRILLIC MILLIONS SIGN
|
||||
0591..05BD ; Extend # Mn [45] HEBREW ACCENT ETNAHTA..HEBREW POINT METEG
|
||||
05BF ; Extend # Mn HEBREW POINT RAFE
|
||||
05C1..05C2 ; Extend # Mn [2] HEBREW POINT SHIN DOT..HEBREW POINT SIN DOT
|
||||
05C4..05C5 ; Extend # Mn [2] HEBREW MARK UPPER DOT..HEBREW MARK LOWER DOT
|
||||
05C7 ; Extend # Mn HEBREW POINT QAMATS QATAN
|
||||
0610..0615 ; Extend # Mn [6] ARABIC SIGN SALLALLAHOU ALAYHE WASSALLAM..ARABIC SMALL HIGH TAH
|
||||
064B..065E ; Extend # Mn [20] ARABIC FATHATAN..ARABIC FATHA WITH TWO DOTS
|
||||
0610..061A ; Extend # Mn [11] ARABIC SIGN SALLALLAHOU ALAYHE WASSALLAM..ARABIC SMALL KASRA
|
||||
064B..065F ; Extend # Mn [21] ARABIC FATHATAN..ARABIC WAVY HAMZA BELOW
|
||||
0670 ; Extend # Mn ARABIC LETTER SUPERSCRIPT ALEF
|
||||
06D6..06DC ; Extend # Mn [7] ARABIC SMALL HIGH LIGATURE SAD WITH LAM WITH ALEF MAKSURA..ARABIC SMALL HIGH SEEN
|
||||
06DE ; Extend # Me ARABIC START OF RUB EL HIZB
|
||||
06DF..06E4 ; Extend # Mn [6] ARABIC SMALL HIGH ROUNDED ZERO..ARABIC SMALL HIGH MADDA
|
||||
06E7..06E8 ; Extend # Mn [2] ARABIC SMALL HIGH YEH..ARABIC SMALL HIGH NOON
|
||||
06EA..06ED ; Extend # Mn [4] ARABIC EMPTY CENTRE LOW STOP..ARABIC SMALL LOW MEEM
|
||||
@ -75,11 +81,18 @@ E0020..E007F ; Control # Cf [96] TAG SPACE..CANCEL TAG
|
||||
0730..074A ; Extend # Mn [27] SYRIAC PTHAHA ABOVE..SYRIAC BARREKH
|
||||
07A6..07B0 ; Extend # Mn [11] THAANA ABAFILI..THAANA SUKUN
|
||||
07EB..07F3 ; Extend # Mn [9] NKO COMBINING SHORT HIGH TONE..NKO COMBINING DOUBLE DOT ABOVE
|
||||
0901..0902 ; Extend # Mn [2] DEVANAGARI SIGN CANDRABINDU..DEVANAGARI SIGN ANUSVARA
|
||||
0816..0819 ; Extend # Mn [4] SAMARITAN MARK IN..SAMARITAN MARK DAGESH
|
||||
081B..0823 ; Extend # Mn [9] SAMARITAN MARK EPENTHETIC YUT..SAMARITAN VOWEL SIGN A
|
||||
0825..0827 ; Extend # Mn [3] SAMARITAN VOWEL SIGN SHORT A..SAMARITAN VOWEL SIGN U
|
||||
0829..082D ; Extend # Mn [5] SAMARITAN VOWEL SIGN LONG I..SAMARITAN MARK NEQUDAA
|
||||
0859..085B ; Extend # Mn [3] MANDAIC AFFRICATION MARK..MANDAIC GEMINATION MARK
|
||||
08E4..08FE ; Extend # Mn [27] ARABIC CURLY FATHA..ARABIC DAMMA WITH DOT
|
||||
0900..0902 ; Extend # Mn [3] DEVANAGARI SIGN INVERTED CANDRABINDU..DEVANAGARI SIGN ANUSVARA
|
||||
093A ; Extend # Mn DEVANAGARI VOWEL SIGN OE
|
||||
093C ; Extend # Mn DEVANAGARI SIGN NUKTA
|
||||
0941..0948 ; Extend # Mn [8] DEVANAGARI VOWEL SIGN U..DEVANAGARI VOWEL SIGN AI
|
||||
094D ; Extend # Mn DEVANAGARI SIGN VIRAMA
|
||||
0951..0954 ; Extend # Mn [4] DEVANAGARI STRESS SIGN UDATTA..DEVANAGARI ACUTE ACCENT
|
||||
0951..0957 ; Extend # Mn [7] DEVANAGARI STRESS SIGN UDATTA..DEVANAGARI VOWEL SIGN UUE
|
||||
0962..0963 ; Extend # Mn [2] DEVANAGARI VOWEL SIGN VOCALIC L..DEVANAGARI VOWEL SIGN VOCALIC LL
|
||||
0981 ; Extend # Mn BENGALI SIGN CANDRABINDU
|
||||
09BC ; Extend # Mn BENGALI SIGN NUKTA
|
||||
@ -93,7 +106,9 @@ E0020..E007F ; Control # Cf [96] TAG SPACE..CANCEL TAG
|
||||
0A41..0A42 ; Extend # Mn [2] GURMUKHI VOWEL SIGN U..GURMUKHI VOWEL SIGN UU
|
||||
0A47..0A48 ; Extend # Mn [2] GURMUKHI VOWEL SIGN EE..GURMUKHI VOWEL SIGN AI
|
||||
0A4B..0A4D ; Extend # Mn [3] GURMUKHI VOWEL SIGN OO..GURMUKHI SIGN VIRAMA
|
||||
0A51 ; Extend # Mn GURMUKHI SIGN UDAAT
|
||||
0A70..0A71 ; Extend # Mn [2] GURMUKHI TIPPI..GURMUKHI ADDAK
|
||||
0A75 ; Extend # Mn GURMUKHI SIGN YAKASH
|
||||
0A81..0A82 ; Extend # Mn [2] GUJARATI SIGN CANDRABINDU..GUJARATI SIGN ANUSVARA
|
||||
0ABC ; Extend # Mn GUJARATI SIGN NUKTA
|
||||
0AC1..0AC5 ; Extend # Mn [5] GUJARATI VOWEL SIGN U..GUJARATI VOWEL SIGN CANDRA E
|
||||
@ -104,10 +119,11 @@ E0020..E007F ; Control # Cf [96] TAG SPACE..CANCEL TAG
|
||||
0B3C ; Extend # Mn ORIYA SIGN NUKTA
|
||||
0B3E ; Extend # Mc ORIYA VOWEL SIGN AA
|
||||
0B3F ; Extend # Mn ORIYA VOWEL SIGN I
|
||||
0B41..0B43 ; Extend # Mn [3] ORIYA VOWEL SIGN U..ORIYA VOWEL SIGN VOCALIC R
|
||||
0B41..0B44 ; Extend # Mn [4] ORIYA VOWEL SIGN U..ORIYA VOWEL SIGN VOCALIC RR
|
||||
0B4D ; Extend # Mn ORIYA SIGN VIRAMA
|
||||
0B56 ; Extend # Mn ORIYA AI LENGTH MARK
|
||||
0B57 ; Extend # Mc ORIYA AU LENGTH MARK
|
||||
0B62..0B63 ; Extend # Mn [2] ORIYA VOWEL SIGN VOCALIC L..ORIYA VOWEL SIGN VOCALIC LL
|
||||
0B82 ; Extend # Mn TAMIL SIGN ANUSVARA
|
||||
0BBE ; Extend # Mc TAMIL VOWEL SIGN AA
|
||||
0BC0 ; Extend # Mn TAMIL VOWEL SIGN II
|
||||
@ -117,6 +133,7 @@ E0020..E007F ; Control # Cf [96] TAG SPACE..CANCEL TAG
|
||||
0C46..0C48 ; Extend # Mn [3] TELUGU VOWEL SIGN E..TELUGU VOWEL SIGN AI
|
||||
0C4A..0C4D ; Extend # Mn [4] TELUGU VOWEL SIGN O..TELUGU SIGN VIRAMA
|
||||
0C55..0C56 ; Extend # Mn [2] TELUGU LENGTH MARK..TELUGU AI LENGTH MARK
|
||||
0C62..0C63 ; Extend # Mn [2] TELUGU VOWEL SIGN VOCALIC L..TELUGU VOWEL SIGN VOCALIC LL
|
||||
0CBC ; Extend # Mn KANNADA SIGN NUKTA
|
||||
0CBF ; Extend # Mn KANNADA VOWEL SIGN I
|
||||
0CC2 ; Extend # Mc KANNADA VOWEL SIGN UU
|
||||
@ -125,9 +142,10 @@ E0020..E007F ; Control # Cf [96] TAG SPACE..CANCEL TAG
|
||||
0CD5..0CD6 ; Extend # Mc [2] KANNADA LENGTH MARK..KANNADA AI LENGTH MARK
|
||||
0CE2..0CE3 ; Extend # Mn [2] KANNADA VOWEL SIGN VOCALIC L..KANNADA VOWEL SIGN VOCALIC LL
|
||||
0D3E ; Extend # Mc MALAYALAM VOWEL SIGN AA
|
||||
0D41..0D43 ; Extend # Mn [3] MALAYALAM VOWEL SIGN U..MALAYALAM VOWEL SIGN VOCALIC R
|
||||
0D41..0D44 ; Extend # Mn [4] MALAYALAM VOWEL SIGN U..MALAYALAM VOWEL SIGN VOCALIC RR
|
||||
0D4D ; Extend # Mn MALAYALAM SIGN VIRAMA
|
||||
0D57 ; Extend # Mc MALAYALAM AU LENGTH MARK
|
||||
0D62..0D63 ; Extend # Mn [2] MALAYALAM VOWEL SIGN VOCALIC L..MALAYALAM VOWEL SIGN VOCALIC LL
|
||||
0DCA ; Extend # Mn SINHALA SIGN AL-LAKUNA
|
||||
0DCF ; Extend # Mc SINHALA VOWEL SIGN AELA-PILLA
|
||||
0DD2..0DD4 ; Extend # Mn [3] SINHALA VOWEL SIGN KETTI IS-PILLA..SINHALA VOWEL SIGN KETTI PAA-PILLA
|
||||
@ -147,19 +165,26 @@ E0020..E007F ; Control # Cf [96] TAG SPACE..CANCEL TAG
|
||||
0F71..0F7E ; Extend # Mn [14] TIBETAN VOWEL SIGN AA..TIBETAN SIGN RJES SU NGA RO
|
||||
0F80..0F84 ; Extend # Mn [5] TIBETAN VOWEL SIGN REVERSED I..TIBETAN MARK HALANTA
|
||||
0F86..0F87 ; Extend # Mn [2] TIBETAN SIGN LCI RTAGS..TIBETAN SIGN YANG RTAGS
|
||||
0F90..0F97 ; Extend # Mn [8] TIBETAN SUBJOINED LETTER KA..TIBETAN SUBJOINED LETTER JA
|
||||
0F8D..0F97 ; Extend # Mn [11] TIBETAN SUBJOINED SIGN LCE TSA CAN..TIBETAN SUBJOINED LETTER JA
|
||||
0F99..0FBC ; Extend # Mn [36] TIBETAN SUBJOINED LETTER NYA..TIBETAN SUBJOINED LETTER FIXED-FORM RA
|
||||
0FC6 ; Extend # Mn TIBETAN SYMBOL PADMA GDAN
|
||||
102D..1030 ; Extend # Mn [4] MYANMAR VOWEL SIGN I..MYANMAR VOWEL SIGN UU
|
||||
1032 ; Extend # Mn MYANMAR VOWEL SIGN AI
|
||||
1036..1037 ; Extend # Mn [2] MYANMAR SIGN ANUSVARA..MYANMAR SIGN DOT BELOW
|
||||
1039 ; Extend # Mn MYANMAR SIGN VIRAMA
|
||||
1032..1037 ; Extend # Mn [6] MYANMAR VOWEL SIGN AI..MYANMAR SIGN DOT BELOW
|
||||
1039..103A ; Extend # Mn [2] MYANMAR SIGN VIRAMA..MYANMAR SIGN ASAT
|
||||
103D..103E ; Extend # Mn [2] MYANMAR CONSONANT SIGN MEDIAL WA..MYANMAR CONSONANT SIGN MEDIAL HA
|
||||
1058..1059 ; Extend # Mn [2] MYANMAR VOWEL SIGN VOCALIC L..MYANMAR VOWEL SIGN VOCALIC LL
|
||||
135F ; Extend # Mn ETHIOPIC COMBINING GEMINATION MARK
|
||||
105E..1060 ; Extend # Mn [3] MYANMAR CONSONANT SIGN MON MEDIAL NA..MYANMAR CONSONANT SIGN MON MEDIAL LA
|
||||
1071..1074 ; Extend # Mn [4] MYANMAR VOWEL SIGN GEBA KAREN I..MYANMAR VOWEL SIGN KAYAH EE
|
||||
1082 ; Extend # Mn MYANMAR CONSONANT SIGN SHAN MEDIAL WA
|
||||
1085..1086 ; Extend # Mn [2] MYANMAR VOWEL SIGN SHAN E ABOVE..MYANMAR VOWEL SIGN SHAN FINAL Y
|
||||
108D ; Extend # Mn MYANMAR SIGN SHAN COUNCIL EMPHATIC TONE
|
||||
109D ; Extend # Mn MYANMAR VOWEL SIGN AITON AI
|
||||
135D..135F ; Extend # Mn [3] ETHIOPIC COMBINING GEMINATION AND VOWEL LENGTH MARK..ETHIOPIC COMBINING GEMINATION MARK
|
||||
1712..1714 ; Extend # Mn [3] TAGALOG VOWEL SIGN I..TAGALOG SIGN VIRAMA
|
||||
1732..1734 ; Extend # Mn [3] HANUNOO VOWEL SIGN I..HANUNOO SIGN PAMUDPOD
|
||||
1752..1753 ; Extend # Mn [2] BUHID VOWEL SIGN I..BUHID VOWEL SIGN U
|
||||
1772..1773 ; Extend # Mn [2] TAGBANWA VOWEL SIGN I..TAGBANWA VOWEL SIGN U
|
||||
17B4..17B5 ; Extend # Mn [2] KHMER VOWEL INHERENT AQ..KHMER VOWEL INHERENT AA
|
||||
17B7..17BD ; Extend # Mn [7] KHMER VOWEL SIGN I..KHMER VOWEL SIGN UA
|
||||
17C6 ; Extend # Mn KHMER SIGN NIKAHIT
|
||||
17C9..17D3 ; Extend # Mn [11] KHMER SIGN MUUSIKATOAN..KHMER SIGN BATHAMASAT
|
||||
@ -171,33 +196,105 @@ E0020..E007F ; Control # Cf [96] TAG SPACE..CANCEL TAG
|
||||
1932 ; Extend # Mn LIMBU SMALL LETTER ANUSVARA
|
||||
1939..193B ; Extend # Mn [3] LIMBU SIGN MUKPHRENG..LIMBU SIGN SA-I
|
||||
1A17..1A18 ; Extend # Mn [2] BUGINESE VOWEL SIGN I..BUGINESE VOWEL SIGN U
|
||||
1A56 ; Extend # Mn TAI THAM CONSONANT SIGN MEDIAL LA
|
||||
1A58..1A5E ; Extend # Mn [7] TAI THAM SIGN MAI KANG LAI..TAI THAM CONSONANT SIGN SA
|
||||
1A60 ; Extend # Mn TAI THAM SIGN SAKOT
|
||||
1A62 ; Extend # Mn TAI THAM VOWEL SIGN MAI SAT
|
||||
1A65..1A6C ; Extend # Mn [8] TAI THAM VOWEL SIGN I..TAI THAM VOWEL SIGN OA BELOW
|
||||
1A73..1A7C ; Extend # Mn [10] TAI THAM VOWEL SIGN OA ABOVE..TAI THAM SIGN KHUEN-LUE KARAN
|
||||
1A7F ; Extend # Mn TAI THAM COMBINING CRYPTOGRAMMIC DOT
|
||||
1B00..1B03 ; Extend # Mn [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
|
||||
1B34 ; Extend # Mn BALINESE SIGN REREKAN
|
||||
1B36..1B3A ; Extend # Mn [5] BALINESE VOWEL SIGN ULU..BALINESE VOWEL SIGN RA REPA
|
||||
1B3C ; Extend # Mn BALINESE VOWEL SIGN LA LENGA
|
||||
1B42 ; Extend # Mn BALINESE VOWEL SIGN PEPET
|
||||
1B6B..1B73 ; Extend # Mn [9] BALINESE MUSICAL SYMBOL COMBINING TEGEH..BALINESE MUSICAL SYMBOL COMBINING GONG
|
||||
1DC0..1DCA ; Extend # Mn [11] COMBINING DOTTED GRAVE ACCENT..COMBINING LATIN SMALL LETTER R BELOW
|
||||
1DFE..1DFF ; Extend # Mn [2] COMBINING LEFT ARROWHEAD ABOVE..COMBINING RIGHT ARROWHEAD AND DOWN ARROWHEAD BELOW
|
||||
1B80..1B81 ; Extend # Mn [2] SUNDANESE SIGN PANYECEK..SUNDANESE SIGN PANGLAYAR
|
||||
1BA2..1BA5 ; Extend # Mn [4] SUNDANESE CONSONANT SIGN PANYAKRA..SUNDANESE VOWEL SIGN PANYUKU
|
||||
1BA8..1BA9 ; Extend # Mn [2] SUNDANESE VOWEL SIGN PAMEPET..SUNDANESE VOWEL SIGN PANEULEUNG
|
||||
1BAB ; Extend # Mn SUNDANESE SIGN VIRAMA
|
||||
1BE6 ; Extend # Mn BATAK SIGN TOMPI
|
||||
1BE8..1BE9 ; Extend # Mn [2] BATAK VOWEL SIGN PAKPAK E..BATAK VOWEL SIGN EE
|
||||
1BED ; Extend # Mn BATAK VOWEL SIGN KARO O
|
||||
1BEF..1BF1 ; Extend # Mn [3] BATAK VOWEL SIGN U FOR SIMALUNGUN SA..BATAK CONSONANT SIGN H
|
||||
1C2C..1C33 ; Extend # Mn [8] LEPCHA VOWEL SIGN E..LEPCHA CONSONANT SIGN T
|
||||
1C36..1C37 ; Extend # Mn [2] LEPCHA SIGN RAN..LEPCHA SIGN NUKTA
|
||||
1CD0..1CD2 ; Extend # Mn [3] VEDIC TONE KARSHANA..VEDIC TONE PRENKHA
|
||||
1CD4..1CE0 ; Extend # Mn [13] VEDIC SIGN YAJURVEDIC MIDLINE SVARITA..VEDIC TONE RIGVEDIC KASHMIRI INDEPENDENT SVARITA
|
||||
1CE2..1CE8 ; Extend # Mn [7] VEDIC SIGN VISARGA SVARITA..VEDIC SIGN VISARGA ANUDATTA WITH TAIL
|
||||
1CED ; Extend # Mn VEDIC SIGN TIRYAK
|
||||
1CF4 ; Extend # Mn VEDIC TONE CANDRA ABOVE
|
||||
1DC0..1DE6 ; Extend # Mn [39] COMBINING DOTTED GRAVE ACCENT..COMBINING LATIN SMALL LETTER Z
|
||||
1DFC..1DFF ; Extend # Mn [4] COMBINING DOUBLE INVERTED BREVE BELOW..COMBINING RIGHT ARROWHEAD AND DOWN ARROWHEAD BELOW
|
||||
200C..200D ; Extend # Cf [2] ZERO WIDTH NON-JOINER..ZERO WIDTH JOINER
|
||||
20D0..20DC ; Extend # Mn [13] COMBINING LEFT HARPOON ABOVE..COMBINING FOUR DOTS ABOVE
|
||||
20DD..20E0 ; Extend # Me [4] COMBINING ENCLOSING CIRCLE..COMBINING ENCLOSING CIRCLE BACKSLASH
|
||||
20E1 ; Extend # Mn COMBINING LEFT RIGHT ARROW ABOVE
|
||||
20E2..20E4 ; Extend # Me [3] COMBINING ENCLOSING SCREEN..COMBINING ENCLOSING UPWARD POINTING TRIANGLE
|
||||
20E5..20EF ; Extend # Mn [11] COMBINING REVERSE SOLIDUS OVERLAY..COMBINING RIGHT ARROW BELOW
|
||||
302A..302F ; Extend # Mn [6] IDEOGRAPHIC LEVEL TONE MARK..HANGUL DOUBLE DOT TONE MARK
|
||||
20E5..20F0 ; Extend # Mn [12] COMBINING REVERSE SOLIDUS OVERLAY..COMBINING ASTERISK ABOVE
|
||||
2CEF..2CF1 ; Extend # Mn [3] COPTIC COMBINING NI ABOVE..COPTIC COMBINING SPIRITUS LENIS
|
||||
2D7F ; Extend # Mn TIFINAGH CONSONANT JOINER
|
||||
2DE0..2DFF ; Extend # Mn [32] COMBINING CYRILLIC LETTER BE..COMBINING CYRILLIC LETTER IOTIFIED BIG YUS
|
||||
302A..302D ; Extend # Mn [4] IDEOGRAPHIC LEVEL TONE MARK..IDEOGRAPHIC ENTERING TONE MARK
|
||||
302E..302F ; Extend # Mc [2] HANGUL SINGLE DOT TONE MARK..HANGUL DOUBLE DOT TONE MARK
|
||||
3099..309A ; Extend # Mn [2] COMBINING KATAKANA-HIRAGANA VOICED SOUND MARK..COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK
|
||||
A66F ; Extend # Mn COMBINING CYRILLIC VZMET
|
||||
A670..A672 ; Extend # Me [3] COMBINING CYRILLIC TEN MILLIONS SIGN..COMBINING CYRILLIC THOUSAND MILLIONS SIGN
|
||||
A674..A67D ; Extend # Mn [10] COMBINING CYRILLIC LETTER UKRAINIAN IE..COMBINING CYRILLIC PAYEROK
|
||||
A69F ; Extend # Mn COMBINING CYRILLIC LETTER IOTIFIED E
|
||||
A6F0..A6F1 ; Extend # Mn [2] BAMUM COMBINING MARK KOQNDON..BAMUM COMBINING MARK TUKWENTIS
|
||||
A802 ; Extend # Mn SYLOTI NAGRI SIGN DVISVARA
|
||||
A806 ; Extend # Mn SYLOTI NAGRI SIGN HASANTA
|
||||
A80B ; Extend # Mn SYLOTI NAGRI SIGN ANUSVARA
|
||||
A825..A826 ; Extend # Mn [2] SYLOTI NAGRI VOWEL SIGN U..SYLOTI NAGRI VOWEL SIGN E
|
||||
A8C4 ; Extend # Mn SAURASHTRA SIGN VIRAMA
|
||||
A8E0..A8F1 ; Extend # Mn [18] COMBINING DEVANAGARI DIGIT ZERO..COMBINING DEVANAGARI SIGN AVAGRAHA
|
||||
A926..A92D ; Extend # Mn [8] KAYAH LI VOWEL UE..KAYAH LI TONE CALYA PLOPHU
|
||||
A947..A951 ; Extend # Mn [11] REJANG VOWEL SIGN I..REJANG CONSONANT SIGN R
|
||||
A980..A982 ; Extend # Mn [3] JAVANESE SIGN PANYANGGA..JAVANESE SIGN LAYAR
|
||||
A9B3 ; Extend # Mn JAVANESE SIGN CECAK TELU
|
||||
A9B6..A9B9 ; Extend # Mn [4] JAVANESE VOWEL SIGN WULU..JAVANESE VOWEL SIGN SUKU MENDUT
|
||||
A9BC ; Extend # Mn JAVANESE VOWEL SIGN PEPET
|
||||
AA29..AA2E ; Extend # Mn [6] CHAM VOWEL SIGN AA..CHAM VOWEL SIGN OE
|
||||
AA31..AA32 ; Extend # Mn [2] CHAM VOWEL SIGN AU..CHAM VOWEL SIGN UE
|
||||
AA35..AA36 ; Extend # Mn [2] CHAM CONSONANT SIGN LA..CHAM CONSONANT SIGN WA
|
||||
AA43 ; Extend # Mn CHAM CONSONANT SIGN FINAL NG
|
||||
AA4C ; Extend # Mn CHAM CONSONANT SIGN FINAL M
|
||||
AAB0 ; Extend # Mn TAI VIET MAI KANG
|
||||
AAB2..AAB4 ; Extend # Mn [3] TAI VIET VOWEL I..TAI VIET VOWEL U
|
||||
AAB7..AAB8 ; Extend # Mn [2] TAI VIET MAI KHIT..TAI VIET VOWEL IA
|
||||
AABE..AABF ; Extend # Mn [2] TAI VIET VOWEL AM..TAI VIET TONE MAI EK
|
||||
AAC1 ; Extend # Mn TAI VIET TONE MAI THO
|
||||
AAEC..AAED ; Extend # Mn [2] MEETEI MAYEK VOWEL SIGN UU..MEETEI MAYEK VOWEL SIGN AAI
|
||||
AAF6 ; Extend # Mn MEETEI MAYEK VIRAMA
|
||||
ABE5 ; Extend # Mn MEETEI MAYEK VOWEL SIGN ANAP
|
||||
ABE8 ; Extend # Mn MEETEI MAYEK VOWEL SIGN UNAP
|
||||
ABED ; Extend # Mn MEETEI MAYEK APUN IYEK
|
||||
FB1E ; Extend # Mn HEBREW POINT JUDEO-SPANISH VARIKA
|
||||
FE00..FE0F ; Extend # Mn [16] VARIATION SELECTOR-1..VARIATION SELECTOR-16
|
||||
FE20..FE23 ; Extend # Mn [4] COMBINING LIGATURE LEFT HALF..COMBINING DOUBLE TILDE RIGHT HALF
|
||||
FE20..FE26 ; Extend # Mn [7] COMBINING LIGATURE LEFT HALF..COMBINING CONJOINING MACRON
|
||||
FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDTH KATAKANA SEMI-VOICED SOUND MARK
|
||||
101FD ; Extend # Mn PHAISTOS DISC SIGN COMBINING OBLIQUE STROKE
|
||||
10A01..10A03 ; Extend # Mn [3] KHAROSHTHI VOWEL SIGN I..KHAROSHTHI VOWEL SIGN VOCALIC R
|
||||
10A05..10A06 ; Extend # Mn [2] KHAROSHTHI VOWEL SIGN E..KHAROSHTHI VOWEL SIGN O
|
||||
10A0C..10A0F ; Extend # Mn [4] KHAROSHTHI VOWEL LENGTH MARK..KHAROSHTHI SIGN VISARGA
|
||||
10A38..10A3A ; Extend # Mn [3] KHAROSHTHI SIGN BAR ABOVE..KHAROSHTHI SIGN DOT BELOW
|
||||
10A3F ; Extend # Mn KHAROSHTHI VIRAMA
|
||||
11001 ; Extend # Mn BRAHMI SIGN ANUSVARA
|
||||
11038..11046 ; Extend # Mn [15] BRAHMI VOWEL SIGN AA..BRAHMI VIRAMA
|
||||
11080..11081 ; Extend # Mn [2] KAITHI SIGN CANDRABINDU..KAITHI SIGN ANUSVARA
|
||||
110B3..110B6 ; Extend # Mn [4] KAITHI VOWEL SIGN U..KAITHI VOWEL SIGN AI
|
||||
110B9..110BA ; Extend # Mn [2] KAITHI SIGN VIRAMA..KAITHI SIGN NUKTA
|
||||
11100..11102 ; Extend # Mn [3] CHAKMA SIGN CANDRABINDU..CHAKMA SIGN VISARGA
|
||||
11127..1112B ; Extend # Mn [5] CHAKMA VOWEL SIGN A..CHAKMA VOWEL SIGN UU
|
||||
1112D..11134 ; Extend # Mn [8] CHAKMA VOWEL SIGN AI..CHAKMA MAAYYAA
|
||||
11180..11181 ; Extend # Mn [2] SHARADA SIGN CANDRABINDU..SHARADA SIGN ANUSVARA
|
||||
111B6..111BE ; Extend # Mn [9] SHARADA VOWEL SIGN U..SHARADA VOWEL SIGN O
|
||||
116AB ; Extend # Mn TAKRI SIGN ANUSVARA
|
||||
116AD ; Extend # Mn TAKRI VOWEL SIGN AA
|
||||
116B0..116B5 ; Extend # Mn [6] TAKRI VOWEL SIGN U..TAKRI VOWEL SIGN AU
|
||||
116B7 ; Extend # Mn TAKRI SIGN NUKTA
|
||||
16F8F..16F92 ; Extend # Mn [4] MIAO TONE RIGHT..MIAO TONE BELOW
|
||||
1D165 ; Extend # Mc MUSICAL SYMBOL COMBINING STEM
|
||||
1D167..1D169 ; Extend # Mn [3] MUSICAL SYMBOL COMBINING TREMOLO-1..MUSICAL SYMBOL COMBINING TREMOLO-3
|
||||
1D16E..1D172 ; Extend # Mc [5] MUSICAL SYMBOL COMBINING FLAG-1..MUSICAL SYMBOL COMBINING FLAG-5
|
||||
@ -207,26 +304,145 @@ FE20..FE23 ; Extend # Mn [4] COMBINING LIGATURE LEFT HALF..COMBINING DOUBLE
|
||||
1D242..1D244 ; Extend # Mn [3] COMBINING GREEK MUSICAL TRISEME..COMBINING GREEK MUSICAL PENTASEME
|
||||
E0100..E01EF ; Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
|
||||
|
||||
# Total code points: 911
|
||||
# Total code points: 1317
|
||||
|
||||
# ================================================
|
||||
|
||||
1100..1159 ; L # Lo [90] HANGUL CHOSEONG KIYEOK..HANGUL CHOSEONG YEORINHIEUH
|
||||
115F ; L # Lo HANGUL CHOSEONG FILLER
|
||||
0903 ; SpacingMark # Mc DEVANAGARI SIGN VISARGA
|
||||
093B ; SpacingMark # Mc DEVANAGARI VOWEL SIGN OOE
|
||||
093E..0940 ; SpacingMark # Mc [3] DEVANAGARI VOWEL SIGN AA..DEVANAGARI VOWEL SIGN II
|
||||
0949..094C ; SpacingMark # Mc [4] DEVANAGARI VOWEL SIGN CANDRA O..DEVANAGARI VOWEL SIGN AU
|
||||
094E..094F ; SpacingMark # Mc [2] DEVANAGARI VOWEL SIGN PRISHTHAMATRA E..DEVANAGARI VOWEL SIGN AW
|
||||
0982..0983 ; SpacingMark # Mc [2] BENGALI SIGN ANUSVARA..BENGALI SIGN VISARGA
|
||||
09BF..09C0 ; SpacingMark # Mc [2] BENGALI VOWEL SIGN I..BENGALI VOWEL SIGN II
|
||||
09C7..09C8 ; SpacingMark # Mc [2] BENGALI VOWEL SIGN E..BENGALI VOWEL SIGN AI
|
||||
09CB..09CC ; SpacingMark # Mc [2] BENGALI VOWEL SIGN O..BENGALI VOWEL SIGN AU
|
||||
0A03 ; SpacingMark # Mc GURMUKHI SIGN VISARGA
|
||||
0A3E..0A40 ; SpacingMark # Mc [3] GURMUKHI VOWEL SIGN AA..GURMUKHI VOWEL SIGN II
|
||||
0A83 ; SpacingMark # Mc GUJARATI SIGN VISARGA
|
||||
0ABE..0AC0 ; SpacingMark # Mc [3] GUJARATI VOWEL SIGN AA..GUJARATI VOWEL SIGN II
|
||||
0AC9 ; SpacingMark # Mc GUJARATI VOWEL SIGN CANDRA O
|
||||
0ACB..0ACC ; SpacingMark # Mc [2] GUJARATI VOWEL SIGN O..GUJARATI VOWEL SIGN AU
|
||||
0B02..0B03 ; SpacingMark # Mc [2] ORIYA SIGN ANUSVARA..ORIYA SIGN VISARGA
|
||||
0B40 ; SpacingMark # Mc ORIYA VOWEL SIGN II
|
||||
0B47..0B48 ; SpacingMark # Mc [2] ORIYA VOWEL SIGN E..ORIYA VOWEL SIGN AI
|
||||
0B4B..0B4C ; SpacingMark # Mc [2] ORIYA VOWEL SIGN O..ORIYA VOWEL SIGN AU
|
||||
0BBF ; SpacingMark # Mc TAMIL VOWEL SIGN I
|
||||
0BC1..0BC2 ; SpacingMark # Mc [2] TAMIL VOWEL SIGN U..TAMIL VOWEL SIGN UU
|
||||
0BC6..0BC8 ; SpacingMark # Mc [3] TAMIL VOWEL SIGN E..TAMIL VOWEL SIGN AI
|
||||
0BCA..0BCC ; SpacingMark # Mc [3] TAMIL VOWEL SIGN O..TAMIL VOWEL SIGN AU
|
||||
0C01..0C03 ; SpacingMark # Mc [3] TELUGU SIGN CANDRABINDU..TELUGU SIGN VISARGA
|
||||
0C41..0C44 ; SpacingMark # Mc [4] TELUGU VOWEL SIGN U..TELUGU VOWEL SIGN VOCALIC RR
|
||||
0C82..0C83 ; SpacingMark # Mc [2] KANNADA SIGN ANUSVARA..KANNADA SIGN VISARGA
|
||||
0CBE ; SpacingMark # Mc KANNADA VOWEL SIGN AA
|
||||
0CC0..0CC1 ; SpacingMark # Mc [2] KANNADA VOWEL SIGN II..KANNADA VOWEL SIGN U
|
||||
0CC3..0CC4 ; SpacingMark # Mc [2] KANNADA VOWEL SIGN VOCALIC R..KANNADA VOWEL SIGN VOCALIC RR
|
||||
0CC7..0CC8 ; SpacingMark # Mc [2] KANNADA VOWEL SIGN EE..KANNADA VOWEL SIGN AI
|
||||
0CCA..0CCB ; SpacingMark # Mc [2] KANNADA VOWEL SIGN O..KANNADA VOWEL SIGN OO
|
||||
0D02..0D03 ; SpacingMark # Mc [2] MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISARGA
|
||||
0D3F..0D40 ; SpacingMark # Mc [2] MALAYALAM VOWEL SIGN I..MALAYALAM VOWEL SIGN II
|
||||
0D46..0D48 ; SpacingMark # Mc [3] MALAYALAM VOWEL SIGN E..MALAYALAM VOWEL SIGN AI
|
||||
0D4A..0D4C ; SpacingMark # Mc [3] MALAYALAM VOWEL SIGN O..MALAYALAM VOWEL SIGN AU
|
||||
0D82..0D83 ; SpacingMark # Mc [2] SINHALA SIGN ANUSVARAYA..SINHALA SIGN VISARGAYA
|
||||
0DD0..0DD1 ; SpacingMark # Mc [2] SINHALA VOWEL SIGN KETTI AEDA-PILLA..SINHALA VOWEL SIGN DIGA AEDA-PILLA
|
||||
0DD8..0DDE ; SpacingMark # Mc [7] SINHALA VOWEL SIGN GAETTA-PILLA..SINHALA VOWEL SIGN KOMBUVA HAA GAYANUKITTA
|
||||
0DF2..0DF3 ; SpacingMark # Mc [2] SINHALA VOWEL SIGN DIGA GAETTA-PILLA..SINHALA VOWEL SIGN DIGA GAYANUKITTA
|
||||
0E33 ; SpacingMark # Lo THAI CHARACTER SARA AM
|
||||
0EB3 ; SpacingMark # Lo LAO VOWEL SIGN AM
|
||||
0F3E..0F3F ; SpacingMark # Mc [2] TIBETAN SIGN YAR TSHES..TIBETAN SIGN MAR TSHES
|
||||
0F7F ; SpacingMark # Mc TIBETAN SIGN RNAM BCAD
|
||||
1031 ; SpacingMark # Mc MYANMAR VOWEL SIGN E
|
||||
103B..103C ; SpacingMark # Mc [2] MYANMAR CONSONANT SIGN MEDIAL YA..MYANMAR CONSONANT SIGN MEDIAL RA
|
||||
1056..1057 ; SpacingMark # Mc [2] MYANMAR VOWEL SIGN VOCALIC R..MYANMAR VOWEL SIGN VOCALIC RR
|
||||
1084 ; SpacingMark # Mc MYANMAR VOWEL SIGN SHAN E
|
||||
17B6 ; SpacingMark # Mc KHMER VOWEL SIGN AA
|
||||
17BE..17C5 ; SpacingMark # Mc [8] KHMER VOWEL SIGN OE..KHMER VOWEL SIGN AU
|
||||
17C7..17C8 ; SpacingMark # Mc [2] KHMER SIGN REAHMUK..KHMER SIGN YUUKALEAPINTU
|
||||
1923..1926 ; SpacingMark # Mc [4] LIMBU VOWEL SIGN EE..LIMBU VOWEL SIGN AU
|
||||
1929..192B ; SpacingMark # Mc [3] LIMBU SUBJOINED LETTER YA..LIMBU SUBJOINED LETTER WA
|
||||
1930..1931 ; SpacingMark # Mc [2] LIMBU SMALL LETTER KA..LIMBU SMALL LETTER NGA
|
||||
1933..1938 ; SpacingMark # Mc [6] LIMBU SMALL LETTER TA..LIMBU SMALL LETTER LA
|
||||
19B5..19B7 ; SpacingMark # Mc [3] NEW TAI LUE VOWEL SIGN E..NEW TAI LUE VOWEL SIGN O
|
||||
19BA ; SpacingMark # Mc NEW TAI LUE VOWEL SIGN AY
|
||||
1A19..1A1B ; SpacingMark # Mc [3] BUGINESE VOWEL SIGN E..BUGINESE VOWEL SIGN AE
|
||||
1A55 ; SpacingMark # Mc TAI THAM CONSONANT SIGN MEDIAL RA
|
||||
1A57 ; SpacingMark # Mc TAI THAM CONSONANT SIGN LA TANG LAI
|
||||
1A6D..1A72 ; SpacingMark # Mc [6] TAI THAM VOWEL SIGN OY..TAI THAM VOWEL SIGN THAM AI
|
||||
1B04 ; SpacingMark # Mc BALINESE SIGN BISAH
|
||||
1B35 ; SpacingMark # Mc BALINESE VOWEL SIGN TEDUNG
|
||||
1B3B ; SpacingMark # Mc BALINESE VOWEL SIGN RA REPA TEDUNG
|
||||
1B3D..1B41 ; SpacingMark # Mc [5] BALINESE VOWEL SIGN LA LENGA TEDUNG..BALINESE VOWEL SIGN TALING REPA TEDUNG
|
||||
1B43..1B44 ; SpacingMark # Mc [2] BALINESE VOWEL SIGN PEPET TEDUNG..BALINESE ADEG ADEG
|
||||
1B82 ; SpacingMark # Mc SUNDANESE SIGN PANGWISAD
|
||||
1BA1 ; SpacingMark # Mc SUNDANESE CONSONANT SIGN PAMINGKAL
|
||||
1BA6..1BA7 ; SpacingMark # Mc [2] SUNDANESE VOWEL SIGN PANAELAENG..SUNDANESE VOWEL SIGN PANOLONG
|
||||
1BAA ; SpacingMark # Mc SUNDANESE SIGN PAMAAEH
|
||||
1BAC..1BAD ; SpacingMark # Mc [2] SUNDANESE CONSONANT SIGN PASANGAN MA..SUNDANESE CONSONANT SIGN PASANGAN WA
|
||||
1BE7 ; SpacingMark # Mc BATAK VOWEL SIGN E
|
||||
1BEA..1BEC ; SpacingMark # Mc [3] BATAK VOWEL SIGN I..BATAK VOWEL SIGN O
|
||||
1BEE ; SpacingMark # Mc BATAK VOWEL SIGN U
|
||||
1BF2..1BF3 ; SpacingMark # Mc [2] BATAK PANGOLAT..BATAK PANONGONAN
|
||||
1C24..1C2B ; SpacingMark # Mc [8] LEPCHA SUBJOINED LETTER YA..LEPCHA VOWEL SIGN UU
|
||||
1C34..1C35 ; SpacingMark # Mc [2] LEPCHA CONSONANT SIGN NYIN-DO..LEPCHA CONSONANT SIGN KANG
|
||||
1CE1 ; SpacingMark # Mc VEDIC TONE ATHARVAVEDIC INDEPENDENT SVARITA
|
||||
1CF2..1CF3 ; SpacingMark # Mc [2] VEDIC SIGN ARDHAVISARGA..VEDIC SIGN ROTATED ARDHAVISARGA
|
||||
A823..A824 ; SpacingMark # Mc [2] SYLOTI NAGRI VOWEL SIGN A..SYLOTI NAGRI VOWEL SIGN I
|
||||
A827 ; SpacingMark # Mc SYLOTI NAGRI VOWEL SIGN OO
|
||||
A880..A881 ; SpacingMark # Mc [2] SAURASHTRA SIGN ANUSVARA..SAURASHTRA SIGN VISARGA
|
||||
A8B4..A8C3 ; SpacingMark # Mc [16] SAURASHTRA CONSONANT SIGN HAARU..SAURASHTRA VOWEL SIGN AU
|
||||
A952..A953 ; SpacingMark # Mc [2] REJANG CONSONANT SIGN H..REJANG VIRAMA
|
||||
A983 ; SpacingMark # Mc JAVANESE SIGN WIGNYAN
|
||||
A9B4..A9B5 ; SpacingMark # Mc [2] JAVANESE VOWEL SIGN TARUNG..JAVANESE VOWEL SIGN TOLONG
|
||||
A9BA..A9BB ; SpacingMark # Mc [2] JAVANESE VOWEL SIGN TALING..JAVANESE VOWEL SIGN DIRGA MURE
|
||||
A9BD..A9C0 ; SpacingMark # Mc [4] JAVANESE CONSONANT SIGN KERET..JAVANESE PANGKON
|
||||
AA2F..AA30 ; SpacingMark # Mc [2] CHAM VOWEL SIGN O..CHAM VOWEL SIGN AI
|
||||
AA33..AA34 ; SpacingMark # Mc [2] CHAM CONSONANT SIGN YA..CHAM CONSONANT SIGN RA
|
||||
AA4D ; SpacingMark # Mc CHAM CONSONANT SIGN FINAL H
|
||||
AAEB ; SpacingMark # Mc MEETEI MAYEK VOWEL SIGN II
|
||||
AAEE..AAEF ; SpacingMark # Mc [2] MEETEI MAYEK VOWEL SIGN AU..MEETEI MAYEK VOWEL SIGN AAU
|
||||
AAF5 ; SpacingMark # Mc MEETEI MAYEK VOWEL SIGN VISARGA
|
||||
ABE3..ABE4 ; SpacingMark # Mc [2] MEETEI MAYEK VOWEL SIGN ONAP..MEETEI MAYEK VOWEL SIGN INAP
|
||||
ABE6..ABE7 ; SpacingMark # Mc [2] MEETEI MAYEK VOWEL SIGN YENAP..MEETEI MAYEK VOWEL SIGN SOUNAP
|
||||
ABE9..ABEA ; SpacingMark # Mc [2] MEETEI MAYEK VOWEL SIGN CHEINAP..MEETEI MAYEK VOWEL SIGN NUNG
|
||||
ABEC ; SpacingMark # Mc MEETEI MAYEK LUM IYEK
|
||||
11000 ; SpacingMark # Mc BRAHMI SIGN CANDRABINDU
|
||||
11002 ; SpacingMark # Mc BRAHMI SIGN VISARGA
|
||||
11082 ; SpacingMark # Mc KAITHI SIGN VISARGA
|
||||
110B0..110B2 ; SpacingMark # Mc [3] KAITHI VOWEL SIGN AA..KAITHI VOWEL SIGN II
|
||||
110B7..110B8 ; SpacingMark # Mc [2] KAITHI VOWEL SIGN O..KAITHI VOWEL SIGN AU
|
||||
1112C ; SpacingMark # Mc CHAKMA VOWEL SIGN E
|
||||
11182 ; SpacingMark # Mc SHARADA SIGN VISARGA
|
||||
111B3..111B5 ; SpacingMark # Mc [3] SHARADA VOWEL SIGN AA..SHARADA VOWEL SIGN II
|
||||
111BF..111C0 ; SpacingMark # Mc [2] SHARADA VOWEL SIGN AU..SHARADA SIGN VIRAMA
|
||||
116AC ; SpacingMark # Mc TAKRI SIGN VISARGA
|
||||
116AE..116AF ; SpacingMark # Mc [2] TAKRI VOWEL SIGN I..TAKRI VOWEL SIGN II
|
||||
116B6 ; SpacingMark # Mc TAKRI SIGN VIRAMA
|
||||
16F51..16F7E ; SpacingMark # Mc [46] MIAO SIGN ASPIRATION..MIAO VOWEL SIGN NG
|
||||
1D166 ; SpacingMark # Mc MUSICAL SYMBOL COMBINING SPRECHGESANG STEM
|
||||
1D16D ; SpacingMark # Mc MUSICAL SYMBOL COMBINING AUGMENTATION DOT
|
||||
|
||||
# Total code points: 91
|
||||
# Total code points: 291
|
||||
|
||||
# ================================================
|
||||
|
||||
1160..11A2 ; V # Lo [67] HANGUL JUNGSEONG FILLER..HANGUL JUNGSEONG SSANGARAEA
|
||||
1100..115F ; L # Lo [96] HANGUL CHOSEONG KIYEOK..HANGUL CHOSEONG FILLER
|
||||
A960..A97C ; L # Lo [29] HANGUL CHOSEONG TIKEUT-MIEUM..HANGUL CHOSEONG SSANGYEORINHIEUH
|
||||
|
||||
# Total code points: 67
|
||||
# Total code points: 125
|
||||
|
||||
# ================================================
|
||||
|
||||
11A8..11F9 ; T # Lo [82] HANGUL JONGSEONG KIYEOK..HANGUL JONGSEONG YEORINHIEUH
|
||||
1160..11A7 ; V # Lo [72] HANGUL JUNGSEONG FILLER..HANGUL JUNGSEONG O-YAE
|
||||
D7B0..D7C6 ; V # Lo [23] HANGUL JUNGSEONG O-YEO..HANGUL JUNGSEONG ARAEA-E
|
||||
|
||||
# Total code points: 82
|
||||
# Total code points: 95
|
||||
|
||||
# ================================================
|
||||
|
||||
11A8..11FF ; T # Lo [88] HANGUL JONGSEONG KIYEOK..HANGUL JONGSEONG SSANGNIEUN
|
||||
D7CB..D7FB ; T # Lo [49] HANGUL JONGSEONG NIEUN-RIEUL..HANGUL JONGSEONG PHIEUPH-THIEUTH
|
||||
|
||||
# Total code points: 137
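The property lines in this hunk use the layout `code point or range ; value # comment`, and each section ends with a "Total code points" line. As a rough sketch only (the helper name `count_by_value` and the regular expression are illustrative assumptions, not part of the Unicode data files or of this commit), the totals can be cross-checked against the ranges like this:

```python
import re

# Hypothetical helper for illustration; not part of the UCD or of this commit.
# Matches "XXXX" or "XXXX..YYYY", then ";", then the property value.
LINE_RE = re.compile(
    r"^\s*([0-9A-Fa-f]{4,6})(?:\.\.([0-9A-Fa-f]{4,6}))?\s*;\s*(\w+)"
)


def count_by_value(lines):
    """Sum the number of code points listed for each property value."""
    totals = {}
    for line in lines:
        data = line.split("#", 1)[0]  # drop the trailing comment
        match = LINE_RE.match(data)
        if match is None:
            continue  # blank line, pure comment, or diff residue
        first = int(match.group(1), 16)
        last = int(match.group(2), 16) if match.group(2) else first
        value = match.group(3)
        totals[value] = totals.get(value, 0) + (last - first + 1)
    return totals


# The 6.1.0 Hangul lines as they appear in the hunk above.
SAMPLE = [
    "1100..115F    ; L # Lo  [96] HANGUL CHOSEONG KIYEOK..HANGUL CHOSEONG FILLER",
    "A960..A97C    ; L # Lo  [29] HANGUL CHOSEONG TIKEUT-MIEUM..HANGUL CHOSEONG SSANGYEORINHIEUH",
    "# Total code points: 125",
    "1160..11A7    ; V # Lo  [72] HANGUL JUNGSEONG FILLER..HANGUL JUNGSEONG O-YAE",
    "D7B0..D7C6    ; V # Lo  [23] HANGUL JUNGSEONG O-YEO..HANGUL JUNGSEONG ARAEA-E",
    "# Total code points: 95",
    "11A8..11FF    ; T # Lo  [88] HANGUL JONGSEONG KIYEOK..HANGUL JONGSEONG SSANGNIEUN",
    "D7CB..D7FB    ; T # Lo  [49] HANGUL JONGSEONG NIEUN-RIEUL..HANGUL JONGSEONG PHIEUPH-THIEUTH",
    "# Total code points: 137",
]

totals = count_by_value(SAMPLE)
print(totals)
```

For the sample lines the computed sums agree with the totals stated in the new revision (125 for L, 95 for V, 137 for T). The sketch assumes the input contains only one revision of the file; in this diff view old and new lines are interleaved, so they would need to be separated first.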
|
||||
|
||||
# ================================================
|
||||
|
||||
|
File diff suppressed because it is too large
@ -1,14 +1,14 @@
|
||||
# NormalizationCorrections-5.0.0.txt
|
||||
# Date: 2006-07-14, 11:23:00 PDT [KW]
|
||||
# NormalizationCorrections-6.1.0.txt
|
||||
# Date: 2011-06-23, 00:46:00 GMT [KW, LI]
|
||||
#
|
||||
# This file is a normative contributory data file in the
|
||||
# Unicode Character Database.
|
||||
#
|
||||
# Copyright (c) 1991-2006 Unicode, Inc.
|
||||
# Copyright (c) 1991-2011 Unicode, Inc.
|
||||
# For terms of use, see http://www.unicode.org/terms_of_use.html
|
||||
#
|
||||
# The normalization stabilization policy of the Unicode
|
||||
# Consortium ordinarily precludes any change to the decomposition
|
||||
# The normalization stability policy of the Unicode Consortium
|
||||
# ordinarily precludes any change to the decomposition
|
||||
# for any character, once established in a relevant version
|
||||
# of the UnicodeData.txt data file. However, under certain
|
||||
# exceptional (and rare) conditions, an error in a decomposition
|
||||
|
File diff suppressed because it is too large
File diff suppressed because it is too large
@ -1,17 +1,17 @@
|
||||
# SpecialCasing-5.0.0.txt
|
||||
# Date: 2006-03-03, 08:23:36 GMT [MD]
|
||||
# SpecialCasing-6.1.0.txt
|
||||
# Date: 2011-11-27, 05:10:51 GMT [MD]
|
||||
#
|
||||
# Unicode Character Database
|
||||
# Copyright (c) 1991-2006 Unicode, Inc.
|
||||
# Copyright (c) 1991-2011 Unicode, Inc.
|
||||
# For terms of use, see http://www.unicode.org/terms_of_use.html
|
||||
# For documentation, see UCD.html
|
||||
# For documentation, see http://www.unicode.org/reports/tr44/
|
||||
#
|
||||
# Special Casing Properties
|
||||
#
|
||||
# This file is a supplement to the UnicodeData file.
|
||||
# It contains additional information about the casing of Unicode characters.
|
||||
# (For compatibility, the UnicodeData.txt file only contains case mappings for
|
||||
# characters where they are 1-1, and does not have locale-specific mappings.)
|
||||
# characters where they are 1-1, and independent of context and language.
|
||||
# For more information, see the discussion of Case Mappings in the Unicode Standard.
|
||||
#
|
||||
# All code points not listed in this file that do not have a simple case mappings
|
||||
@ -27,16 +27,16 @@
|
||||
# than one character, they are separated by spaces. Other than as used to separate
|
||||
# elements, spaces are to be ignored.
|
||||
#
|
||||
# The <condition_list> is optional. Where present, it consists of one or more locale IDs
|
||||
# The <condition_list> is optional. Where present, it consists of one or more language IDs
|
||||
# or contexts, separated by spaces. In these conditions:
|
||||
# - A condition list overrides the normal behavior if all of the listed conditions are true.
|
||||
# - The context is always the context of the characters in the original string,
|
||||
# NOT in the resulting string.
|
||||
# - Case distinctions in the condition list are not significant.
|
||||
# - Conditions preceded by "Not_" represent the negation of the condition.
|
||||
# The condition list is not represented in the UCD as a formal property.
|
||||
#
|
||||
# A locale ID is defined by taking any language tag as defined by
|
||||
# RFC 3066 (or its successor), and replacing '-' by '_'.
|
||||
# A language ID is defined by BCP 47, with '-' and '_' treated equivalently.
|
||||
#
|
||||
# A context for a character C is defined by Section 3.13 Default Case
|
||||
# Operations, of The Unicode Standard, Version 5.0.
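The header above describes the optional <condition_list>: space-separated language IDs or contexts, compared without regard to case, with a "Not_" prefix for negation and '-' and '_' treated equivalently. A minimal sketch of reading one such line follows. The function name `parse_special_casing` and the returned dictionary layout are assumptions made for illustration, and the second sample is the Not_Final_Sigma line from this file with its leading "# " removed, since the file itself only lists it as a comment.

```python
def parse_special_casing(line):
    """Parse one SpecialCasing-style line (hypothetical helper, for illustration).

    Expected layout: <code>; <lower>; <title>; <upper>; (<condition_list>;)? # <comment>
    Returns None for blank or comment-only lines.
    """
    data = line.split("#", 1)[0].strip()
    if not data:
        return None
    fields = [field.strip() for field in data.split(";")]
    if fields and fields[-1] == "":
        fields.pop()  # the trailing ';' leaves one empty element
    if len(fields) < 4:
        return None

    def code_points(text):
        # Multiple code points in one field are separated by spaces.
        return [int(token, 16) for token in text.split()]

    conditions = []
    if len(fields) > 4:
        for token in fields[4].split():
            # Case is not significant; '-' and '_' are treated equivalently.
            name = token.lower().replace("-", "_")
            negated = name.startswith("not_")
            if negated:
                name = name[len("not_"):]
            conditions.append((name, negated))

    return {
        "code": int(fields[0], 16),
        "lower": code_points(fields[1]),
        "title": code_points(fields[2]),
        "upper": code_points(fields[3]),
        "conditions": conditions,
    }


unconditional = parse_special_casing(
    "FB17; FB17; 0544 056D; 0544 053D; # ARMENIAN SMALL LIGATURE MEN XEH"
)
conditional = parse_special_casing(
    "03C2; 03C3; 03A3; 03A3; Not_Final_Sigma; # GREEK SMALL LETTER FINAL SIGMA"
)
print(unconditional)
print(conditional)
```

The sketch only tokenizes the condition list. Deciding whether a context such as Final_Sigma actually holds at a given position requires the rules in Section 3.13 of the Unicode Standard and is not attempted here.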
|
||||
@ -48,6 +48,8 @@
|
||||
# * Additional fields
|
||||
# ================================================================================
|
||||
|
||||
# @missing: 0000..10FFFF; <slc>; <stc>; <suc>;
|
||||
|
||||
# ================================================================================
|
||||
# Unconditional mappings
|
||||
# ================================================================================
|
||||
@ -106,11 +108,11 @@ FB17; FB17; 0544 056D; 0544 053D; # ARMENIAN SMALL LIGATURE MEN XEH
|
||||
1FE7; 1FE7; 03A5 0308 0342; 03A5 0308 0342; # GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND PERISPOMENI
|
||||
1FF6; 1FF6; 03A9 0342; 03A9 0342; # GREEK SMALL LETTER OMEGA WITH PERISPOMENI
|
||||
|
||||
# IMPORTANT-when capitalizing iota-subscript (0345)
|
||||
# It MUST be in normalized form--moved to the end of any sequence of combining marks.
|
||||
# This is because logically it represents a following base character!
|
||||
# E.g. <iota_subscript> (<Mn> | <Mc> | <Me>)+ => (<Mn> | <Mc> | <Me>)+ <iota_subscript>
|
||||
# It should never be the first character in a word, so in titlecasing it can be left as is.
|
||||
# IMPORTANT-when iota-subscript (0345) is uppercased or titlecased,
|
||||
# the result will be incorrect unless the iota-subscript is moved to the end
|
||||
# of any sequence of combining marks. Otherwise, the accents will go on the capital iota.
|
||||
# This process can be achieved by first transforming the text to NFC before casing.
|
||||
# E.g. <alpha><iota_subscript><acute> is uppercased to <ALPHA><acute><IOTA>
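The revised note says the iota-subscript must follow the other combining marks before casing, and that normalizing to NFC first achieves this. The following sketch illustrates that with Python's standard `unicodedata` module. It assumes the interpreter's `str.upper()` applies the full (SpecialCasing) mappings, and the helper name `upper_after_nfc` is hypothetical.

```python
import unicodedata

# <alpha><iota_subscript><acute>, the sequence from the comment above.
text = "\u03B1\u0345\u0301"


def upper_after_nfc(s):
    """Normalize to NFC first, then uppercase (hypothetical helper)."""
    return unicodedata.normalize("NFC", s).upper()


naive = text.upper()           # uppercased in the original order
fixed = upper_after_nfc(text)  # reordered by normalization first

# Decompose both results so the order of the marks is easy to inspect.
for label, value in (("naive", naive), ("fixed", fixed)):
    decomposed = unicodedata.normalize("NFD", value)
    print(label, [f"U+{ord(ch):04X}" for ch in decomposed])
```

In the normalized path the acute accent stays on the capital alpha and the capital iota follows it, which matches the <ALPHA><acute><IOTA> result given in the comment. In the naive path the accent ends up after the capital iota.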
|
||||
|
||||
# The following cases are already in the UnicodeData file, so are only commented here.
|
||||
|
||||
@ -189,7 +191,14 @@ FB17; FB17; 0544 056D; 0544 053D; # ARMENIAN SMALL LIGATURE MEN XEH
|
||||
1FF7; 1FF7; 03A9 0342 0345; 03A9 0342 0399; # GREEK SMALL LETTER OMEGA WITH PERISPOMENI AND YPOGEGRAMMENI
|
||||
|
||||
# ================================================================================
|
||||
# Conditional mappings
|
||||
# Conditional Mappings
|
||||
# The remainder of this file provides conditional casing data used to produce
|
||||
# full case mappings.
|
||||
# ================================================================================
|
||||
# Language-Insensitive Mappings
|
||||
# These are characters whose full case mappings do not depend on language, but do
|
||||
# depend on context (which characters come before or after). For more information
|
||||
# see the header of this file and the Unicode Standard.
|
||||
# ================================================================================
|
||||
|
||||
# Special case for final form of sigma
|
||||
@ -208,7 +217,10 @@ FB17; FB17; 0544 056D; 0544 053D; # ARMENIAN SMALL LIGATURE MEN XEH
|
||||
# 03C2; 03C3; 03A3; 03A3; Not_Final_Sigma; # GREEK SMALL LETTER FINAL SIGMA
|
||||
|
||||
# ================================================================================
|
||||
# Locale-sensitive mappings
|
||||
# Language-Sensitive Mappings
|
||||
# These are characters whose full case mappings depend on language and perhaps also
|
||||
# context (which characters come before or after). For more information
|
||||
# see the header of this file and the Unicode Standard.
|
||||
# ================================================================================
|
||||
|
||||
# Lithuanian
|
||||
|
File diff suppressed because it is too large
@ -1,10 +1,10 @@
|
||||
# WordBreakProperty-5.0.0.txt
|
||||
# Date: 2006-06-07, 23:23:03 GMT [MD]
|
||||
# WordBreakProperty-6.1.0.txt
|
||||
# Date: 2011-11-27, 05:10:51 GMT [MD]
|
||||
#
|
||||
# Unicode Character Database
|
||||
# Copyright (c) 1991-2006 Unicode, Inc.
|
||||
# Copyright (c) 1991-2011 Unicode, Inc.
|
||||
# For terms of use, see http://www.unicode.org/terms_of_use.html
|
||||
# For documentation, see UCD.html
|
||||
# For documentation, see http://www.unicode.org/reports/tr44/
|
||||
|
||||
# ================================================
|
||||
|
||||
@ -17,39 +17,39 @@
|
||||
|
||||
# ================================================
|
||||
|
||||
00AD ; Format # Cf SOFT HYPHEN
|
||||
0600..0603 ; Format # Cf [4] ARABIC NUMBER SIGN..ARABIC SIGN SAFHA
|
||||
06DD ; Format # Cf ARABIC END OF AYAH
|
||||
070F ; Format # Cf SYRIAC ABBREVIATION MARK
|
||||
17B4..17B5 ; Format # Cf [2] KHMER VOWEL INHERENT AQ..KHMER VOWEL INHERENT AA
|
||||
200B ; Format # Cf ZERO WIDTH SPACE
|
||||
200E..200F ; Format # Cf [2] LEFT-TO-RIGHT MARK..RIGHT-TO-LEFT MARK
|
||||
202A..202E ; Format # Cf [5] LEFT-TO-RIGHT EMBEDDING..RIGHT-TO-LEFT OVERRIDE
|
||||
2060..2063 ; Format # Cf [4] WORD JOINER..INVISIBLE SEPARATOR
|
||||
206A..206F ; Format # Cf [6] INHIBIT SYMMETRIC SWAPPING..NOMINAL DIGIT SHAPES
|
||||
FEFF ; Format # Cf ZERO WIDTH NO-BREAK SPACE
|
||||
FFF9..FFFB ; Format # Cf [3] INTERLINEAR ANNOTATION ANCHOR..INTERLINEAR ANNOTATION TERMINATOR
|
||||
1D173..1D17A ; Format # Cf [8] MUSICAL SYMBOL BEGIN BEAM..MUSICAL SYMBOL END PHRASE
|
||||
E0001 ; Format # Cf LANGUAGE TAG
|
||||
E0020..E007F ; Format # Cf [96] TAG SPACE..CANCEL TAG
|
||||
000D ; CR # Cc <control-000D>
|
||||
|
||||
# Total code points: 201
|
||||
# Total code points: 1
|
||||
|
||||
# ================================================
|
||||
|
||||
000A ; LF # Cc <control-000A>
|
||||
|
||||
# Total code points: 1
|
||||
|
||||
# ================================================
|
||||
|
||||
000B..000C ; Newline # Cc [2] <control-000B>..<control-000C>
|
||||
0085 ; Newline # Cc <control-0085>
|
||||
2028 ; Newline # Zl LINE SEPARATOR
|
||||
2029 ; Newline # Zp PARAGRAPH SEPARATOR
|
||||
|
||||
# Total code points: 5
|
||||
|
||||
# ================================================
|
||||
|
||||
0300..036F ; Extend # Mn [112] COMBINING GRAVE ACCENT..COMBINING LATIN SMALL LETTER X
|
||||
0483..0486 ; Extend # Mn [4] COMBINING CYRILLIC TITLO..COMBINING CYRILLIC PSILI PNEUMATA
|
||||
0483..0487 ; Extend # Mn [5] COMBINING CYRILLIC TITLO..COMBINING CYRILLIC POKRYTIE
|
||||
0488..0489 ; Extend # Me [2] COMBINING CYRILLIC HUNDRED THOUSANDS SIGN..COMBINING CYRILLIC MILLIONS SIGN
|
||||
0591..05BD ; Extend # Mn [45] HEBREW ACCENT ETNAHTA..HEBREW POINT METEG
|
||||
05BF ; Extend # Mn HEBREW POINT RAFE
|
||||
05C1..05C2 ; Extend # Mn [2] HEBREW POINT SHIN DOT..HEBREW POINT SIN DOT
|
||||
05C4..05C5 ; Extend # Mn [2] HEBREW MARK UPPER DOT..HEBREW MARK LOWER DOT
|
||||
05C7 ; Extend # Mn HEBREW POINT QAMATS QATAN
|
||||
0610..0615 ; Extend # Mn [6] ARABIC SIGN SALLALLAHOU ALAYHE WASSALLAM..ARABIC SMALL HIGH TAH
|
||||
064B..065E ; Extend # Mn [20] ARABIC FATHATAN..ARABIC FATHA WITH TWO DOTS
|
||||
0610..061A ; Extend # Mn [11] ARABIC SIGN SALLALLAHOU ALAYHE WASSALLAM..ARABIC SMALL KASRA
|
||||
064B..065F ; Extend # Mn [21] ARABIC FATHATAN..ARABIC WAVY HAMZA BELOW
|
||||
0670 ; Extend # Mn ARABIC LETTER SUPERSCRIPT ALEF
|
||||
06D6..06DC ; Extend # Mn [7] ARABIC SMALL HIGH LIGATURE SAD WITH LAM WITH ALEF MAKSURA..ARABIC SMALL HIGH SEEN
|
||||
06DE ; Extend # Me ARABIC START OF RUB EL HIZB
|
||||
06DF..06E4 ; Extend # Mn [6] ARABIC SMALL HIGH ROUNDED ZERO..ARABIC SMALL HIGH MADDA
|
||||
06E7..06E8 ; Extend # Mn [2] ARABIC SMALL HIGH YEH..ARABIC SMALL HIGH NOON
|
||||
06EA..06ED ; Extend # Mn [4] ARABIC EMPTY CENTRE LOW STOP..ARABIC SMALL LOW MEEM
|
||||
@ -57,64 +57,108 @@ E0020..E007F ; Format # Cf [96] TAG SPACE..CANCEL TAG
|
||||
0730..074A ; Extend # Mn [27] SYRIAC PTHAHA ABOVE..SYRIAC BARREKH
|
||||
07A6..07B0 ; Extend # Mn [11] THAANA ABAFILI..THAANA SUKUN
|
||||
07EB..07F3 ; Extend # Mn [9] NKO COMBINING SHORT HIGH TONE..NKO COMBINING DOUBLE DOT ABOVE
|
||||
0901..0902 ; Extend # Mn [2] DEVANAGARI SIGN CANDRABINDU..DEVANAGARI SIGN ANUSVARA
|
||||
0816..0819 ; Extend # Mn [4] SAMARITAN MARK IN..SAMARITAN MARK DAGESH
|
||||
081B..0823 ; Extend # Mn [9] SAMARITAN MARK EPENTHETIC YUT..SAMARITAN VOWEL SIGN A
|
||||
0825..0827 ; Extend # Mn [3] SAMARITAN VOWEL SIGN SHORT A..SAMARITAN VOWEL SIGN U
|
||||
0829..082D ; Extend # Mn [5] SAMARITAN VOWEL SIGN LONG I..SAMARITAN MARK NEQUDAA
|
||||
0859..085B ; Extend # Mn [3] MANDAIC AFFRICATION MARK..MANDAIC GEMINATION MARK
|
||||
08E4..08FE ; Extend # Mn [27] ARABIC CURLY FATHA..ARABIC DAMMA WITH DOT
|
||||
0900..0902 ; Extend # Mn [3] DEVANAGARI SIGN INVERTED CANDRABINDU..DEVANAGARI SIGN ANUSVARA
|
||||
0903 ; Extend # Mc DEVANAGARI SIGN VISARGA
|
||||
093A ; Extend # Mn DEVANAGARI VOWEL SIGN OE
|
||||
093B ; Extend # Mc DEVANAGARI VOWEL SIGN OOE
|
||||
093C ; Extend # Mn DEVANAGARI SIGN NUKTA
|
||||
093E..0940 ; Extend # Mc [3] DEVANAGARI VOWEL SIGN AA..DEVANAGARI VOWEL SIGN II
|
||||
0941..0948 ; Extend # Mn [8] DEVANAGARI VOWEL SIGN U..DEVANAGARI VOWEL SIGN AI
|
||||
0949..094C ; Extend # Mc [4] DEVANAGARI VOWEL SIGN CANDRA O..DEVANAGARI VOWEL SIGN AU
|
||||
094D ; Extend # Mn DEVANAGARI SIGN VIRAMA
|
||||
0951..0954 ; Extend # Mn [4] DEVANAGARI STRESS SIGN UDATTA..DEVANAGARI ACUTE ACCENT
|
||||
094E..094F ; Extend # Mc [2] DEVANAGARI VOWEL SIGN PRISHTHAMATRA E..DEVANAGARI VOWEL SIGN AW
|
||||
0951..0957 ; Extend # Mn [7] DEVANAGARI STRESS SIGN UDATTA..DEVANAGARI VOWEL SIGN UUE
|
||||
0962..0963 ; Extend # Mn [2] DEVANAGARI VOWEL SIGN VOCALIC L..DEVANAGARI VOWEL SIGN VOCALIC LL
|
||||
0981 ; Extend # Mn BENGALI SIGN CANDRABINDU
|
||||
0982..0983 ; Extend # Mc [2] BENGALI SIGN ANUSVARA..BENGALI SIGN VISARGA
|
||||
09BC ; Extend # Mn BENGALI SIGN NUKTA
|
||||
09BE ; Extend # Mc BENGALI VOWEL SIGN AA
|
||||
09BE..09C0 ; Extend # Mc [3] BENGALI VOWEL SIGN AA..BENGALI VOWEL SIGN II
|
||||
09C1..09C4 ; Extend # Mn [4] BENGALI VOWEL SIGN U..BENGALI VOWEL SIGN VOCALIC RR
|
||||
09C7..09C8 ; Extend # Mc [2] BENGALI VOWEL SIGN E..BENGALI VOWEL SIGN AI
|
||||
09CB..09CC ; Extend # Mc [2] BENGALI VOWEL SIGN O..BENGALI VOWEL SIGN AU
|
||||
09CD ; Extend # Mn BENGALI SIGN VIRAMA
|
||||
09D7 ; Extend # Mc BENGALI AU LENGTH MARK
|
||||
09E2..09E3 ; Extend # Mn [2] BENGALI VOWEL SIGN VOCALIC L..BENGALI VOWEL SIGN VOCALIC LL
|
||||
0A01..0A02 ; Extend # Mn [2] GURMUKHI SIGN ADAK BINDI..GURMUKHI SIGN BINDI
|
||||
0A03 ; Extend # Mc GURMUKHI SIGN VISARGA
|
||||
0A3C ; Extend # Mn GURMUKHI SIGN NUKTA
|
||||
0A3E..0A40 ; Extend # Mc [3] GURMUKHI VOWEL SIGN AA..GURMUKHI VOWEL SIGN II
|
||||
0A41..0A42 ; Extend # Mn [2] GURMUKHI VOWEL SIGN U..GURMUKHI VOWEL SIGN UU
|
||||
0A47..0A48 ; Extend # Mn [2] GURMUKHI VOWEL SIGN EE..GURMUKHI VOWEL SIGN AI
|
||||
0A4B..0A4D ; Extend # Mn [3] GURMUKHI VOWEL SIGN OO..GURMUKHI SIGN VIRAMA
|
||||
0A51 ; Extend # Mn GURMUKHI SIGN UDAAT
|
||||
0A70..0A71 ; Extend # Mn [2] GURMUKHI TIPPI..GURMUKHI ADDAK
|
||||
0A75 ; Extend # Mn GURMUKHI SIGN YAKASH
|
||||
0A81..0A82 ; Extend # Mn [2] GUJARATI SIGN CANDRABINDU..GUJARATI SIGN ANUSVARA
|
||||
0A83 ; Extend # Mc GUJARATI SIGN VISARGA
|
||||
0ABC ; Extend # Mn GUJARATI SIGN NUKTA
|
||||
0ABE..0AC0 ; Extend # Mc [3] GUJARATI VOWEL SIGN AA..GUJARATI VOWEL SIGN II
|
||||
0AC1..0AC5 ; Extend # Mn [5] GUJARATI VOWEL SIGN U..GUJARATI VOWEL SIGN CANDRA E
|
||||
0AC7..0AC8 ; Extend # Mn [2] GUJARATI VOWEL SIGN E..GUJARATI VOWEL SIGN AI
|
||||
0AC9 ; Extend # Mc GUJARATI VOWEL SIGN CANDRA O
|
||||
0ACB..0ACC ; Extend # Mc [2] GUJARATI VOWEL SIGN O..GUJARATI VOWEL SIGN AU
|
||||
0ACD ; Extend # Mn GUJARATI SIGN VIRAMA
|
||||
0AE2..0AE3 ; Extend # Mn [2] GUJARATI VOWEL SIGN VOCALIC L..GUJARATI VOWEL SIGN VOCALIC LL
|
||||
0B01 ; Extend # Mn ORIYA SIGN CANDRABINDU
|
||||
0B02..0B03 ; Extend # Mc [2] ORIYA SIGN ANUSVARA..ORIYA SIGN VISARGA
|
||||
0B3C ; Extend # Mn ORIYA SIGN NUKTA
|
||||
0B3E ; Extend # Mc ORIYA VOWEL SIGN AA
|
||||
0B3F ; Extend # Mn ORIYA VOWEL SIGN I
|
||||
0B41..0B43 ; Extend # Mn [3] ORIYA VOWEL SIGN U..ORIYA VOWEL SIGN VOCALIC R
|
||||
0B40 ; Extend # Mc ORIYA VOWEL SIGN II
|
||||
0B41..0B44 ; Extend # Mn [4] ORIYA VOWEL SIGN U..ORIYA VOWEL SIGN VOCALIC RR
|
||||
0B47..0B48 ; Extend # Mc [2] ORIYA VOWEL SIGN E..ORIYA VOWEL SIGN AI
|
||||
0B4B..0B4C ; Extend # Mc [2] ORIYA VOWEL SIGN O..ORIYA VOWEL SIGN AU
|
||||
0B4D ; Extend # Mn ORIYA SIGN VIRAMA
|
||||
0B56 ; Extend # Mn ORIYA AI LENGTH MARK
|
||||
0B57 ; Extend # Mc ORIYA AU LENGTH MARK
|
||||
0B62..0B63 ; Extend # Mn [2] ORIYA VOWEL SIGN VOCALIC L..ORIYA VOWEL SIGN VOCALIC LL
|
||||
0B82 ; Extend # Mn TAMIL SIGN ANUSVARA
|
||||
0BBE ; Extend # Mc TAMIL VOWEL SIGN AA
|
||||
0BBE..0BBF ; Extend # Mc [2] TAMIL VOWEL SIGN AA..TAMIL VOWEL SIGN I
|
||||
0BC0 ; Extend # Mn TAMIL VOWEL SIGN II
|
||||
0BC1..0BC2 ; Extend # Mc [2] TAMIL VOWEL SIGN U..TAMIL VOWEL SIGN UU
|
||||
0BC6..0BC8 ; Extend # Mc [3] TAMIL VOWEL SIGN E..TAMIL VOWEL SIGN AI
|
||||
0BCA..0BCC ; Extend # Mc [3] TAMIL VOWEL SIGN O..TAMIL VOWEL SIGN AU
|
||||
0BCD ; Extend # Mn TAMIL SIGN VIRAMA
|
||||
0BD7 ; Extend # Mc TAMIL AU LENGTH MARK
|
||||
0C01..0C03 ; Extend # Mc [3] TELUGU SIGN CANDRABINDU..TELUGU SIGN VISARGA
|
||||
0C3E..0C40 ; Extend # Mn [3] TELUGU VOWEL SIGN AA..TELUGU VOWEL SIGN II
|
||||
0C41..0C44 ; Extend # Mc [4] TELUGU VOWEL SIGN U..TELUGU VOWEL SIGN VOCALIC RR
|
||||
0C46..0C48 ; Extend # Mn [3] TELUGU VOWEL SIGN E..TELUGU VOWEL SIGN AI
|
||||
0C4A..0C4D ; Extend # Mn [4] TELUGU VOWEL SIGN O..TELUGU SIGN VIRAMA
|
||||
0C55..0C56 ; Extend # Mn [2] TELUGU LENGTH MARK..TELUGU AI LENGTH MARK
|
||||
0C62..0C63 ; Extend # Mn [2] TELUGU VOWEL SIGN VOCALIC L..TELUGU VOWEL SIGN VOCALIC LL
0C82..0C83 ; Extend # Mc [2] KANNADA SIGN ANUSVARA..KANNADA SIGN VISARGA
0CBC ; Extend # Mn KANNADA SIGN NUKTA
0CBE ; Extend # Mc KANNADA VOWEL SIGN AA
0CBF ; Extend # Mn KANNADA VOWEL SIGN I
0CC2 ; Extend # Mc KANNADA VOWEL SIGN UU
0CC0..0CC4 ; Extend # Mc [5] KANNADA VOWEL SIGN II..KANNADA VOWEL SIGN VOCALIC RR
0CC6 ; Extend # Mn KANNADA VOWEL SIGN E
0CC7..0CC8 ; Extend # Mc [2] KANNADA VOWEL SIGN EE..KANNADA VOWEL SIGN AI
0CCA..0CCB ; Extend # Mc [2] KANNADA VOWEL SIGN O..KANNADA VOWEL SIGN OO
0CCC..0CCD ; Extend # Mn [2] KANNADA VOWEL SIGN AU..KANNADA SIGN VIRAMA
0CD5..0CD6 ; Extend # Mc [2] KANNADA LENGTH MARK..KANNADA AI LENGTH MARK
0CE2..0CE3 ; Extend # Mn [2] KANNADA VOWEL SIGN VOCALIC L..KANNADA VOWEL SIGN VOCALIC LL
0D3E ; Extend # Mc MALAYALAM VOWEL SIGN AA
0D41..0D43 ; Extend # Mn [3] MALAYALAM VOWEL SIGN U..MALAYALAM VOWEL SIGN VOCALIC R
0D02..0D03 ; Extend # Mc [2] MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISARGA
0D3E..0D40 ; Extend # Mc [3] MALAYALAM VOWEL SIGN AA..MALAYALAM VOWEL SIGN II
0D41..0D44 ; Extend # Mn [4] MALAYALAM VOWEL SIGN U..MALAYALAM VOWEL SIGN VOCALIC RR
0D46..0D48 ; Extend # Mc [3] MALAYALAM VOWEL SIGN E..MALAYALAM VOWEL SIGN AI
0D4A..0D4C ; Extend # Mc [3] MALAYALAM VOWEL SIGN O..MALAYALAM VOWEL SIGN AU
0D4D ; Extend # Mn MALAYALAM SIGN VIRAMA
0D57 ; Extend # Mc MALAYALAM AU LENGTH MARK
0D62..0D63 ; Extend # Mn [2] MALAYALAM VOWEL SIGN VOCALIC L..MALAYALAM VOWEL SIGN VOCALIC LL
0D82..0D83 ; Extend # Mc [2] SINHALA SIGN ANUSVARAYA..SINHALA SIGN VISARGAYA
0DCA ; Extend # Mn SINHALA SIGN AL-LAKUNA
0DCF ; Extend # Mc SINHALA VOWEL SIGN AELA-PILLA
0DCF..0DD1 ; Extend # Mc [3] SINHALA VOWEL SIGN AELA-PILLA..SINHALA VOWEL SIGN DIGA AEDA-PILLA
0DD2..0DD4 ; Extend # Mn [3] SINHALA VOWEL SIGN KETTI IS-PILLA..SINHALA VOWEL SIGN KETTI PAA-PILLA
0DD6 ; Extend # Mn SINHALA VOWEL SIGN DIGA PAA-PILLA
0DDF ; Extend # Mc SINHALA VOWEL SIGN GAYANUKITTA
0DD8..0DDF ; Extend # Mc [8] SINHALA VOWEL SIGN GAETTA-PILLA..SINHALA VOWEL SIGN GAYANUKITTA
0DF2..0DF3 ; Extend # Mc [2] SINHALA VOWEL SIGN DIGA GAETTA-PILLA..SINHALA VOWEL SIGN DIGA GAYANUKITTA
0E31 ; Extend # Mn THAI CHARACTER MAI HAN-AKAT
0E34..0E3A ; Extend # Mn [7] THAI CHARACTER SARA I..THAI CHARACTER PHINTHU
0E47..0E4E ; Extend # Mn [8] THAI CHARACTER MAITAIKHU..THAI CHARACTER YAMAKKAN
@ -126,69 +170,246 @@ E0020..E007F ; Format # Cf [96] TAG SPACE..CANCEL TAG
0F35 ; Extend # Mn TIBETAN MARK NGAS BZUNG NYI ZLA
0F37 ; Extend # Mn TIBETAN MARK NGAS BZUNG SGOR RTAGS
0F39 ; Extend # Mn TIBETAN MARK TSA -PHRU
0F3E..0F3F ; Extend # Mc [2] TIBETAN SIGN YAR TSHES..TIBETAN SIGN MAR TSHES
0F71..0F7E ; Extend # Mn [14] TIBETAN VOWEL SIGN AA..TIBETAN SIGN RJES SU NGA RO
0F7F ; Extend # Mc TIBETAN SIGN RNAM BCAD
0F80..0F84 ; Extend # Mn [5] TIBETAN VOWEL SIGN REVERSED I..TIBETAN MARK HALANTA
0F86..0F87 ; Extend # Mn [2] TIBETAN SIGN LCI RTAGS..TIBETAN SIGN YANG RTAGS
0F90..0F97 ; Extend # Mn [8] TIBETAN SUBJOINED LETTER KA..TIBETAN SUBJOINED LETTER JA
0F8D..0F97 ; Extend # Mn [11] TIBETAN SUBJOINED SIGN LCE TSA CAN..TIBETAN SUBJOINED LETTER JA
0F99..0FBC ; Extend # Mn [36] TIBETAN SUBJOINED LETTER NYA..TIBETAN SUBJOINED LETTER FIXED-FORM RA
0FC6 ; Extend # Mn TIBETAN SYMBOL PADMA GDAN
102B..102C ; Extend # Mc [2] MYANMAR VOWEL SIGN TALL AA..MYANMAR VOWEL SIGN AA
102D..1030 ; Extend # Mn [4] MYANMAR VOWEL SIGN I..MYANMAR VOWEL SIGN UU
1032 ; Extend # Mn MYANMAR VOWEL SIGN AI
1036..1037 ; Extend # Mn [2] MYANMAR SIGN ANUSVARA..MYANMAR SIGN DOT BELOW
1039 ; Extend # Mn MYANMAR SIGN VIRAMA
1031 ; Extend # Mc MYANMAR VOWEL SIGN E
1032..1037 ; Extend # Mn [6] MYANMAR VOWEL SIGN AI..MYANMAR SIGN DOT BELOW
1038 ; Extend # Mc MYANMAR SIGN VISARGA
1039..103A ; Extend # Mn [2] MYANMAR SIGN VIRAMA..MYANMAR SIGN ASAT
103B..103C ; Extend # Mc [2] MYANMAR CONSONANT SIGN MEDIAL YA..MYANMAR CONSONANT SIGN MEDIAL RA
103D..103E ; Extend # Mn [2] MYANMAR CONSONANT SIGN MEDIAL WA..MYANMAR CONSONANT SIGN MEDIAL HA
1056..1057 ; Extend # Mc [2] MYANMAR VOWEL SIGN VOCALIC R..MYANMAR VOWEL SIGN VOCALIC RR
1058..1059 ; Extend # Mn [2] MYANMAR VOWEL SIGN VOCALIC L..MYANMAR VOWEL SIGN VOCALIC LL
135F ; Extend # Mn ETHIOPIC COMBINING GEMINATION MARK
105E..1060 ; Extend # Mn [3] MYANMAR CONSONANT SIGN MON MEDIAL NA..MYANMAR CONSONANT SIGN MON MEDIAL LA
1062..1064 ; Extend # Mc [3] MYANMAR VOWEL SIGN SGAW KAREN EU..MYANMAR TONE MARK SGAW KAREN KE PHO
1067..106D ; Extend # Mc [7] MYANMAR VOWEL SIGN WESTERN PWO KAREN EU..MYANMAR SIGN WESTERN PWO KAREN TONE-5
1071..1074 ; Extend # Mn [4] MYANMAR VOWEL SIGN GEBA KAREN I..MYANMAR VOWEL SIGN KAYAH EE
1082 ; Extend # Mn MYANMAR CONSONANT SIGN SHAN MEDIAL WA
1083..1084 ; Extend # Mc [2] MYANMAR VOWEL SIGN SHAN AA..MYANMAR VOWEL SIGN SHAN E
1085..1086 ; Extend # Mn [2] MYANMAR VOWEL SIGN SHAN E ABOVE..MYANMAR VOWEL SIGN SHAN FINAL Y
1087..108C ; Extend # Mc [6] MYANMAR SIGN SHAN TONE-2..MYANMAR SIGN SHAN COUNCIL TONE-3
108D ; Extend # Mn MYANMAR SIGN SHAN COUNCIL EMPHATIC TONE
108F ; Extend # Mc MYANMAR SIGN RUMAI PALAUNG TONE-5
109A..109C ; Extend # Mc [3] MYANMAR SIGN KHAMTI TONE-1..MYANMAR VOWEL SIGN AITON A
109D ; Extend # Mn MYANMAR VOWEL SIGN AITON AI
135D..135F ; Extend # Mn [3] ETHIOPIC COMBINING GEMINATION AND VOWEL LENGTH MARK..ETHIOPIC COMBINING GEMINATION MARK
1712..1714 ; Extend # Mn [3] TAGALOG VOWEL SIGN I..TAGALOG SIGN VIRAMA
1732..1734 ; Extend # Mn [3] HANUNOO VOWEL SIGN I..HANUNOO SIGN PAMUDPOD
1752..1753 ; Extend # Mn [2] BUHID VOWEL SIGN I..BUHID VOWEL SIGN U
1772..1773 ; Extend # Mn [2] TAGBANWA VOWEL SIGN I..TAGBANWA VOWEL SIGN U
17B4..17B5 ; Extend # Mn [2] KHMER VOWEL INHERENT AQ..KHMER VOWEL INHERENT AA
17B6 ; Extend # Mc KHMER VOWEL SIGN AA
17B7..17BD ; Extend # Mn [7] KHMER VOWEL SIGN I..KHMER VOWEL SIGN UA
17BE..17C5 ; Extend # Mc [8] KHMER VOWEL SIGN OE..KHMER VOWEL SIGN AU
17C6 ; Extend # Mn KHMER SIGN NIKAHIT
17C7..17C8 ; Extend # Mc [2] KHMER SIGN REAHMUK..KHMER SIGN YUUKALEAPINTU
17C9..17D3 ; Extend # Mn [11] KHMER SIGN MUUSIKATOAN..KHMER SIGN BATHAMASAT
17DD ; Extend # Mn KHMER SIGN ATTHACAN
180B..180D ; Extend # Mn [3] MONGOLIAN FREE VARIATION SELECTOR ONE..MONGOLIAN FREE VARIATION SELECTOR THREE
18A9 ; Extend # Mn MONGOLIAN LETTER ALI GALI DAGALGA
1920..1922 ; Extend # Mn [3] LIMBU VOWEL SIGN A..LIMBU VOWEL SIGN U
1923..1926 ; Extend # Mc [4] LIMBU VOWEL SIGN EE..LIMBU VOWEL SIGN AU
1927..1928 ; Extend # Mn [2] LIMBU VOWEL SIGN E..LIMBU VOWEL SIGN O
1929..192B ; Extend # Mc [3] LIMBU SUBJOINED LETTER YA..LIMBU SUBJOINED LETTER WA
1930..1931 ; Extend # Mc [2] LIMBU SMALL LETTER KA..LIMBU SMALL LETTER NGA
1932 ; Extend # Mn LIMBU SMALL LETTER ANUSVARA
1933..1938 ; Extend # Mc [6] LIMBU SMALL LETTER TA..LIMBU SMALL LETTER LA
1939..193B ; Extend # Mn [3] LIMBU SIGN MUKPHRENG..LIMBU SIGN SA-I
19B0..19C0 ; Extend # Mc [17] NEW TAI LUE VOWEL SIGN VOWEL SHORTENER..NEW TAI LUE VOWEL SIGN IY
19C8..19C9 ; Extend # Mc [2] NEW TAI LUE TONE MARK-1..NEW TAI LUE TONE MARK-2
1A17..1A18 ; Extend # Mn [2] BUGINESE VOWEL SIGN I..BUGINESE VOWEL SIGN U
1A19..1A1B ; Extend # Mc [3] BUGINESE VOWEL SIGN E..BUGINESE VOWEL SIGN AE
1A55 ; Extend # Mc TAI THAM CONSONANT SIGN MEDIAL RA
1A56 ; Extend # Mn TAI THAM CONSONANT SIGN MEDIAL LA
1A57 ; Extend # Mc TAI THAM CONSONANT SIGN LA TANG LAI
1A58..1A5E ; Extend # Mn [7] TAI THAM SIGN MAI KANG LAI..TAI THAM CONSONANT SIGN SA
1A60 ; Extend # Mn TAI THAM SIGN SAKOT
1A61 ; Extend # Mc TAI THAM VOWEL SIGN A
1A62 ; Extend # Mn TAI THAM VOWEL SIGN MAI SAT
1A63..1A64 ; Extend # Mc [2] TAI THAM VOWEL SIGN AA..TAI THAM VOWEL SIGN TALL AA
1A65..1A6C ; Extend # Mn [8] TAI THAM VOWEL SIGN I..TAI THAM VOWEL SIGN OA BELOW
1A6D..1A72 ; Extend # Mc [6] TAI THAM VOWEL SIGN OY..TAI THAM VOWEL SIGN THAM AI
1A73..1A7C ; Extend # Mn [10] TAI THAM VOWEL SIGN OA ABOVE..TAI THAM SIGN KHUEN-LUE KARAN
1A7F ; Extend # Mn TAI THAM COMBINING CRYPTOGRAMMIC DOT
1B00..1B03 ; Extend # Mn [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
1B04 ; Extend # Mc BALINESE SIGN BISAH
1B34 ; Extend # Mn BALINESE SIGN REREKAN
1B35 ; Extend # Mc BALINESE VOWEL SIGN TEDUNG
1B36..1B3A ; Extend # Mn [5] BALINESE VOWEL SIGN ULU..BALINESE VOWEL SIGN RA REPA
1B3B ; Extend # Mc BALINESE VOWEL SIGN RA REPA TEDUNG
1B3C ; Extend # Mn BALINESE VOWEL SIGN LA LENGA
1B3D..1B41 ; Extend # Mc [5] BALINESE VOWEL SIGN LA LENGA TEDUNG..BALINESE VOWEL SIGN TALING REPA TEDUNG
1B42 ; Extend # Mn BALINESE VOWEL SIGN PEPET
1B43..1B44 ; Extend # Mc [2] BALINESE VOWEL SIGN PEPET TEDUNG..BALINESE ADEG ADEG
1B6B..1B73 ; Extend # Mn [9] BALINESE MUSICAL SYMBOL COMBINING TEGEH..BALINESE MUSICAL SYMBOL COMBINING GONG
1DC0..1DCA ; Extend # Mn [11] COMBINING DOTTED GRAVE ACCENT..COMBINING LATIN SMALL LETTER R BELOW
1DFE..1DFF ; Extend # Mn [2] COMBINING LEFT ARROWHEAD ABOVE..COMBINING RIGHT ARROWHEAD AND DOWN ARROWHEAD BELOW
1B80..1B81 ; Extend # Mn [2] SUNDANESE SIGN PANYECEK..SUNDANESE SIGN PANGLAYAR
1B82 ; Extend # Mc SUNDANESE SIGN PANGWISAD
1BA1 ; Extend # Mc SUNDANESE CONSONANT SIGN PAMINGKAL
1BA2..1BA5 ; Extend # Mn [4] SUNDANESE CONSONANT SIGN PANYAKRA..SUNDANESE VOWEL SIGN PANYUKU
1BA6..1BA7 ; Extend # Mc [2] SUNDANESE VOWEL SIGN PANAELAENG..SUNDANESE VOWEL SIGN PANOLONG
1BA8..1BA9 ; Extend # Mn [2] SUNDANESE VOWEL SIGN PAMEPET..SUNDANESE VOWEL SIGN PANEULEUNG
1BAA ; Extend # Mc SUNDANESE SIGN PAMAAEH
1BAB ; Extend # Mn SUNDANESE SIGN VIRAMA
1BAC..1BAD ; Extend # Mc [2] SUNDANESE CONSONANT SIGN PASANGAN MA..SUNDANESE CONSONANT SIGN PASANGAN WA
1BE6 ; Extend # Mn BATAK SIGN TOMPI
1BE7 ; Extend # Mc BATAK VOWEL SIGN E
1BE8..1BE9 ; Extend # Mn [2] BATAK VOWEL SIGN PAKPAK E..BATAK VOWEL SIGN EE
1BEA..1BEC ; Extend # Mc [3] BATAK VOWEL SIGN I..BATAK VOWEL SIGN O
1BED ; Extend # Mn BATAK VOWEL SIGN KARO O
1BEE ; Extend # Mc BATAK VOWEL SIGN U
1BEF..1BF1 ; Extend # Mn [3] BATAK VOWEL SIGN U FOR SIMALUNGUN SA..BATAK CONSONANT SIGN H
1BF2..1BF3 ; Extend # Mc [2] BATAK PANGOLAT..BATAK PANONGONAN
1C24..1C2B ; Extend # Mc [8] LEPCHA SUBJOINED LETTER YA..LEPCHA VOWEL SIGN UU
1C2C..1C33 ; Extend # Mn [8] LEPCHA VOWEL SIGN E..LEPCHA CONSONANT SIGN T
1C34..1C35 ; Extend # Mc [2] LEPCHA CONSONANT SIGN NYIN-DO..LEPCHA CONSONANT SIGN KANG
1C36..1C37 ; Extend # Mn [2] LEPCHA SIGN RAN..LEPCHA SIGN NUKTA
1CD0..1CD2 ; Extend # Mn [3] VEDIC TONE KARSHANA..VEDIC TONE PRENKHA
1CD4..1CE0 ; Extend # Mn [13] VEDIC SIGN YAJURVEDIC MIDLINE SVARITA..VEDIC TONE RIGVEDIC KASHMIRI INDEPENDENT SVARITA
1CE1 ; Extend # Mc VEDIC TONE ATHARVAVEDIC INDEPENDENT SVARITA
1CE2..1CE8 ; Extend # Mn [7] VEDIC SIGN VISARGA SVARITA..VEDIC SIGN VISARGA ANUDATTA WITH TAIL
1CED ; Extend # Mn VEDIC SIGN TIRYAK
1CF2..1CF3 ; Extend # Mc [2] VEDIC SIGN ARDHAVISARGA..VEDIC SIGN ROTATED ARDHAVISARGA
1CF4 ; Extend # Mn VEDIC TONE CANDRA ABOVE
1DC0..1DE6 ; Extend # Mn [39] COMBINING DOTTED GRAVE ACCENT..COMBINING LATIN SMALL LETTER Z
1DFC..1DFF ; Extend # Mn [4] COMBINING DOUBLE INVERTED BREVE BELOW..COMBINING RIGHT ARROWHEAD AND DOWN ARROWHEAD BELOW
200C..200D ; Extend # Cf [2] ZERO WIDTH NON-JOINER..ZERO WIDTH JOINER
20D0..20DC ; Extend # Mn [13] COMBINING LEFT HARPOON ABOVE..COMBINING FOUR DOTS ABOVE
20DD..20E0 ; Extend # Me [4] COMBINING ENCLOSING CIRCLE..COMBINING ENCLOSING CIRCLE BACKSLASH
20E1 ; Extend # Mn COMBINING LEFT RIGHT ARROW ABOVE
20E2..20E4 ; Extend # Me [3] COMBINING ENCLOSING SCREEN..COMBINING ENCLOSING UPWARD POINTING TRIANGLE
20E5..20EF ; Extend # Mn [11] COMBINING REVERSE SOLIDUS OVERLAY..COMBINING RIGHT ARROW BELOW
302A..302F ; Extend # Mn [6] IDEOGRAPHIC LEVEL TONE MARK..HANGUL DOUBLE DOT TONE MARK
20E5..20F0 ; Extend # Mn [12] COMBINING REVERSE SOLIDUS OVERLAY..COMBINING ASTERISK ABOVE
2CEF..2CF1 ; Extend # Mn [3] COPTIC COMBINING NI ABOVE..COPTIC COMBINING SPIRITUS LENIS
2D7F ; Extend # Mn TIFINAGH CONSONANT JOINER
2DE0..2DFF ; Extend # Mn [32] COMBINING CYRILLIC LETTER BE..COMBINING CYRILLIC LETTER IOTIFIED BIG YUS
302A..302D ; Extend # Mn [4] IDEOGRAPHIC LEVEL TONE MARK..IDEOGRAPHIC ENTERING TONE MARK
302E..302F ; Extend # Mc [2] HANGUL SINGLE DOT TONE MARK..HANGUL DOUBLE DOT TONE MARK
3099..309A ; Extend # Mn [2] COMBINING KATAKANA-HIRAGANA VOICED SOUND MARK..COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK
A66F ; Extend # Mn COMBINING CYRILLIC VZMET
A670..A672 ; Extend # Me [3] COMBINING CYRILLIC TEN MILLIONS SIGN..COMBINING CYRILLIC THOUSAND MILLIONS SIGN
A674..A67D ; Extend # Mn [10] COMBINING CYRILLIC LETTER UKRAINIAN IE..COMBINING CYRILLIC PAYEROK
A69F ; Extend # Mn COMBINING CYRILLIC LETTER IOTIFIED E
A6F0..A6F1 ; Extend # Mn [2] BAMUM COMBINING MARK KOQNDON..BAMUM COMBINING MARK TUKWENTIS
A802 ; Extend # Mn SYLOTI NAGRI SIGN DVISVARA
A806 ; Extend # Mn SYLOTI NAGRI SIGN HASANTA
A80B ; Extend # Mn SYLOTI NAGRI SIGN ANUSVARA
A823..A824 ; Extend # Mc [2] SYLOTI NAGRI VOWEL SIGN A..SYLOTI NAGRI VOWEL SIGN I
A825..A826 ; Extend # Mn [2] SYLOTI NAGRI VOWEL SIGN U..SYLOTI NAGRI VOWEL SIGN E
A827 ; Extend # Mc SYLOTI NAGRI VOWEL SIGN OO
A880..A881 ; Extend # Mc [2] SAURASHTRA SIGN ANUSVARA..SAURASHTRA SIGN VISARGA
A8B4..A8C3 ; Extend # Mc [16] SAURASHTRA CONSONANT SIGN HAARU..SAURASHTRA VOWEL SIGN AU
A8C4 ; Extend # Mn SAURASHTRA SIGN VIRAMA
A8E0..A8F1 ; Extend # Mn [18] COMBINING DEVANAGARI DIGIT ZERO..COMBINING DEVANAGARI SIGN AVAGRAHA
A926..A92D ; Extend # Mn [8] KAYAH LI VOWEL UE..KAYAH LI TONE CALYA PLOPHU
A947..A951 ; Extend # Mn [11] REJANG VOWEL SIGN I..REJANG CONSONANT SIGN R
A952..A953 ; Extend # Mc [2] REJANG CONSONANT SIGN H..REJANG VIRAMA
A980..A982 ; Extend # Mn [3] JAVANESE SIGN PANYANGGA..JAVANESE SIGN LAYAR
A983 ; Extend # Mc JAVANESE SIGN WIGNYAN
A9B3 ; Extend # Mn JAVANESE SIGN CECAK TELU
A9B4..A9B5 ; Extend # Mc [2] JAVANESE VOWEL SIGN TARUNG..JAVANESE VOWEL SIGN TOLONG
A9B6..A9B9 ; Extend # Mn [4] JAVANESE VOWEL SIGN WULU..JAVANESE VOWEL SIGN SUKU MENDUT
A9BA..A9BB ; Extend # Mc [2] JAVANESE VOWEL SIGN TALING..JAVANESE VOWEL SIGN DIRGA MURE
A9BC ; Extend # Mn JAVANESE VOWEL SIGN PEPET
A9BD..A9C0 ; Extend # Mc [4] JAVANESE CONSONANT SIGN KERET..JAVANESE PANGKON
AA29..AA2E ; Extend # Mn [6] CHAM VOWEL SIGN AA..CHAM VOWEL SIGN OE
AA2F..AA30 ; Extend # Mc [2] CHAM VOWEL SIGN O..CHAM VOWEL SIGN AI
AA31..AA32 ; Extend # Mn [2] CHAM VOWEL SIGN AU..CHAM VOWEL SIGN UE
AA33..AA34 ; Extend # Mc [2] CHAM CONSONANT SIGN YA..CHAM CONSONANT SIGN RA
AA35..AA36 ; Extend # Mn [2] CHAM CONSONANT SIGN LA..CHAM CONSONANT SIGN WA
AA43 ; Extend # Mn CHAM CONSONANT SIGN FINAL NG
AA4C ; Extend # Mn CHAM CONSONANT SIGN FINAL M
AA4D ; Extend # Mc CHAM CONSONANT SIGN FINAL H
AA7B ; Extend # Mc MYANMAR SIGN PAO KAREN TONE
AAB0 ; Extend # Mn TAI VIET MAI KANG
AAB2..AAB4 ; Extend # Mn [3] TAI VIET VOWEL I..TAI VIET VOWEL U
AAB7..AAB8 ; Extend # Mn [2] TAI VIET MAI KHIT..TAI VIET VOWEL IA
AABE..AABF ; Extend # Mn [2] TAI VIET VOWEL AM..TAI VIET TONE MAI EK
AAC1 ; Extend # Mn TAI VIET TONE MAI THO
AAEB ; Extend # Mc MEETEI MAYEK VOWEL SIGN II
AAEC..AAED ; Extend # Mn [2] MEETEI MAYEK VOWEL SIGN UU..MEETEI MAYEK VOWEL SIGN AAI
AAEE..AAEF ; Extend # Mc [2] MEETEI MAYEK VOWEL SIGN AU..MEETEI MAYEK VOWEL SIGN AAU
AAF5 ; Extend # Mc MEETEI MAYEK VOWEL SIGN VISARGA
AAF6 ; Extend # Mn MEETEI MAYEK VIRAMA
ABE3..ABE4 ; Extend # Mc [2] MEETEI MAYEK VOWEL SIGN ONAP..MEETEI MAYEK VOWEL SIGN INAP
ABE5 ; Extend # Mn MEETEI MAYEK VOWEL SIGN ANAP
ABE6..ABE7 ; Extend # Mc [2] MEETEI MAYEK VOWEL SIGN YENAP..MEETEI MAYEK VOWEL SIGN SOUNAP
ABE8 ; Extend # Mn MEETEI MAYEK VOWEL SIGN UNAP
ABE9..ABEA ; Extend # Mc [2] MEETEI MAYEK VOWEL SIGN CHEINAP..MEETEI MAYEK VOWEL SIGN NUNG
ABEC ; Extend # Mc MEETEI MAYEK LUM IYEK
ABED ; Extend # Mn MEETEI MAYEK APUN IYEK
FB1E ; Extend # Mn HEBREW POINT JUDEO-SPANISH VARIKA
FE00..FE0F ; Extend # Mn [16] VARIATION SELECTOR-1..VARIATION SELECTOR-16
FE20..FE23 ; Extend # Mn [4] COMBINING LIGATURE LEFT HALF..COMBINING DOUBLE TILDE RIGHT HALF
FE20..FE26 ; Extend # Mn [7] COMBINING LIGATURE LEFT HALF..COMBINING CONJOINING MACRON
FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDTH KATAKANA SEMI-VOICED SOUND MARK
101FD ; Extend # Mn PHAISTOS DISC SIGN COMBINING OBLIQUE STROKE
10A01..10A03 ; Extend # Mn [3] KHAROSHTHI VOWEL SIGN I..KHAROSHTHI VOWEL SIGN VOCALIC R
10A05..10A06 ; Extend # Mn [2] KHAROSHTHI VOWEL SIGN E..KHAROSHTHI VOWEL SIGN O
10A0C..10A0F ; Extend # Mn [4] KHAROSHTHI VOWEL LENGTH MARK..KHAROSHTHI SIGN VISARGA
10A38..10A3A ; Extend # Mn [3] KHAROSHTHI SIGN BAR ABOVE..KHAROSHTHI SIGN DOT BELOW
10A3F ; Extend # Mn KHAROSHTHI VIRAMA
1D165 ; Extend # Mc MUSICAL SYMBOL COMBINING STEM
11000 ; Extend # Mc BRAHMI SIGN CANDRABINDU
11001 ; Extend # Mn BRAHMI SIGN ANUSVARA
11002 ; Extend # Mc BRAHMI SIGN VISARGA
11038..11046 ; Extend # Mn [15] BRAHMI VOWEL SIGN AA..BRAHMI VIRAMA
11080..11081 ; Extend # Mn [2] KAITHI SIGN CANDRABINDU..KAITHI SIGN ANUSVARA
11082 ; Extend # Mc KAITHI SIGN VISARGA
110B0..110B2 ; Extend # Mc [3] KAITHI VOWEL SIGN AA..KAITHI VOWEL SIGN II
110B3..110B6 ; Extend # Mn [4] KAITHI VOWEL SIGN U..KAITHI VOWEL SIGN AI
110B7..110B8 ; Extend # Mc [2] KAITHI VOWEL SIGN O..KAITHI VOWEL SIGN AU
110B9..110BA ; Extend # Mn [2] KAITHI SIGN VIRAMA..KAITHI SIGN NUKTA
11100..11102 ; Extend # Mn [3] CHAKMA SIGN CANDRABINDU..CHAKMA SIGN VISARGA
11127..1112B ; Extend # Mn [5] CHAKMA VOWEL SIGN A..CHAKMA VOWEL SIGN UU
1112C ; Extend # Mc CHAKMA VOWEL SIGN E
1112D..11134 ; Extend # Mn [8] CHAKMA VOWEL SIGN AI..CHAKMA MAAYYAA
11180..11181 ; Extend # Mn [2] SHARADA SIGN CANDRABINDU..SHARADA SIGN ANUSVARA
11182 ; Extend # Mc SHARADA SIGN VISARGA
111B3..111B5 ; Extend # Mc [3] SHARADA VOWEL SIGN AA..SHARADA VOWEL SIGN II
111B6..111BE ; Extend # Mn [9] SHARADA VOWEL SIGN U..SHARADA VOWEL SIGN O
111BF..111C0 ; Extend # Mc [2] SHARADA VOWEL SIGN AU..SHARADA SIGN VIRAMA
116AB ; Extend # Mn TAKRI SIGN ANUSVARA
116AC ; Extend # Mc TAKRI SIGN VISARGA
116AD ; Extend # Mn TAKRI VOWEL SIGN AA
116AE..116AF ; Extend # Mc [2] TAKRI VOWEL SIGN I..TAKRI VOWEL SIGN II
116B0..116B5 ; Extend # Mn [6] TAKRI VOWEL SIGN U..TAKRI VOWEL SIGN AU
116B6 ; Extend # Mc TAKRI SIGN VIRAMA
116B7 ; Extend # Mn TAKRI SIGN NUKTA
16F51..16F7E ; Extend # Mc [46] MIAO SIGN ASPIRATION..MIAO VOWEL SIGN NG
16F8F..16F92 ; Extend # Mn [4] MIAO TONE RIGHT..MIAO TONE BELOW
1D165..1D166 ; Extend # Mc [2] MUSICAL SYMBOL COMBINING STEM..MUSICAL SYMBOL COMBINING SPRECHGESANG STEM
1D167..1D169 ; Extend # Mn [3] MUSICAL SYMBOL COMBINING TREMOLO-1..MUSICAL SYMBOL COMBINING TREMOLO-3
1D16E..1D172 ; Extend # Mc [5] MUSICAL SYMBOL COMBINING FLAG-1..MUSICAL SYMBOL COMBINING FLAG-5
1D16D..1D172 ; Extend # Mc [6] MUSICAL SYMBOL COMBINING AUGMENTATION DOT..MUSICAL SYMBOL COMBINING FLAG-5
1D17B..1D182 ; Extend # Mn [8] MUSICAL SYMBOL COMBINING ACCENT..MUSICAL SYMBOL COMBINING LOURE
1D185..1D18B ; Extend # Mn [7] MUSICAL SYMBOL COMBINING DOIT..MUSICAL SYMBOL COMBINING TRIPLE TONGUE
1D1AA..1D1AD ; Extend # Mn [4] MUSICAL SYMBOL COMBINING DOWN BOW..MUSICAL SYMBOL COMBINING SNAP PIZZICATO
1D242..1D244 ; Extend # Mn [3] COMBINING GREEK MUSICAL TRISEME..COMBINING GREEK MUSICAL PENTASEME
E0100..E01EF ; Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256

# Total code points: 1649

# ================================================

00AD ; Format # Cf SOFT HYPHEN
0600..0604 ; Format # Cf [5] ARABIC NUMBER SIGN..ARABIC SIGN SAMVAT
06DD ; Format # Cf ARABIC END OF AYAH
070F ; Format # Cf SYRIAC ABBREVIATION MARK
200E..200F ; Format # Cf [2] LEFT-TO-RIGHT MARK..RIGHT-TO-LEFT MARK
202A..202E ; Format # Cf [5] LEFT-TO-RIGHT EMBEDDING..RIGHT-TO-LEFT OVERRIDE
2060..2064 ; Format # Cf [5] WORD JOINER..INVISIBLE PLUS
206A..206F ; Format # Cf [6] INHIBIT SYMMETRIC SWAPPING..NOMINAL DIGIT SHAPES
FEFF ; Format # Cf ZERO WIDTH NO-BREAK SPACE
FFF9..FFFB ; Format # Cf [3] INTERLINEAR ANNOTATION ANCHOR..INTERLINEAR ANNOTATION TERMINATOR
110BD ; Format # Cf KAITHI NUMBER SIGN
1D173..1D17A ; Format # Cf [8] MUSICAL SYMBOL BEGIN BEAM..MUSICAL SYMBOL END PHRASE
E0001 ; Format # Cf LANGUAGE TAG
E0020..E007F ; Format # Cf [96] TAG SPACE..CANCEL TAG

# Total code points: 136

# ================================================
@ -200,20 +421,22 @@ E0100..E01EF ; Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
30FC..30FE ; Katakana # Lm [3] KATAKANA-HIRAGANA PROLONGED SOUND MARK..KATAKANA VOICED ITERATION MARK
30FF ; Katakana # Lo KATAKANA DIGRAPH KOTO
31F0..31FF ; Katakana # Lo [16] KATAKANA LETTER SMALL KU..KATAKANA LETTER SMALL RO
32D0..32FE ; Katakana # So [47] CIRCLED KATAKANA A..CIRCLED KATAKANA WO
3300..3357 ; Katakana # So [88] SQUARE APAATO..SQUARE WATTO
FF66..FF6F ; Katakana # Lo [10] HALFWIDTH KATAKANA LETTER WO..HALFWIDTH KATAKANA LETTER SMALL TU
FF70 ; Katakana # Lm HALFWIDTH KATAKANA-HIRAGANA PROLONGED SOUND MARK
FF71..FF9D ; Katakana # Lo [45] HALFWIDTH KATAKANA LETTER A..HALFWIDTH KATAKANA LETTER N
FF9E..FF9F ; Katakana # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDTH KATAKANA SEMI-VOICED SOUND MARK
1B000 ; Katakana # Lo KATAKANA LETTER ARCHAIC E

# Total code points: 176
# Total code points: 310

# ================================================

0041..005A ; ALetter # L& [26] LATIN CAPITAL LETTER A..LATIN CAPITAL LETTER Z
0061..007A ; ALetter # L& [26] LATIN SMALL LETTER A..LATIN SMALL LETTER Z
00AA ; ALetter # L& FEMININE ORDINAL INDICATOR
00AA ; ALetter # Lo FEMININE ORDINAL INDICATOR
00B5 ; ALetter # L& MICRO SIGN
00BA ; ALetter # L& MASCULINE ORDINAL INDICATOR
00BA ; ALetter # Lo MASCULINE ORDINAL INDICATOR
00C0..00D6 ; ALetter # L& [23] LATIN CAPITAL LETTER A WITH GRAVE..LATIN CAPITAL LETTER O WITH DIAERESIS
00D8..00F6 ; ALetter # L& [31] LATIN CAPITAL LETTER O WITH STROKE..LATIN SMALL LETTER O WITH DIAERESIS
00F8..01BA ; ALetter # L& [195] LATIN SMALL LETTER O WITH STROKE..LATIN SMALL LETTER EZH WITH TAIL
@ -226,24 +449,27 @@ FF9E..FF9F ; Katakana # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWI
02B0..02C1 ; ALetter # Lm [18] MODIFIER LETTER SMALL H..MODIFIER LETTER REVERSED GLOTTAL STOP
02C6..02D1 ; ALetter # Lm [12] MODIFIER LETTER CIRCUMFLEX ACCENT..MODIFIER LETTER HALF TRIANGULAR COLON
02E0..02E4 ; ALetter # Lm [5] MODIFIER LETTER SMALL GAMMA..MODIFIER LETTER SMALL REVERSED GLOTTAL STOP
02EC ; ALetter # Lm MODIFIER LETTER VOICING
02EE ; ALetter # Lm MODIFIER LETTER DOUBLE APOSTROPHE
0370..0373 ; ALetter # L& [4] GREEK CAPITAL LETTER HETA..GREEK SMALL LETTER ARCHAIC SAMPI
0374 ; ALetter # Lm GREEK NUMERAL SIGN
0376..0377 ; ALetter # L& [2] GREEK CAPITAL LETTER PAMPHYLIAN DIGAMMA..GREEK SMALL LETTER PAMPHYLIAN DIGAMMA
037A ; ALetter # Lm GREEK YPOGEGRAMMENI
037B..037D ; ALetter # L& [3] GREEK SMALL REVERSED LUNATE SIGMA SYMBOL..GREEK SMALL REVERSED DOTTED LUNATE SIGMA SYMBOL
0386 ; ALetter # L& GREEK CAPITAL LETTER ALPHA WITH TONOS
0388..038A ; ALetter # L& [3] GREEK CAPITAL LETTER EPSILON WITH TONOS..GREEK CAPITAL LETTER IOTA WITH TONOS
038C ; ALetter # L& GREEK CAPITAL LETTER OMICRON WITH TONOS
038E..03A1 ; ALetter # L& [20] GREEK CAPITAL LETTER UPSILON WITH TONOS..GREEK CAPITAL LETTER RHO
03A3..03CE ; ALetter # L& [44] GREEK CAPITAL LETTER SIGMA..GREEK SMALL LETTER OMEGA WITH TONOS
03D0..03F5 ; ALetter # L& [38] GREEK BETA SYMBOL..GREEK LUNATE EPSILON SYMBOL
03A3..03F5 ; ALetter # L& [83] GREEK CAPITAL LETTER SIGMA..GREEK LUNATE EPSILON SYMBOL
03F7..0481 ; ALetter # L& [139] GREEK CAPITAL LETTER SHO..CYRILLIC SMALL LETTER KOPPA
048A..0513 ; ALetter # L& [138] CYRILLIC CAPITAL LETTER SHORT I WITH TAIL..CYRILLIC SMALL LETTER EL WITH HOOK
048A..0527 ; ALetter # L& [158] CYRILLIC CAPITAL LETTER SHORT I WITH TAIL..CYRILLIC SMALL LETTER SHHA WITH DESCENDER
0531..0556 ; ALetter # L& [38] ARMENIAN CAPITAL LETTER AYB..ARMENIAN CAPITAL LETTER FEH
0559 ; ALetter # Lm ARMENIAN MODIFIER LETTER LEFT HALF RING
0561..0587 ; ALetter # L& [39] ARMENIAN SMALL LETTER AYB..ARMENIAN SMALL LIGATURE ECH YIWN
05D0..05EA ; ALetter # Lo [27] HEBREW LETTER ALEF..HEBREW LETTER TAV
05F0..05F2 ; ALetter # Lo [3] HEBREW LIGATURE YIDDISH DOUBLE VAV..HEBREW LIGATURE YIDDISH DOUBLE YOD
05F3 ; ALetter # Po HEBREW PUNCTUATION GERESH
0621..063A ; ALetter # Lo [26] ARABIC LETTER HAMZA..ARABIC LETTER GHAIN
0620..063F ; ALetter # Lo [32] ARABIC LETTER KASHMIRI YEH..ARABIC LETTER FARSI YEH WITH THREE DOTS ABOVE
0640 ; ALetter # Lm ARABIC TATWEEL
0641..064A ; ALetter # Lo [10] ARABIC LETTER FEH..ARABIC LETTER YEH
066E..066F ; ALetter # Lo [2] ARABIC LETTER DOTLESS BEH..ARABIC LETTER DOTLESS QAF
@ -255,21 +481,25 @@ FF9E..FF9F ; Katakana # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWI
06FF ; ALetter # Lo ARABIC LETTER HEH WITH INVERTED V
0710 ; ALetter # Lo SYRIAC LETTER ALAPH
0712..072F ; ALetter # Lo [30] SYRIAC LETTER BETH..SYRIAC LETTER PERSIAN DHALATH
074D..076D ; ALetter # Lo [33] SYRIAC LETTER SOGDIAN ZHAIN..ARABIC LETTER SEEN WITH TWO DOTS VERTICALLY ABOVE
0780..07A5 ; ALetter # Lo [38] THAANA LETTER HAA..THAANA LETTER WAAVU
074D..07A5 ; ALetter # Lo [89] SYRIAC LETTER SOGDIAN ZHAIN..THAANA LETTER WAAVU
07B1 ; ALetter # Lo THAANA LETTER NAA
07CA..07EA ; ALetter # Lo [33] NKO LETTER A..NKO LETTER JONA RA
07F4..07F5 ; ALetter # Lm [2] NKO HIGH TONE APOSTROPHE..NKO LOW TONE APOSTROPHE
07FA ; ALetter # Lm NKO LAJANYALAN
0903 ; ALetter # Mc DEVANAGARI SIGN VISARGA
0800..0815 ; ALetter # Lo [22] SAMARITAN LETTER ALAF..SAMARITAN LETTER TAAF
081A ; ALetter # Lm SAMARITAN MODIFIER LETTER EPENTHETIC YUT
0824 ; ALetter # Lm SAMARITAN MODIFIER LETTER SHORT A
0828 ; ALetter # Lm SAMARITAN MODIFIER LETTER I
0840..0858 ; ALetter # Lo [25] MANDAIC LETTER HALQA..MANDAIC LETTER AIN
08A0 ; ALetter # Lo ARABIC LETTER BEH WITH SMALL V BELOW
08A2..08AC ; ALetter # Lo [11] ARABIC LETTER JEEM WITH TWO DOTS ABOVE..ARABIC LETTER ROHINGYA YEH
0904..0939 ; ALetter # Lo [54] DEVANAGARI LETTER SHORT A..DEVANAGARI LETTER HA
093D ; ALetter # Lo DEVANAGARI SIGN AVAGRAHA
093E..0940 ; ALetter # Mc [3] DEVANAGARI VOWEL SIGN AA..DEVANAGARI VOWEL SIGN II
0949..094C ; ALetter # Mc [4] DEVANAGARI VOWEL SIGN CANDRA O..DEVANAGARI VOWEL SIGN AU
0950 ; ALetter # Lo DEVANAGARI OM
0958..0961 ; ALetter # Lo [10] DEVANAGARI LETTER QA..DEVANAGARI LETTER VOCALIC LL
097B..097F ; ALetter # Lo [5] DEVANAGARI LETTER GGA..DEVANAGARI LETTER BBA
0982..0983 ; ALetter # Mc [2] BENGALI SIGN ANUSVARA..BENGALI SIGN VISARGA
0971 ; ALetter # Lm DEVANAGARI SIGN HIGH SPACING DOT
0972..0977 ; ALetter # Lo [6] DEVANAGARI LETTER CANDRA A..DEVANAGARI LETTER UUE
0979..097F ; ALetter # Lo [7] DEVANAGARI LETTER ZHA..DEVANAGARI LETTER BBA
0985..098C ; ALetter # Lo [8] BENGALI LETTER A..BENGALI LETTER VOCALIC L
098F..0990 ; ALetter # Lo [2] BENGALI LETTER E..BENGALI LETTER AI
0993..09A8 ; ALetter # Lo [22] BENGALI LETTER O..BENGALI LETTER NA
@ -277,14 +507,10 @@ FF9E..FF9F ; Katakana # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWI
09B2 ; ALetter # Lo BENGALI LETTER LA
09B6..09B9 ; ALetter # Lo [4] BENGALI LETTER SHA..BENGALI LETTER HA
09BD ; ALetter # Lo BENGALI SIGN AVAGRAHA
09BF..09C0 ; ALetter # Mc [2] BENGALI VOWEL SIGN I..BENGALI VOWEL SIGN II
09C7..09C8 ; ALetter # Mc [2] BENGALI VOWEL SIGN E..BENGALI VOWEL SIGN AI
09CB..09CC ; ALetter # Mc [2] BENGALI VOWEL SIGN O..BENGALI VOWEL SIGN AU
09CE ; ALetter # Lo BENGALI LETTER KHANDA TA
09DC..09DD ; ALetter # Lo [2] BENGALI LETTER RRA..BENGALI LETTER RHA
09DF..09E1 ; ALetter # Lo [3] BENGALI LETTER YYA..BENGALI LETTER VOCALIC LL
09F0..09F1 ; ALetter # Lo [2] BENGALI LETTER RA WITH MIDDLE DIAGONAL..BENGALI LETTER RA WITH LOWER DIAGONAL
0A03 ; ALetter # Mc GURMUKHI SIGN VISARGA
0A05..0A0A ; ALetter # Lo [6] GURMUKHI LETTER A..GURMUKHI LETTER UU
0A0F..0A10 ; ALetter # Lo [2] GURMUKHI LETTER EE..GURMUKHI LETTER AI
0A13..0A28 ; ALetter # Lo [22] GURMUKHI LETTER OO..GURMUKHI LETTER NA
@ -292,11 +518,9 @@ FF9E..FF9F ; Katakana # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWI
0A32..0A33 ; ALetter # Lo [2] GURMUKHI LETTER LA..GURMUKHI LETTER LLA
0A35..0A36 ; ALetter # Lo [2] GURMUKHI LETTER VA..GURMUKHI LETTER SHA
0A38..0A39 ; ALetter # Lo [2] GURMUKHI LETTER SA..GURMUKHI LETTER HA
0A3E..0A40 ; ALetter # Mc [3] GURMUKHI VOWEL SIGN AA..GURMUKHI VOWEL SIGN II
0A59..0A5C ; ALetter # Lo [4] GURMUKHI LETTER KHHA..GURMUKHI LETTER RRA
0A5E ; ALetter # Lo GURMUKHI LETTER FA
0A72..0A74 ; ALetter # Lo [3] GURMUKHI IRI..GURMUKHI EK ONKAR
0A83 ; ALetter # Mc GUJARATI SIGN VISARGA
0A85..0A8D ; ALetter # Lo [9] GUJARATI LETTER A..GUJARATI VOWEL CANDRA E
0A8F..0A91 ; ALetter # Lo [3] GUJARATI LETTER E..GUJARATI VOWEL CANDRA O
0A93..0AA8 ; ALetter # Lo [22] GUJARATI LETTER O..GUJARATI LETTER NA
@ -304,12 +528,8 @@ FF9E..FF9F ; Katakana # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWI
0AB2..0AB3 ; ALetter # Lo [2] GUJARATI LETTER LA..GUJARATI LETTER LLA
0AB5..0AB9 ; ALetter # Lo [5] GUJARATI LETTER VA..GUJARATI LETTER HA
0ABD ; ALetter # Lo GUJARATI SIGN AVAGRAHA
0ABE..0AC0 ; ALetter # Mc [3] GUJARATI VOWEL SIGN AA..GUJARATI VOWEL SIGN II
0AC9 ; ALetter # Mc GUJARATI VOWEL SIGN CANDRA O
0ACB..0ACC ; ALetter # Mc [2] GUJARATI VOWEL SIGN O..GUJARATI VOWEL SIGN AU
0AD0 ; ALetter # Lo GUJARATI OM
0AE0..0AE1 ; ALetter # Lo [2] GUJARATI LETTER VOCALIC RR..GUJARATI LETTER VOCALIC LL
0B02..0B03 ; ALetter # Mc [2] ORIYA SIGN ANUSVARA..ORIYA SIGN VISARGA
0B05..0B0C ; ALetter # Lo [8] ORIYA LETTER A..ORIYA LETTER VOCALIC L
0B0F..0B10 ; ALetter # Lo [2] ORIYA LETTER E..ORIYA LETTER AI
0B13..0B28 ; ALetter # Lo [22] ORIYA LETTER O..ORIYA LETTER NA
@ -317,9 +537,6 @@ FF9E..FF9F ; Katakana # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWI
0B32..0B33 ; ALetter # Lo [2] ORIYA LETTER LA..ORIYA LETTER LLA
0B35..0B39 ; ALetter # Lo [5] ORIYA LETTER VA..ORIYA LETTER HA
0B3D ; ALetter # Lo ORIYA SIGN AVAGRAHA
0B40 ; ALetter # Mc ORIYA VOWEL SIGN II
0B47..0B48 ; ALetter # Mc [2] ORIYA VOWEL SIGN E..ORIYA VOWEL SIGN AI
0B4B..0B4C ; ALetter # Mc [2] ORIYA VOWEL SIGN O..ORIYA VOWEL SIGN AU
0B5C..0B5D ; ALetter # Lo [2] ORIYA LETTER RRA..ORIYA LETTER RHA
0B5F..0B61 ; ALetter # Lo [3] ORIYA LETTER YYA..ORIYA LETTER VOCALIC LL
0B71 ; ALetter # Lo ORIYA LETTER WA
@ -333,62 +550,46 @@ FF9E..FF9F ; Katakana # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWI
0BA3..0BA4 ; ALetter # Lo [2] TAMIL LETTER NNA..TAMIL LETTER TA
0BA8..0BAA ; ALetter # Lo [3] TAMIL LETTER NA..TAMIL LETTER PA
0BAE..0BB9 ; ALetter # Lo [12] TAMIL LETTER MA..TAMIL LETTER HA
0BBF ; ALetter # Mc TAMIL VOWEL SIGN I
0BC1..0BC2 ; ALetter # Mc [2] TAMIL VOWEL SIGN U..TAMIL VOWEL SIGN UU
0BC6..0BC8 ; ALetter # Mc [3] TAMIL VOWEL SIGN E..TAMIL VOWEL SIGN AI
0BCA..0BCC ; ALetter # Mc [3] TAMIL VOWEL SIGN O..TAMIL VOWEL SIGN AU
0C01..0C03 ; ALetter # Mc [3] TELUGU SIGN CANDRABINDU..TELUGU SIGN VISARGA
0BD0 ; ALetter # Lo TAMIL OM
0C05..0C0C ; ALetter # Lo [8] TELUGU LETTER A..TELUGU LETTER VOCALIC L
0C0E..0C10 ; ALetter # Lo [3] TELUGU LETTER E..TELUGU LETTER AI
0C12..0C28 ; ALetter # Lo [23] TELUGU LETTER O..TELUGU LETTER NA
0C2A..0C33 ; ALetter # Lo [10] TELUGU LETTER PA..TELUGU LETTER LLA
0C35..0C39 ; ALetter # Lo [5] TELUGU LETTER VA..TELUGU LETTER HA
0C41..0C44 ; ALetter # Mc [4] TELUGU VOWEL SIGN U..TELUGU VOWEL SIGN VOCALIC RR
0C3D ; ALetter # Lo TELUGU SIGN AVAGRAHA
0C58..0C59 ; ALetter # Lo [2] TELUGU LETTER TSA..TELUGU LETTER DZA
0C60..0C61 ; ALetter # Lo [2] TELUGU LETTER VOCALIC RR..TELUGU LETTER VOCALIC LL
0C82..0C83 ; ALetter # Mc [2] KANNADA SIGN ANUSVARA..KANNADA SIGN VISARGA
0C85..0C8C ; ALetter # Lo [8] KANNADA LETTER A..KANNADA LETTER VOCALIC L
0C8E..0C90 ; ALetter # Lo [3] KANNADA LETTER E..KANNADA LETTER AI
0C92..0CA8 ; ALetter # Lo [23] KANNADA LETTER O..KANNADA LETTER NA
0CAA..0CB3 ; ALetter # Lo [10] KANNADA LETTER PA..KANNADA LETTER LLA
0CB5..0CB9 ; ALetter # Lo [5] KANNADA LETTER VA..KANNADA LETTER HA
0CBD ; ALetter # Lo KANNADA SIGN AVAGRAHA
0CBE ; ALetter # Mc KANNADA VOWEL SIGN AA
0CC0..0CC1 ; ALetter # Mc [2] KANNADA VOWEL SIGN II..KANNADA VOWEL SIGN U
0CC3..0CC4 ; ALetter # Mc [2] KANNADA VOWEL SIGN VOCALIC R..KANNADA VOWEL SIGN VOCALIC RR
0CC7..0CC8 ; ALetter # Mc [2] KANNADA VOWEL SIGN EE..KANNADA VOWEL SIGN AI
0CCA..0CCB ; ALetter # Mc [2] KANNADA VOWEL SIGN O..KANNADA VOWEL SIGN OO
0CDE ; ALetter # Lo KANNADA LETTER FA
0CE0..0CE1 ; ALetter # Lo [2] KANNADA LETTER VOCALIC RR..KANNADA LETTER VOCALIC LL
0D02..0D03 ; ALetter # Mc [2] MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISARGA
0CF1..0CF2 ; ALetter # Lo [2] KANNADA SIGN JIHVAMULIYA..KANNADA SIGN UPADHMANIYA
0D05..0D0C ; ALetter # Lo [8] MALAYALAM LETTER A..MALAYALAM LETTER VOCALIC L
0D0E..0D10 ; ALetter # Lo [3] MALAYALAM LETTER E..MALAYALAM LETTER AI
0D12..0D28 ; ALetter # Lo [23] MALAYALAM LETTER O..MALAYALAM LETTER NA
0D2A..0D39 ; ALetter # Lo [16] MALAYALAM LETTER PA..MALAYALAM LETTER HA
0D3F..0D40 ; ALetter # Mc [2] MALAYALAM VOWEL SIGN I..MALAYALAM VOWEL SIGN II
0D46..0D48 ; ALetter # Mc [3] MALAYALAM VOWEL SIGN E..MALAYALAM VOWEL SIGN AI
0D4A..0D4C ; ALetter # Mc [3] MALAYALAM VOWEL SIGN O..MALAYALAM VOWEL SIGN AU
0D12..0D3A ; ALetter # Lo [41] MALAYALAM LETTER O..MALAYALAM LETTER TTTA
0D3D ; ALetter # Lo MALAYALAM SIGN AVAGRAHA
0D4E ; ALetter # Lo MALAYALAM LETTER DOT REPH
|
||||
0D60..0D61 ; ALetter # Lo [2] MALAYALAM LETTER VOCALIC RR..MALAYALAM LETTER VOCALIC LL
|
||||
0D82..0D83 ; ALetter # Mc [2] SINHALA SIGN ANUSVARAYA..SINHALA SIGN VISARGAYA
|
||||
0D7A..0D7F ; ALetter # Lo [6] MALAYALAM LETTER CHILLU NN..MALAYALAM LETTER CHILLU K
|
||||
0D85..0D96 ; ALetter # Lo [18] SINHALA LETTER AYANNA..SINHALA LETTER AUYANNA
|
||||
0D9A..0DB1 ; ALetter # Lo [24] SINHALA LETTER ALPAPRAANA KAYANNA..SINHALA LETTER DANTAJA NAYANNA
|
||||
0DB3..0DBB ; ALetter # Lo [9] SINHALA LETTER SANYAKA DAYANNA..SINHALA LETTER RAYANNA
|
||||
0DBD ; ALetter # Lo SINHALA LETTER DANTAJA LAYANNA
|
||||
0DC0..0DC6 ; ALetter # Lo [7] SINHALA LETTER VAYANNA..SINHALA LETTER FAYANNA
|
||||
0DD0..0DD1 ; ALetter # Mc [2] SINHALA VOWEL SIGN KETTI AEDA-PILLA..SINHALA VOWEL SIGN DIGA AEDA-PILLA
|
||||
0DD8..0DDE ; ALetter # Mc [7] SINHALA VOWEL SIGN GAETTA-PILLA..SINHALA VOWEL SIGN KOMBUVA HAA GAYANUKITTA
|
||||
0DF2..0DF3 ; ALetter # Mc [2] SINHALA VOWEL SIGN DIGA GAETTA-PILLA..SINHALA VOWEL SIGN DIGA GAYANUKITTA
|
||||
0F00 ; ALetter # Lo TIBETAN SYLLABLE OM
|
||||
0F40..0F47 ; ALetter # Lo [8] TIBETAN LETTER KA..TIBETAN LETTER JA
|
||||
0F49..0F6A ; ALetter # Lo [34] TIBETAN LETTER NYA..TIBETAN LETTER FIXED-FORM RA
|
||||
0F7F ; ALetter # Mc TIBETAN SIGN RNAM BCAD
|
||||
0F88..0F8B ; ALetter # Lo [4] TIBETAN SIGN LCE TSA CAN..TIBETAN SIGN GRU MED RGYINGS
|
||||
0F49..0F6C ; ALetter # Lo [36] TIBETAN LETTER NYA..TIBETAN LETTER RRA
|
||||
0F88..0F8C ; ALetter # Lo [5] TIBETAN SIGN LCE TSA CAN..TIBETAN SIGN INVERTED MCHU CAN
|
||||
10A0..10C5 ; ALetter # L& [38] GEORGIAN CAPITAL LETTER AN..GEORGIAN CAPITAL LETTER HOE
|
||||
10C7 ; ALetter # L& GEORGIAN CAPITAL LETTER YN
|
||||
10CD ; ALetter # L& GEORGIAN CAPITAL LETTER AEN
|
||||
10D0..10FA ; ALetter # Lo [43] GEORGIAN LETTER AN..GEORGIAN LETTER AIN
|
||||
10FC ; ALetter # Lm MODIFIER LETTER GEORGIAN NAR
|
||||
1100..1159 ; ALetter # Lo [90] HANGUL CHOSEONG KIYEOK..HANGUL CHOSEONG YEORINHIEUH
|
||||
115F..11A2 ; ALetter # Lo [68] HANGUL CHOSEONG FILLER..HANGUL JUNGSEONG SSANGARAEA
|
||||
11A8..11F9 ; ALetter # Lo [82] HANGUL JONGSEONG KIYEOK..HANGUL JONGSEONG YEORINHIEUH
|
||||
1200..1248 ; ALetter # Lo [73] ETHIOPIC SYLLABLE HA..ETHIOPIC SYLLABLE QWA
|
||||
10FD..1248 ; ALetter # Lo [332] GEORGIAN LETTER AEN..ETHIOPIC SYLLABLE QWA
|
||||
124A..124D ; ALetter # Lo [4] ETHIOPIC SYLLABLE QWI..ETHIOPIC SYLLABLE QWE
|
||||
1250..1256 ; ALetter # Lo [7] ETHIOPIC SYLLABLE QHA..ETHIOPIC SYLLABLE QHO
|
||||
1258 ; ALetter # Lo ETHIOPIC SYLLABLE QHWA
|
||||
@ -407,7 +608,7 @@ FF9E..FF9F ; Katakana # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWI
|
||||
1380..138F ; ALetter # Lo [16] ETHIOPIC SYLLABLE SEBATBEIT MWA..ETHIOPIC SYLLABLE PWE
|
||||
13A0..13F4 ; ALetter # Lo [85] CHEROKEE LETTER A..CHEROKEE LETTER YV
|
||||
1401..166C ; ALetter # Lo [620] CANADIAN SYLLABICS E..CANADIAN SYLLABICS CARRIER TTSA
|
||||
166F..1676 ; ALetter # Lo [8] CANADIAN SYLLABICS QAI..CANADIAN SYLLABICS NNGAA
|
||||
166F..167F ; ALetter # Lo [17] CANADIAN SYLLABICS QAI..CANADIAN SYLLABICS BLACKFOOT W
|
||||
1681..169A ; ALetter # Lo [26] OGHAM LETTER BEITH..OGHAM LETTER PEITH
|
||||
16A0..16EA ; ALetter # Lo [75] RUNIC LETTER FEHU FEOH FE F..RUNIC LETTER X
|
||||
16EE..16F0 ; ALetter # Nl [3] RUNIC ARLAUG SYMBOL..RUNIC BELGTHOR SYMBOL
|
||||
@ -421,29 +622,29 @@ FF9E..FF9F ; Katakana # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWI
|
||||
1843 ; ALetter # Lm MONGOLIAN LETTER TODO LONG VOWEL SIGN
|
||||
1844..1877 ; ALetter # Lo [52] MONGOLIAN LETTER TODO E..MONGOLIAN LETTER MANCHU ZHA
|
||||
1880..18A8 ; ALetter # Lo [41] MONGOLIAN LETTER ALI GALI ANUSVARA ONE..MONGOLIAN LETTER MANCHU ALI GALI BHA
|
||||
18AA ; ALetter # Lo MONGOLIAN LETTER MANCHU ALI GALI LHA
|
||||
18B0..18F5 ; ALetter # Lo [70] CANADIAN SYLLABICS OY..CANADIAN SYLLABICS CARRIER DENTAL S
|
||||
1900..191C ; ALetter # Lo [29] LIMBU VOWEL-CARRIER LETTER..LIMBU LETTER HA
|
||||
1923..1926 ; ALetter # Mc [4] LIMBU VOWEL SIGN EE..LIMBU VOWEL SIGN AU
|
||||
1929..192B ; ALetter # Mc [3] LIMBU SUBJOINED LETTER YA..LIMBU SUBJOINED LETTER WA
|
||||
1930..1931 ; ALetter # Mc [2] LIMBU SMALL LETTER KA..LIMBU SMALL LETTER NGA
|
||||
1933..1938 ; ALetter # Mc [6] LIMBU SMALL LETTER TA..LIMBU SMALL LETTER LA
|
||||
1A00..1A16 ; ALetter # Lo [23] BUGINESE LETTER KA..BUGINESE LETTER HA
|
||||
1A19..1A1B ; ALetter # Mc [3] BUGINESE VOWEL SIGN E..BUGINESE VOWEL SIGN AE
|
||||
1B04 ; ALetter # Mc BALINESE SIGN BISAH
|
||||
1B05..1B33 ; ALetter # Lo [47] BALINESE LETTER AKARA..BALINESE LETTER HA
|
||||
1B35 ; ALetter # Mc BALINESE VOWEL SIGN TEDUNG
|
||||
1B3B ; ALetter # Mc BALINESE VOWEL SIGN RA REPA TEDUNG
|
||||
1B3D..1B41 ; ALetter # Mc [5] BALINESE VOWEL SIGN LA LENGA TEDUNG..BALINESE VOWEL SIGN TALING REPA TEDUNG
|
||||
1B43 ; ALetter # Mc BALINESE VOWEL SIGN PEPET TEDUNG
|
||||
1B45..1B4B ; ALetter # Lo [7] BALINESE LETTER KAF SASAK..BALINESE LETTER ASYURA SASAK
|
||||
1B83..1BA0 ; ALetter # Lo [30] SUNDANESE LETTER A..SUNDANESE LETTER HA
|
||||
1BAE..1BAF ; ALetter # Lo [2] SUNDANESE LETTER KHA..SUNDANESE LETTER SYA
|
||||
1BBA..1BE5 ; ALetter # Lo [44] SUNDANESE AVAGRAHA..BATAK LETTER U
|
||||
1C00..1C23 ; ALetter # Lo [36] LEPCHA LETTER KA..LEPCHA LETTER A
|
||||
1C4D..1C4F ; ALetter # Lo [3] LEPCHA LETTER TTA..LEPCHA LETTER DDA
|
||||
1C5A..1C77 ; ALetter # Lo [30] OL CHIKI LETTER LA..OL CHIKI LETTER OH
|
||||
1C78..1C7D ; ALetter # Lm [6] OL CHIKI MU TTUDDAG..OL CHIKI AHAD
|
||||
1CE9..1CEC ; ALetter # Lo [4] VEDIC SIGN ANUSVARA ANTARGOMUKHA..VEDIC SIGN ANUSVARA VAMAGOMUKHA WITH TAIL
|
||||
1CEE..1CF1 ; ALetter # Lo [4] VEDIC SIGN HEXIFORM LONG ANUSVARA..VEDIC SIGN ANUSVARA UBHAYATO MUKHA
|
||||
1CF5..1CF6 ; ALetter # Lo [2] VEDIC SIGN JIHVAMULIYA..VEDIC SIGN UPADHMANIYA
|
||||
1D00..1D2B ; ALetter # L& [44] LATIN LETTER SMALL CAPITAL A..CYRILLIC LETTER SMALL CAPITAL EL
|
||||
1D2C..1D61 ; ALetter # Lm [54] MODIFIER LETTER CAPITAL A..MODIFIER LETTER SMALL CHI
|
||||
1D62..1D77 ; ALetter # L& [22] LATIN SUBSCRIPT SMALL LETTER I..LATIN SMALL LETTER TURNED G
|
||||
1D2C..1D6A ; ALetter # Lm [63] MODIFIER LETTER CAPITAL A..GREEK SUBSCRIPT SMALL LETTER CHI
|
||||
1D6B..1D77 ; ALetter # L& [13] LATIN SMALL LETTER UE..LATIN SMALL LETTER TURNED G
|
||||
1D78 ; ALetter # Lm MODIFIER LETTER CYRILLIC EN
|
||||
1D79..1D9A ; ALetter # L& [34] LATIN SMALL LETTER INSULAR G..LATIN SMALL LETTER EZH WITH RETROFLEX HOOK
|
||||
1D9B..1DBF ; ALetter # Lm [37] MODIFIER LETTER SMALL TURNED ALPHA..MODIFIER LETTER SMALL THETA
|
||||
1E00..1E9B ; ALetter # L& [156] LATIN CAPITAL LETTER A WITH RING BELOW..LATIN SMALL LETTER LONG S WITH DOT ABOVE
|
||||
1EA0..1EF9 ; ALetter # L& [90] LATIN CAPITAL LETTER A WITH DOT BELOW..LATIN SMALL LETTER Y WITH TILDE
|
||||
1F00..1F15 ; ALetter # L& [22] GREEK SMALL LETTER ALPHA WITH PSILI..GREEK SMALL LETTER EPSILON WITH DASIA AND OXIA
|
||||
1E00..1F15 ; ALetter # L& [278] LATIN CAPITAL LETTER A WITH RING BELOW..GREEK SMALL LETTER EPSILON WITH DASIA AND OXIA
|
||||
1F18..1F1D ; ALetter # L& [6] GREEK CAPITAL LETTER EPSILON WITH PSILI..GREEK CAPITAL LETTER EPSILON WITH DASIA AND OXIA
|
||||
1F20..1F45 ; ALetter # L& [38] GREEK SMALL LETTER ETA WITH PSILI..GREEK SMALL LETTER OMICRON WITH DASIA AND OXIA
|
||||
1F48..1F4D ; ALetter # L& [6] GREEK CAPITAL LETTER OMICRON WITH PSILI..GREEK CAPITAL LETTER OMICRON WITH DASIA AND OXIA
|
||||
@ -462,9 +663,9 @@ FF9E..FF9F ; Katakana # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWI
|
||||
1FE0..1FEC ; ALetter # L& [13] GREEK SMALL LETTER UPSILON WITH VRACHY..GREEK CAPITAL LETTER RHO WITH DASIA
|
||||
1FF2..1FF4 ; ALetter # L& [3] GREEK SMALL LETTER OMEGA WITH VARIA AND YPOGEGRAMMENI..GREEK SMALL LETTER OMEGA WITH OXIA AND YPOGEGRAMMENI
|
||||
1FF6..1FFC ; ALetter # L& [7] GREEK SMALL LETTER OMEGA WITH PERISPOMENI..GREEK CAPITAL LETTER OMEGA WITH PROSGEGRAMMENI
|
||||
2071 ; ALetter # L& SUPERSCRIPT LATIN SMALL LETTER I
|
||||
207F ; ALetter # L& SUPERSCRIPT LATIN SMALL LETTER N
|
||||
2090..2094 ; ALetter # Lm [5] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER SCHWA
|
||||
2071 ; ALetter # Lm SUPERSCRIPT LATIN SMALL LETTER I
|
||||
207F ; ALetter # Lm SUPERSCRIPT LATIN SMALL LETTER N
|
||||
2090..209C ; ALetter # Lm [13] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER T
|
||||
2102 ; ALetter # L& DOUBLE-STRUCK CAPITAL C
|
||||
2107 ; ALetter # L& EULER CONSTANT
|
||||
210A..2113 ; ALetter # L& [10] SCRIPT SMALL G..SCRIPT SMALL L
|
||||
@ -482,14 +683,19 @@ FF9E..FF9F ; Katakana # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWI
|
||||
214E ; ALetter # L& TURNED SMALL F
|
||||
2160..2182 ; ALetter # Nl [35] ROMAN NUMERAL ONE..ROMAN NUMERAL TEN THOUSAND
|
||||
2183..2184 ; ALetter # L& [2] ROMAN NUMERAL REVERSED ONE HUNDRED..LATIN SMALL LETTER REVERSED C
|
||||
2185..2188 ; ALetter # Nl [4] ROMAN NUMERAL SIX LATE FORM..ROMAN NUMERAL ONE HUNDRED THOUSAND
|
||||
24B6..24E9 ; ALetter # So [52] CIRCLED LATIN CAPITAL LETTER A..CIRCLED LATIN SMALL LETTER Z
|
||||
2C00..2C2E ; ALetter # L& [47] GLAGOLITIC CAPITAL LETTER AZU..GLAGOLITIC CAPITAL LETTER LATINATE MYSLITE
|
||||
2C30..2C5E ; ALetter # L& [47] GLAGOLITIC SMALL LETTER AZU..GLAGOLITIC SMALL LETTER LATINATE MYSLITE
|
||||
2C60..2C6C ; ALetter # L& [13] LATIN CAPITAL LETTER L WITH DOUBLE BAR..LATIN SMALL LETTER Z WITH DESCENDER
|
||||
2C74..2C77 ; ALetter # L& [4] LATIN SMALL LETTER V WITH CURL..LATIN SMALL LETTER TAILLESS PHI
|
||||
2C80..2CE4 ; ALetter # L& [101] COPTIC CAPITAL LETTER ALFA..COPTIC SYMBOL KAI
|
||||
2C60..2C7B ; ALetter # L& [28] LATIN CAPITAL LETTER L WITH DOUBLE BAR..LATIN LETTER SMALL CAPITAL TURNED E
|
||||
2C7C..2C7D ; ALetter # Lm [2] LATIN SUBSCRIPT SMALL LETTER J..MODIFIER LETTER CAPITAL V
|
||||
2C7E..2CE4 ; ALetter # L& [103] LATIN CAPITAL LETTER S WITH SWASH TAIL..COPTIC SYMBOL KAI
|
||||
2CEB..2CEE ; ALetter # L& [4] COPTIC CAPITAL LETTER CRYPTOGRAMMIC SHEI..COPTIC SMALL LETTER CRYPTOGRAMMIC GANGIA
|
||||
2CF2..2CF3 ; ALetter # L& [2] COPTIC CAPITAL LETTER BOHAIRIC KHEI..COPTIC SMALL LETTER BOHAIRIC KHEI
|
||||
2D00..2D25 ; ALetter # L& [38] GEORGIAN SMALL LETTER AN..GEORGIAN SMALL LETTER HOE
|
||||
2D30..2D65 ; ALetter # Lo [54] TIFINAGH LETTER YA..TIFINAGH LETTER YAZZ
|
||||
2D27 ; ALetter # L& GEORGIAN SMALL LETTER YN
|
||||
2D2D ; ALetter # L& GEORGIAN SMALL LETTER AEN
|
||||
2D30..2D67 ; ALetter # Lo [56] TIFINAGH LETTER YA..TIFINAGH LETTER YO
|
||||
2D6F ; ALetter # Lm TIFINAGH MODIFIER LETTER LABIALIZATION MARK
|
||||
2D80..2D96 ; ALetter # Lo [23] ETHIOPIC SYLLABLE LOA..ETHIOPIC SYLLABLE GGWE
|
||||
2DA0..2DA6 ; ALetter # Lo [7] ETHIOPIC SYLLABLE SSA..ETHIOPIC SYLLABLE SSO
|
||||
@ -500,25 +706,66 @@ FF9E..FF9F ; Katakana # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWI
|
||||
2DC8..2DCE ; ALetter # Lo [7] ETHIOPIC SYLLABLE KYA..ETHIOPIC SYLLABLE KYO
|
||||
2DD0..2DD6 ; ALetter # Lo [7] ETHIOPIC SYLLABLE XYA..ETHIOPIC SYLLABLE XYO
|
||||
2DD8..2DDE ; ALetter # Lo [7] ETHIOPIC SYLLABLE GYA..ETHIOPIC SYLLABLE GYO
|
||||
2E2F ; ALetter # Lm VERTICAL TILDE
|
||||
3005 ; ALetter # Lm IDEOGRAPHIC ITERATION MARK
|
||||
303B ; ALetter # Lm VERTICAL IDEOGRAPHIC ITERATION MARK
|
||||
303C ; ALetter # Lo MASU MARK
|
||||
3105..312C ; ALetter # Lo [40] BOPOMOFO LETTER B..BOPOMOFO LETTER GN
|
||||
3105..312D ; ALetter # Lo [41] BOPOMOFO LETTER B..BOPOMOFO LETTER IH
|
||||
3131..318E ; ALetter # Lo [94] HANGUL LETTER KIYEOK..HANGUL LETTER ARAEAE
|
||||
31A0..31B7 ; ALetter # Lo [24] BOPOMOFO LETTER BU..BOPOMOFO FINAL LETTER H
|
||||
31A0..31BA ; ALetter # Lo [27] BOPOMOFO LETTER BU..BOPOMOFO LETTER ZY
|
||||
A000..A014 ; ALetter # Lo [21] YI SYLLABLE IT..YI SYLLABLE E
|
||||
A015 ; ALetter # Lm YI SYLLABLE WU
|
||||
A016..A48C ; ALetter # Lo [1143] YI SYLLABLE BIT..YI SYLLABLE YYR
|
||||
A717..A71A ; ALetter # Lm [4] MODIFIER LETTER DOT VERTICAL BAR..MODIFIER LETTER LOWER RIGHT CORNER ANGLE
|
||||
A800..A801 ; ALetter # Lo [2] SYLOTI NAGRI LETTER A..SYLOTI NAGRI LETTER I
|
||||
A4D0..A4F7 ; ALetter # Lo [40] LISU LETTER BA..LISU LETTER OE
|
||||
A4F8..A4FD ; ALetter # Lm [6] LISU LETTER TONE MYA TI..LISU LETTER TONE MYA JEU
|
||||
A500..A60B ; ALetter # Lo [268] VAI SYLLABLE EE..VAI SYLLABLE NG
|
||||
A60C ; ALetter # Lm VAI SYLLABLE LENGTHENER
|
||||
A610..A61F ; ALetter # Lo [16] VAI SYLLABLE NDOLE FA..VAI SYMBOL JONG
|
||||
A62A..A62B ; ALetter # Lo [2] VAI SYLLABLE NDOLE MA..VAI SYLLABLE NDOLE DO
|
||||
A640..A66D ; ALetter # L& [46] CYRILLIC CAPITAL LETTER ZEMLYA..CYRILLIC SMALL LETTER DOUBLE MONOCULAR O
|
||||
A66E ; ALetter # Lo CYRILLIC LETTER MULTIOCULAR O
|
||||
A67F ; ALetter # Lm CYRILLIC PAYEROK
|
||||
A680..A697 ; ALetter # L& [24] CYRILLIC CAPITAL LETTER DWE..CYRILLIC SMALL LETTER SHWE
|
||||
A6A0..A6E5 ; ALetter # Lo [70] BAMUM LETTER A..BAMUM LETTER KI
|
||||
A6E6..A6EF ; ALetter # Nl [10] BAMUM LETTER MO..BAMUM LETTER KOGHOM
|
||||
A717..A71F ; ALetter # Lm [9] MODIFIER LETTER DOT VERTICAL BAR..MODIFIER LETTER LOW INVERTED EXCLAMATION MARK
|
||||
A722..A76F ; ALetter # L& [78] LATIN CAPITAL LETTER EGYPTOLOGICAL ALEF..LATIN SMALL LETTER CON
|
||||
A770 ; ALetter # Lm MODIFIER LETTER US
|
||||
A771..A787 ; ALetter # L& [23] LATIN SMALL LETTER DUM..LATIN SMALL LETTER INSULAR T
|
||||
A788 ; ALetter # Lm MODIFIER LETTER LOW CIRCUMFLEX ACCENT
|
||||
A78B..A78E ; ALetter # L& [4] LATIN CAPITAL LETTER SALTILLO..LATIN SMALL LETTER L WITH RETROFLEX HOOK AND BELT
|
||||
A790..A793 ; ALetter # L& [4] LATIN CAPITAL LETTER N WITH DESCENDER..LATIN SMALL LETTER C WITH BAR
|
||||
A7A0..A7AA ; ALetter # L& [11] LATIN CAPITAL LETTER G WITH OBLIQUE STROKE..LATIN CAPITAL LETTER H WITH HOOK
|
||||
A7F8..A7F9 ; ALetter # Lm [2] MODIFIER LETTER CAPITAL H WITH STROKE..MODIFIER LETTER SMALL LIGATURE OE
|
||||
A7FA ; ALetter # L& LATIN LETTER SMALL CAPITAL TURNED M
|
||||
A7FB..A801 ; ALetter # Lo [7] LATIN EPIGRAPHIC LETTER REVERSED F..SYLOTI NAGRI LETTER I
|
||||
A803..A805 ; ALetter # Lo [3] SYLOTI NAGRI LETTER U..SYLOTI NAGRI LETTER O
|
||||
A807..A80A ; ALetter # Lo [4] SYLOTI NAGRI LETTER KO..SYLOTI NAGRI LETTER GHO
|
||||
A80C..A822 ; ALetter # Lo [23] SYLOTI NAGRI LETTER CO..SYLOTI NAGRI LETTER HO
|
||||
A823..A824 ; ALetter # Mc [2] SYLOTI NAGRI VOWEL SIGN A..SYLOTI NAGRI VOWEL SIGN I
|
||||
A827 ; ALetter # Mc SYLOTI NAGRI VOWEL SIGN OO
|
||||
A840..A873 ; ALetter # Lo [52] PHAGS-PA LETTER KA..PHAGS-PA LETTER CANDRABINDU
|
||||
A882..A8B3 ; ALetter # Lo [50] SAURASHTRA LETTER A..SAURASHTRA LETTER LLA
|
||||
A8F2..A8F7 ; ALetter # Lo [6] DEVANAGARI SIGN SPACING CANDRABINDU..DEVANAGARI SIGN CANDRABINDU AVAGRAHA
|
||||
A8FB ; ALetter # Lo DEVANAGARI HEADSTROKE
|
||||
A90A..A925 ; ALetter # Lo [28] KAYAH LI LETTER KA..KAYAH LI LETTER OO
|
||||
A930..A946 ; ALetter # Lo [23] REJANG LETTER KA..REJANG LETTER A
|
||||
A960..A97C ; ALetter # Lo [29] HANGUL CHOSEONG TIKEUT-MIEUM..HANGUL CHOSEONG SSANGYEORINHIEUH
|
||||
A984..A9B2 ; ALetter # Lo [47] JAVANESE LETTER A..JAVANESE LETTER HA
|
||||
A9CF ; ALetter # Lm JAVANESE PANGRANGKEP
|
||||
AA00..AA28 ; ALetter # Lo [41] CHAM LETTER A..CHAM LETTER HA
|
||||
AA40..AA42 ; ALetter # Lo [3] CHAM LETTER FINAL K..CHAM LETTER FINAL NG
|
||||
AA44..AA4B ; ALetter # Lo [8] CHAM LETTER FINAL CH..CHAM LETTER FINAL SS
|
||||
AAE0..AAEA ; ALetter # Lo [11] MEETEI MAYEK LETTER E..MEETEI MAYEK LETTER SSA
|
||||
AAF2 ; ALetter # Lo MEETEI MAYEK ANJI
|
||||
AAF3..AAF4 ; ALetter # Lm [2] MEETEI MAYEK SYLLABLE REPETITION MARK..MEETEI MAYEK WORD REPETITION MARK
|
||||
AB01..AB06 ; ALetter # Lo [6] ETHIOPIC SYLLABLE TTHU..ETHIOPIC SYLLABLE TTHO
|
||||
AB09..AB0E ; ALetter # Lo [6] ETHIOPIC SYLLABLE DDHU..ETHIOPIC SYLLABLE DDHO
|
||||
AB11..AB16 ; ALetter # Lo [6] ETHIOPIC SYLLABLE DZU..ETHIOPIC SYLLABLE DZO
|
||||
AB20..AB26 ; ALetter # Lo [7] ETHIOPIC SYLLABLE CCHHA..ETHIOPIC SYLLABLE CCHHO
|
||||
AB28..AB2E ; ALetter # Lo [7] ETHIOPIC SYLLABLE BBA..ETHIOPIC SYLLABLE BBO
|
||||
ABC0..ABE2 ; ALetter # Lo [35] MEETEI MAYEK LETTER KOK..MEETEI MAYEK LETTER I LONSUM
|
||||
AC00..D7A3 ; ALetter # Lo [11172] HANGUL SYLLABLE GA..HANGUL SYLLABLE HIH
|
||||
FA30..FA6A ; ALetter # Lo [59] CJK COMPATIBILITY IDEOGRAPH-FA30..CJK COMPATIBILITY IDEOGRAPH-FA6A
|
||||
D7B0..D7C6 ; ALetter # Lo [23] HANGUL JUNGSEONG O-YEO..HANGUL JUNGSEONG ARAEA-E
|
||||
D7CB..D7FB ; ALetter # Lo [49] HANGUL JONGSEONG NIEUN-RIEUL..HANGUL JONGSEONG PHIEUPH-THIEUTH
|
||||
FB00..FB06 ; ALetter # L& [7] LATIN SMALL LIGATURE FF..LATIN SMALL LIGATURE ST
|
||||
FB13..FB17 ; ALetter # L& [5] ARMENIAN SMALL LIGATURE MEN NOW..ARMENIAN SMALL LIGATURE MEN XEH
|
||||
FB1D ; ALetter # Lo HEBREW LETTER YOD WITH HIRIQ
|
||||
@ -550,6 +797,8 @@ FFDA..FFDC ; ALetter # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
|
||||
10050..1005D ; ALetter # Lo [14] LINEAR B SYMBOL B018..LINEAR B SYMBOL B089
|
||||
10080..100FA ; ALetter # Lo [123] LINEAR B IDEOGRAM B100 MAN..LINEAR B IDEOGRAM VESSEL B305
|
||||
10140..10174 ; ALetter # Nl [53] GREEK ACROPHONIC ATTIC ONE QUARTER..GREEK ACROPHONIC STRATIAN FIFTY MNAS
|
||||
10280..1029C ; ALetter # Lo [29] LYCIAN LETTER A..LYCIAN LETTER X
|
||||
102A0..102D0 ; ALetter # Lo [49] CARIAN LETTER A..CARIAN LETTER UUU3
|
||||
10300..1031E ; ALetter # Lo [31] OLD ITALIC LETTER A..OLD ITALIC LETTER UU
|
||||
10330..10340 ; ALetter # Lo [17] GOTHIC LETTER AHSA..GOTHIC LETTER PAIRTHRA
|
||||
10341 ; ALetter # Nl GOTHIC LETTER NINETY
|
||||
@ -566,14 +815,34 @@ FFDA..FFDC ; ALetter # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
|
||||
1080A..10835 ; ALetter # Lo [44] CYPRIOT SYLLABLE KA..CYPRIOT SYLLABLE WO
|
||||
10837..10838 ; ALetter # Lo [2] CYPRIOT SYLLABLE XA..CYPRIOT SYLLABLE XE
|
||||
1083C ; ALetter # Lo CYPRIOT SYLLABLE ZA
|
||||
1083F ; ALetter # Lo CYPRIOT SYLLABLE ZO
|
||||
1083F..10855 ; ALetter # Lo [23] CYPRIOT SYLLABLE ZO..IMPERIAL ARAMAIC LETTER TAW
|
||||
10900..10915 ; ALetter # Lo [22] PHOENICIAN LETTER ALF..PHOENICIAN LETTER TAU
|
||||
10920..10939 ; ALetter # Lo [26] LYDIAN LETTER A..LYDIAN LETTER C
|
||||
10980..109B7 ; ALetter # Lo [56] MEROITIC HIEROGLYPHIC LETTER A..MEROITIC CURSIVE LETTER DA
|
||||
109BE..109BF ; ALetter # Lo [2] MEROITIC CURSIVE LOGOGRAM RMT..MEROITIC CURSIVE LOGOGRAM IMN
|
||||
10A00 ; ALetter # Lo KHAROSHTHI LETTER A
|
||||
10A10..10A13 ; ALetter # Lo [4] KHAROSHTHI LETTER KA..KHAROSHTHI LETTER GHA
|
||||
10A15..10A17 ; ALetter # Lo [3] KHAROSHTHI LETTER CA..KHAROSHTHI LETTER JA
|
||||
10A19..10A33 ; ALetter # Lo [27] KHAROSHTHI LETTER NYA..KHAROSHTHI LETTER TTTHA
|
||||
10A60..10A7C ; ALetter # Lo [29] OLD SOUTH ARABIAN LETTER HE..OLD SOUTH ARABIAN LETTER THETH
|
||||
10B00..10B35 ; ALetter # Lo [54] AVESTAN LETTER A..AVESTAN LETTER HE
|
||||
10B40..10B55 ; ALetter # Lo [22] INSCRIPTIONAL PARTHIAN LETTER ALEPH..INSCRIPTIONAL PARTHIAN LETTER TAW
|
||||
10B60..10B72 ; ALetter # Lo [19] INSCRIPTIONAL PAHLAVI LETTER ALEPH..INSCRIPTIONAL PAHLAVI LETTER TAW
|
||||
10C00..10C48 ; ALetter # Lo [73] OLD TURKIC LETTER ORKHON A..OLD TURKIC LETTER ORKHON BASH
|
||||
11003..11037 ; ALetter # Lo [53] BRAHMI SIGN JIHVAMULIYA..BRAHMI LETTER OLD TAMIL NNNA
|
||||
11083..110AF ; ALetter # Lo [45] KAITHI LETTER A..KAITHI LETTER HA
|
||||
110D0..110E8 ; ALetter # Lo [25] SORA SOMPENG LETTER SAH..SORA SOMPENG LETTER MAE
|
||||
11103..11126 ; ALetter # Lo [36] CHAKMA LETTER AA..CHAKMA LETTER HAA
|
||||
11183..111B2 ; ALetter # Lo [48] SHARADA LETTER A..SHARADA LETTER HA
|
||||
111C1..111C4 ; ALetter # Lo [4] SHARADA SIGN AVAGRAHA..SHARADA OM
|
||||
11680..116AA ; ALetter # Lo [43] TAKRI LETTER A..TAKRI LETTER RRA
|
||||
12000..1236E ; ALetter # Lo [879] CUNEIFORM SIGN A..CUNEIFORM SIGN ZUM
|
||||
12400..12462 ; ALetter # Nl [99] CUNEIFORM NUMERIC SIGN TWO ASH..CUNEIFORM NUMERIC SIGN OLD ASSYRIAN ONE QUARTER
|
||||
13000..1342E ; ALetter # Lo [1071] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH AA032
|
||||
16800..16A38 ; ALetter # Lo [569] BAMUM LETTER PHASE-A NGKUE MFON..BAMUM LETTER PHASE-F VUEQ
|
||||
16F00..16F44 ; ALetter # Lo [69] MIAO LETTER PA..MIAO LETTER HHA
|
||||
16F50 ; ALetter # Lo MIAO LETTER NASALIZATION
|
||||
16F93..16F9F ; ALetter # Lm [13] MIAO LETTER TONE-2..MIAO LETTER REFORMED TONE-8
|
||||
1D400..1D454 ; ALetter # L& [85] MATHEMATICAL BOLD CAPITAL A..MATHEMATICAL ITALIC SMALL G
|
||||
1D456..1D49C ; ALetter # L& [71] MATHEMATICAL ITALIC SMALL I..MATHEMATICAL SCRIPT CAPITAL A
|
||||
1D49E..1D49F ; ALetter # L& [2] MATHEMATICAL SCRIPT CAPITAL C..MATHEMATICAL SCRIPT CAPITAL D
|
||||
@ -604,40 +873,92 @@ FFDA..FFDC ; ALetter # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
|
||||
1D78A..1D7A8 ; ALetter # L& [31] MATHEMATICAL SANS-SERIF BOLD EPSILON SYMBOL..MATHEMATICAL SANS-SERIF BOLD ITALIC CAPITAL OMEGA
|
||||
1D7AA..1D7C2 ; ALetter # L& [25] MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL ALPHA..MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL OMEGA
|
||||
1D7C4..1D7CB ; ALetter # L& [8] MATHEMATICAL SANS-SERIF BOLD ITALIC EPSILON SYMBOL..MATHEMATICAL BOLD SMALL DIGAMMA
|
||||
1EE00..1EE03 ; ALetter # Lo [4] ARABIC MATHEMATICAL ALEF..ARABIC MATHEMATICAL DAL
|
||||
1EE05..1EE1F ; ALetter # Lo [27] ARABIC MATHEMATICAL WAW..ARABIC MATHEMATICAL DOTLESS QAF
|
||||
1EE21..1EE22 ; ALetter # Lo [2] ARABIC MATHEMATICAL INITIAL BEH..ARABIC MATHEMATICAL INITIAL JEEM
|
||||
1EE24 ; ALetter # Lo ARABIC MATHEMATICAL INITIAL HEH
|
||||
1EE27 ; ALetter # Lo ARABIC MATHEMATICAL INITIAL HAH
|
||||
1EE29..1EE32 ; ALetter # Lo [10] ARABIC MATHEMATICAL INITIAL YEH..ARABIC MATHEMATICAL INITIAL QAF
|
||||
1EE34..1EE37 ; ALetter # Lo [4] ARABIC MATHEMATICAL INITIAL SHEEN..ARABIC MATHEMATICAL INITIAL KHAH
|
||||
1EE39 ; ALetter # Lo ARABIC MATHEMATICAL INITIAL DAD
|
||||
1EE3B ; ALetter # Lo ARABIC MATHEMATICAL INITIAL GHAIN
|
||||
1EE42 ; ALetter # Lo ARABIC MATHEMATICAL TAILED JEEM
|
||||
1EE47 ; ALetter # Lo ARABIC MATHEMATICAL TAILED HAH
|
||||
1EE49 ; ALetter # Lo ARABIC MATHEMATICAL TAILED YEH
|
||||
1EE4B ; ALetter # Lo ARABIC MATHEMATICAL TAILED LAM
|
||||
1EE4D..1EE4F ; ALetter # Lo [3] ARABIC MATHEMATICAL TAILED NOON..ARABIC MATHEMATICAL TAILED AIN
|
||||
1EE51..1EE52 ; ALetter # Lo [2] ARABIC MATHEMATICAL TAILED SAD..ARABIC MATHEMATICAL TAILED QAF
|
||||
1EE54 ; ALetter # Lo ARABIC MATHEMATICAL TAILED SHEEN
|
||||
1EE57 ; ALetter # Lo ARABIC MATHEMATICAL TAILED KHAH
|
||||
1EE59 ; ALetter # Lo ARABIC MATHEMATICAL TAILED DAD
|
||||
1EE5B ; ALetter # Lo ARABIC MATHEMATICAL TAILED GHAIN
|
||||
1EE5D ; ALetter # Lo ARABIC MATHEMATICAL TAILED DOTLESS NOON
|
||||
1EE5F ; ALetter # Lo ARABIC MATHEMATICAL TAILED DOTLESS QAF
|
||||
1EE61..1EE62 ; ALetter # Lo [2] ARABIC MATHEMATICAL STRETCHED BEH..ARABIC MATHEMATICAL STRETCHED JEEM
|
||||
1EE64 ; ALetter # Lo ARABIC MATHEMATICAL STRETCHED HEH
|
||||
1EE67..1EE6A ; ALetter # Lo [4] ARABIC MATHEMATICAL STRETCHED HAH..ARABIC MATHEMATICAL STRETCHED KAF
|
||||
1EE6C..1EE72 ; ALetter # Lo [7] ARABIC MATHEMATICAL STRETCHED MEEM..ARABIC MATHEMATICAL STRETCHED QAF
|
||||
1EE74..1EE77 ; ALetter # Lo [4] ARABIC MATHEMATICAL STRETCHED SHEEN..ARABIC MATHEMATICAL STRETCHED KHAH
|
||||
1EE79..1EE7C ; ALetter # Lo [4] ARABIC MATHEMATICAL STRETCHED DAD..ARABIC MATHEMATICAL STRETCHED DOTLESS BEH
|
||||
1EE7E ; ALetter # Lo ARABIC MATHEMATICAL STRETCHED DOTLESS FEH
|
||||
1EE80..1EE89 ; ALetter # Lo [10] ARABIC MATHEMATICAL LOOPED ALEF..ARABIC MATHEMATICAL LOOPED YEH
|
||||
1EE8B..1EE9B ; ALetter # Lo [17] ARABIC MATHEMATICAL LOOPED LAM..ARABIC MATHEMATICAL LOOPED GHAIN
|
||||
1EEA1..1EEA3 ; ALetter # Lo [3] ARABIC MATHEMATICAL DOUBLE-STRUCK BEH..ARABIC MATHEMATICAL DOUBLE-STRUCK DAL
|
||||
1EEA5..1EEA9 ; ALetter # Lo [5] ARABIC MATHEMATICAL DOUBLE-STRUCK WAW..ARABIC MATHEMATICAL DOUBLE-STRUCK YEH
|
||||
1EEAB..1EEBB ; ALetter # Lo [17] ARABIC MATHEMATICAL DOUBLE-STRUCK LAM..ARABIC MATHEMATICAL DOUBLE-STRUCK GHAIN
|
||||
|
||||
# Total code points: 21149
|
||||
# Total code points: 24941
|
||||
|
||||
# ================================================
|
||||
|
||||
0027 ; MidLetter # Po APOSTROPHE
|
||||
003A ; MidLetter # Po COLON
|
||||
00B7 ; MidLetter # Po MIDDLE DOT
|
||||
0387 ; MidLetter # Po GREEK ANO TELEIA
|
||||
05F4 ; MidLetter # Po HEBREW PUNCTUATION GERSHAYIM
|
||||
2019 ; MidLetter # Pf RIGHT SINGLE QUOTATION MARK
|
||||
2027 ; MidLetter # Po HYPHENATION POINT
|
||||
FE13 ; MidLetter # Po PRESENTATION FORM FOR VERTICAL COLON
|
||||
FE55 ; MidLetter # Po SMALL COLON
|
||||
FF1A ; MidLetter # Po FULLWIDTH COLON
|
||||
|
||||
# Total code points: 6
|
||||
# Total code points: 8
|
||||
|
||||
# ================================================
|
||||
|
||||
002C ; MidNum # Po COMMA
|
||||
002E ; MidNum # Po FULL STOP
|
||||
003B ; MidNum # Po SEMICOLON
|
||||
037E ; MidNum # Po GREEK QUESTION MARK
|
||||
0589 ; MidNum # Po ARMENIAN FULL STOP
|
||||
060D ; MidNum # Po ARABIC DATE SEPARATOR
|
||||
060C..060D ; MidNum # Po [2] ARABIC COMMA..ARABIC DATE SEPARATOR
|
||||
066C ; MidNum # Po ARABIC THOUSANDS SEPARATOR
|
||||
07F8 ; MidNum # Po NKO COMMA
|
||||
2044 ; MidNum # Sm FRACTION SLASH
|
||||
FE10 ; MidNum # Po PRESENTATION FORM FOR VERTICAL COMMA
|
||||
FE13..FE14 ; MidNum # Po [2] PRESENTATION FORM FOR VERTICAL COLON..PRESENTATION FORM FOR VERTICAL SEMICOLON
|
||||
FE14 ; MidNum # Po PRESENTATION FORM FOR VERTICAL SEMICOLON
|
||||
FE50 ; MidNum # Po SMALL COMMA
|
||||
FE54 ; MidNum # Po SMALL SEMICOLON
|
||||
FF0C ; MidNum # Po FULLWIDTH COMMA
|
||||
FF1B ; MidNum # Po FULLWIDTH SEMICOLON
|
||||
|
||||
# Total code points: 11
|
||||
# Total code points: 15
|
||||
|
||||
# ================================================
|
||||
|
||||
0027 ; MidNumLet # Po APOSTROPHE
|
||||
002E ; MidNumLet # Po FULL STOP
|
||||
2018 ; MidNumLet # Pi LEFT SINGLE QUOTATION MARK
|
||||
2019 ; MidNumLet # Pf RIGHT SINGLE QUOTATION MARK
|
||||
2024 ; MidNumLet # Po ONE DOT LEADER
|
||||
FE52 ; MidNumLet # Po SMALL FULL STOP
|
||||
FF07 ; MidNumLet # Po FULLWIDTH APOSTROPHE
|
||||
FF0E ; MidNumLet # Po FULLWIDTH FULL STOP
|
||||
|
||||
# Total code points: 8
|
||||
|
||||
# ================================================
|
||||
|
||||
0030..0039 ; Numeric # Nd [10] DIGIT ZERO..DIGIT NINE
|
||||
0660..0669 ; Numeric # Nd [10] ARABIC-INDIC DIGIT ZERO..ARABIC-INDIC DIGIT NINE
|
||||
066B..066C ; Numeric # Po [2] ARABIC DECIMAL SEPARATOR..ARABIC THOUSANDS SEPARATOR
|
||||
066B ; Numeric # Po ARABIC DECIMAL SEPARATOR
|
||||
06F0..06F9 ; Numeric # Nd [10] EXTENDED ARABIC-INDIC DIGIT ZERO..EXTENDED ARABIC-INDIC DIGIT NINE
|
||||
07C0..07C9 ; Numeric # Nd [10] NKO DIGIT ZERO..NKO DIGIT NINE
|
||||
0966..096F ; Numeric # Nd [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
|
||||
@ -653,15 +974,32 @@ FE13..FE14 ; MidNum # Po [2] PRESENTATION FORM FOR VERTICAL COLON..PRESENTA
|
||||
0ED0..0ED9 ; Numeric # Nd [10] LAO DIGIT ZERO..LAO DIGIT NINE
|
||||
0F20..0F29 ; Numeric # Nd [10] TIBETAN DIGIT ZERO..TIBETAN DIGIT NINE
|
||||
1040..1049 ; Numeric # Nd [10] MYANMAR DIGIT ZERO..MYANMAR DIGIT NINE
|
||||
1090..1099 ; Numeric # Nd [10] MYANMAR SHAN DIGIT ZERO..MYANMAR SHAN DIGIT NINE
|
||||
17E0..17E9 ; Numeric # Nd [10] KHMER DIGIT ZERO..KHMER DIGIT NINE
|
||||
1810..1819 ; Numeric # Nd [10] MONGOLIAN DIGIT ZERO..MONGOLIAN DIGIT NINE
|
||||
1946..194F ; Numeric # Nd [10] LIMBU DIGIT ZERO..LIMBU DIGIT NINE
|
||||
19D0..19D9 ; Numeric # Nd [10] NEW TAI LUE DIGIT ZERO..NEW TAI LUE DIGIT NINE
|
||||
1A80..1A89 ; Numeric # Nd [10] TAI THAM HORA DIGIT ZERO..TAI THAM HORA DIGIT NINE
|
||||
1A90..1A99 ; Numeric # Nd [10] TAI THAM THAM DIGIT ZERO..TAI THAM THAM DIGIT NINE
|
||||
1B50..1B59 ; Numeric # Nd [10] BALINESE DIGIT ZERO..BALINESE DIGIT NINE
|
||||
1BB0..1BB9 ; Numeric # Nd [10] SUNDANESE DIGIT ZERO..SUNDANESE DIGIT NINE
|
||||
1C40..1C49 ; Numeric # Nd [10] LEPCHA DIGIT ZERO..LEPCHA DIGIT NINE
|
||||
1C50..1C59 ; Numeric # Nd [10] OL CHIKI DIGIT ZERO..OL CHIKI DIGIT NINE
|
||||
A620..A629 ; Numeric # Nd [10] VAI DIGIT ZERO..VAI DIGIT NINE
|
||||
A8D0..A8D9 ; Numeric # Nd [10] SAURASHTRA DIGIT ZERO..SAURASHTRA DIGIT NINE
|
||||
A900..A909 ; Numeric # Nd [10] KAYAH LI DIGIT ZERO..KAYAH LI DIGIT NINE
|
||||
A9D0..A9D9 ; Numeric # Nd [10] JAVANESE DIGIT ZERO..JAVANESE DIGIT NINE
|
||||
AA50..AA59 ; Numeric # Nd [10] CHAM DIGIT ZERO..CHAM DIGIT NINE
|
||||
ABF0..ABF9 ; Numeric # Nd [10] MEETEI MAYEK DIGIT ZERO..MEETEI MAYEK DIGIT NINE
|
||||
104A0..104A9 ; Numeric # Nd [10] OSMANYA DIGIT ZERO..OSMANYA DIGIT NINE
|
||||
11066..1106F ; Numeric # Nd [10] BRAHMI DIGIT ZERO..BRAHMI DIGIT NINE
|
||||
110F0..110F9 ; Numeric # Nd [10] SORA SOMPENG DIGIT ZERO..SORA SOMPENG DIGIT NINE
|
||||
11136..1113F ; Numeric # Nd [10] CHAKMA DIGIT ZERO..CHAKMA DIGIT NINE
|
||||
111D0..111D9 ; Numeric # Nd [10] SHARADA DIGIT ZERO..SHARADA DIGIT NINE
|
||||
116C0..116C9 ; Numeric # Nd [10] TAKRI DIGIT ZERO..TAKRI DIGIT NINE
|
||||
1D7CE..1D7FF ; Numeric # Nd [50] MATHEMATICAL BOLD DIGIT ZERO..MATHEMATICAL MONOSPACE DIGIT NINE
|
||||
|
||||
# Total code points: 282
|
||||
# Total code points: 451
|
||||
|
||||
# ================================================
|
||||
|
||||
|