Mirror of https://sourceware.org/git/glibc.git (synced 2024-11-21 12:30:06 +00:00)
a7b5eb821d
Unicode 16.0.0 Support: Character encoding, character type info, and transliteration tables are all updated to Unicode 16.0.0, using the generator scripts contributed by Mike FABIAN (Red Hat).

Changes in CHARMAP and WIDTH:

Total added characters in newly generated CHARMAP: 5185
Total removed characters in newly generated WIDTH: 1
Total added characters in newly generated WIDTH: 170

The character removed from WIDTH is U+1171E AHOM CONSONANT SIGN MEDIAL RA. It changed like this:

UnicodeData.txt 15.1.0: 1171E;AHOM CONSONANT SIGN MEDIAL RA;Mn;0;NSM;;;;;N;;;;;
UnicodeData.txt 16.0.0: 1171E;AHOM CONSONANT SIGN MEDIAL RA;Mc;0;L;;;;;N;;;;;

EastAsianWidth.txt 15.1.0: 1171D..1171F ; N # Mn [3] AHOM CONSONANT SIGN MEDIAL LA..AHOM CONSONANT SIGN MEDIAL LIGATING RA
EastAsianWidth.txt 16.0.0: 1171E ; N # Mc AHOM CONSONANT SIGN MEDIAL RA

I.e. it changed from Mn (Mark, Nonspacing) to Mc (Mark, Spacing Combining). It should therefore now have width 1 instead of 0, so it is OK that it was removed from WIDTH: characters not in WIDTH get width 1 by default.

Nothing suspicious when browsing the list of the 170 added characters.
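The width reasoning above can be illustrated with a small Python sketch. It is not the glibc generator code: the rule in `simplified_width` (nonspacing and enclosing marks get width 0, everything else gets the default width 1) is an assumption made for illustration, and both function names are hypothetical. The two input lines are the UnicodeData.txt entries quoted above.

```python
# Sketch only: a simplified model of the width rule described above,
# not the logic of the actual glibc generator scripts.

def parse_unicode_data_line(line):
    """Return (code point, name, general category) from a UnicodeData.txt line."""
    fields = line.split(";")
    return int(fields[0], 16), fields[1], fields[2]

def simplified_width(general_category):
    """Assumed rule: nonspacing/enclosing marks have width 0, all else width 1."""
    return 0 if general_category in ("Mn", "Me") else 1

# The two entries quoted in the commit message.
OLD = "1171E;AHOM CONSONANT SIGN MEDIAL RA;Mn;0;NSM;;;;;N;;;;;"
NEW = "1171E;AHOM CONSONANT SIGN MEDIAL RA;Mc;0;L;;;;;N;;;;;"

old_cp, _, old_cat = parse_unicode_data_line(OLD)
new_cp, _, new_cat = parse_unicode_data_line(NEW)
old_width = simplified_width(old_cat)
new_width = simplified_width(new_cat)

# A character whose width equals the default (1) needs no WIDTH entry.
print(f"U+{new_cp:04X}: {old_cat} (width {old_width}) -> {new_cat} (width {new_width})")
```

Under this simplified rule, the category change alone explains why the character no longer needs an explicit WIDTH entry.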
Changes in ctype:

alpha: Added 4452 characters in new ctype which were not in old ctype
combining: Added 51 characters in new ctype which were not in old ctype
combining_level3: Added 43 characters in new ctype which were not in old ctype
graph: Added 5185 characters in new ctype which were not in old ctype
lower: Added 25 characters in new ctype which were not in old ctype
print: Added 5185 characters in new ctype which were not in old ctype
punct: Missing 33 characters of old ctype in new ctype
punct: Added 766 characters in new ctype which were not in old ctype
tolower: Added 27 characters in new ctype which were not in old ctype
totitle: Added 27 characters in new ctype which were not in old ctype
toupper: Added 27 characters in new ctype which were not in old ctype
upper: Added 27 characters in new ctype which were not in old ctype

Nothing suspicious in the additions.

About the 33 characters removed from `punct`:

U+0363 - U+036F are identical in UnicodeData.txt. The difference is in DerivedCoreProperties.txt:

DerivedCoreProperties.txt 15.1.0: not there.
DerivedCoreProperties.txt 16.0.0: 0363..036F ; Alphabetic # Mn [13] COMBINING LATIN SMALL LETTER A..COMBINING LATIN SMALL LETTER X

So that’s the reason why they are added to `alpha` and removed from `punct`.

The same holds for U+1DD3 - U+1DE6: they are identical in UnicodeData.txt, but there is a difference in DerivedCoreProperties.txt:

DerivedCoreProperties.txt 15.1.0: 1DE7..1DF4 ; Alphabetic # Mn [14] COMBINING LATIN SMALL LETTER ALPHA..COMBINING LATIN SMALL LETTER U WITH DIAERESIS
DerivedCoreProperties.txt 16.0.0: 1DD3..1DF4 ; Alphabetic # Mn [34] COMBINING LATIN SMALL LETTER FLATTENED OPEN A ABOVE..COMBINING LATIN SMALL LETTER U WITH DIAERESIS

So they became `Alphabetic` and were thus added to `alpha` and removed from `punct`.

Resolves: BZ #32168
Reviewed-by: Carlos O'Donell <carlos@redhat.com>
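The count of 33 can be cross-checked from the DerivedCoreProperties.txt lines quoted above. The following Python sketch uses only those quoted lines, not the full data files, and the helper names `parse_range_line` and `alphabetic_set` are hypothetical.

```python
# Sketch only: recompute which code points newly became Alphabetic,
# using just the DerivedCoreProperties.txt lines quoted in the commit message.

def parse_range_line(line):
    """Parse 'XXXX..YYYY ; Property # comment' into a set of code points."""
    data = line.split("#", 1)[0]
    code_range, _property_name = (part.strip() for part in data.split(";"))
    first, _, last = code_range.partition("..")
    # A line may name a single code point instead of a range.
    return set(range(int(first, 16), int(last or first, 16) + 1))

def alphabetic_set(lines):
    """Union of all code points covered by the given range lines."""
    result = set()
    for line in lines:
        result |= parse_range_line(line)
    return result

OLD_LINES = [
    "1DE7..1DF4 ; Alphabetic # Mn [14] COMBINING LATIN SMALL LETTER ALPHA.."
    "COMBINING LATIN SMALL LETTER U WITH DIAERESIS",
]
NEW_LINES = [
    "0363..036F ; Alphabetic # Mn [13] COMBINING LATIN SMALL LETTER A.."
    "COMBINING LATIN SMALL LETTER X",
    "1DD3..1DF4 ; Alphabetic # Mn [34] COMBINING LATIN SMALL LETTER FLATTENED OPEN A ABOVE.."
    "COMBINING LATIN SMALL LETTER U WITH DIAERESIS",
]

newly_alphabetic = sorted(alphabetic_set(NEW_LINES) - alphabetic_set(OLD_LINES))
print(f"{len(newly_alphabetic)} code points became Alphabetic")
```

The result is 13 code points from U+0363 - U+036F plus 20 from U+1DD3 - U+1DE6, which matches the 33 characters reported as missing from `punct`.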
64 lines
3.0 KiB
Plaintext
escape_char /
comment_char %

% This file is part of the GNU C Library and contains locale data.
% The Free Software Foundation does not claim any copyright interest
% in the locale data contained in this file. The foregoing does not
% affect the license of the GNU C Library as a whole. It does not
% exempt you from the conditions of the license if your use would
% otherwise be governed by that license.

% Transliterations of fractions.
% Generated automatically from UnicodeData.txt by gen_translit_fraction.py on 2024-09-12 for Unicode 16.0.0.
% The replacements have been surrounded with spaces, because fractions are
% often preceded by a decimal number and followed by a unit or a math symbol.

LC_CTYPE

translit_start

% VULGAR FRACTION ONE QUARTER
<U00BC> "<U0020><U0031><U2044><U0034><U0020>";"<U0020><U0031><U002F><U0034><U0020>"
% VULGAR FRACTION ONE HALF
<U00BD> "<U0020><U0031><U2044><U0032><U0020>";"<U0020><U0031><U002F><U0032><U0020>"
% VULGAR FRACTION THREE QUARTERS
<U00BE> "<U0020><U0033><U2044><U0034><U0020>";"<U0020><U0033><U002F><U0034><U0020>"
% VULGAR FRACTION ONE SEVENTH
<U2150> "<U0020><U0031><U2044><U0037><U0020>";"<U0020><U0031><U002F><U0037><U0020>"
% VULGAR FRACTION ONE NINTH
<U2151> "<U0020><U0031><U2044><U0039><U0020>";"<U0020><U0031><U002F><U0039><U0020>"
% VULGAR FRACTION ONE TENTH
<U2152> "<U0020><U0031><U2044><U0031><U0030><U0020>";"<U0020><U0031><U002F><U0031><U0030><U0020>"
% VULGAR FRACTION ONE THIRD
<U2153> "<U0020><U0031><U2044><U0033><U0020>";"<U0020><U0031><U002F><U0033><U0020>"
% VULGAR FRACTION TWO THIRDS
<U2154> "<U0020><U0032><U2044><U0033><U0020>";"<U0020><U0032><U002F><U0033><U0020>"
% VULGAR FRACTION ONE FIFTH
<U2155> "<U0020><U0031><U2044><U0035><U0020>";"<U0020><U0031><U002F><U0035><U0020>"
% VULGAR FRACTION TWO FIFTHS
<U2156> "<U0020><U0032><U2044><U0035><U0020>";"<U0020><U0032><U002F><U0035><U0020>"
% VULGAR FRACTION THREE FIFTHS
<U2157> "<U0020><U0033><U2044><U0035><U0020>";"<U0020><U0033><U002F><U0035><U0020>"
% VULGAR FRACTION FOUR FIFTHS
<U2158> "<U0020><U0034><U2044><U0035><U0020>";"<U0020><U0034><U002F><U0035><U0020>"
% VULGAR FRACTION ONE SIXTH
<U2159> "<U0020><U0031><U2044><U0036><U0020>";"<U0020><U0031><U002F><U0036><U0020>"
% VULGAR FRACTION FIVE SIXTHS
<U215A> "<U0020><U0035><U2044><U0036><U0020>";"<U0020><U0035><U002F><U0036><U0020>"
% VULGAR FRACTION ONE EIGHTH
<U215B> "<U0020><U0031><U2044><U0038><U0020>";"<U0020><U0031><U002F><U0038><U0020>"
% VULGAR FRACTION THREE EIGHTHS
<U215C> "<U0020><U0033><U2044><U0038><U0020>";"<U0020><U0033><U002F><U0038><U0020>"
% VULGAR FRACTION FIVE EIGHTHS
<U215D> "<U0020><U0035><U2044><U0038><U0020>";"<U0020><U0035><U002F><U0038><U0020>"
% VULGAR FRACTION SEVEN EIGHTHS
<U215E> "<U0020><U0037><U2044><U0038><U0020>";"<U0020><U0037><U002F><U0038><U0020>"
% FRACTION NUMERATOR ONE
<U215F> "<U0020><U0031><U2044><U0020>";"<U0020><U0031><U002F><U0020>"
% VULGAR FRACTION ZERO THIRDS
<U2189> "<U0020><U0030><U2044><U0033><U0020>";"<U0020><U0030><U002F><U0033><U0020>"

translit_end

END LC_CTYPE
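
To make the entry format in this file easier to read, here is a minimal Python sketch that decodes one transliteration line into ordinary strings. It is an illustration only, not part of glibc or of gen_translit_fraction.py; the names `decode_symbols` and `parse_translit_line` are hypothetical, and the parser assumes the simple `<Uxxxx> "...";"..."` layout used in this file.

```python
import re

# Matches one <Uxxxx> symbolic code point as used in this locale file.
UCS_SYMBOL = re.compile(r"<U([0-9A-Fa-f]{4,8})>")

def decode_symbols(text):
    """Turn a run of <Uxxxx> symbols into the string they denote."""
    return "".join(chr(int(hex_digits, 16)) for hex_digits in UCS_SYMBOL.findall(text))

def parse_translit_line(line):
    """Return (source character, list of replacements in order of preference)."""
    source, replacements = line.split(None, 1)
    # Alternatives are separated by ';'; the quotes hold no <Uxxxx> symbols,
    # so decode_symbols simply skips them.
    return decode_symbols(source), [decode_symbols(alt) for alt in replacements.split(";")]

LINE = ('<U00BC> "<U0020><U0031><U2044><U0034><U0020>";'
        '"<U0020><U0031><U002F><U0034><U0020>"')

char, alternatives = parse_translit_line(LINE)
print(repr(char), [repr(alt) for alt in alternatives])

# The surrounding spaces keep the fraction apart from adjacent text.
example = "2" + alternatives[1] + "cups"
print(example)
```

The first alternative uses U+2044 FRACTION SLASH and the second falls back to the ASCII solidus. In the usage line, the padding yields `2 1/4 cups` rather than the ambiguous `21/4cups`, which is the situation the comment in the file header describes.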