glibc

mirror of https://sourceware.org/git/glibc.git synced 2024-11-21 12:30:06 +00:00

Author	SHA1	Message	Date
Mike FABIAN	b31a01909c	localedata: fy_DE: make this "Western Frisian" to agree with the language code "fy" Resolves: BZ # 14522	2024-01-03 20:55:44 +01:00
Mike FABIAN	3c173c1f63	localedata: fy_DE, fy_NL: convert to UTF-8	2024-01-03 20:07:21 +01:00
Mike FABIAN	bec492c1da	localedata: ast_ES: convert to UTF-8	2024-01-03 17:44:52 +01:00
Mike FABIAN	521e96c13f	localedata: ast_ES: Remove wrong copyright text Resolves: BZ # 27601	2024-01-03 17:43:55 +01:00
Mike FABIAN	5448a127e4	localedata: de_{AT,BE,CH,IT,LU}: convert to UTF-8	2024-01-03 13:54:34 +01:00
Mike FABIAN	a8f7f742be	localedata: lv_LV, it_IT, it_CH: convert to UTF-8	2024-01-03 13:54:34 +01:00
Mike FABIAN	61171bb2b9	localedata: it_IT, lv_LV: currency symbol should follow the amount Resolves: BZ # 28558	2024-01-03 13:54:34 +01:00
Mike FABIAN	fe316dad7c	localedata: ms_MY should not use 12-hour format Resolves: BZ # 29504	2024-01-03 11:07:27 +01:00
Mike FABIAN	b5b558ab4b	localedata: es_ES: convert to UTF-8	2024-01-02 21:30:42 +01:00
Mike FABIAN	e3e98b0327	localedata: es_ES: Add am_pm strings Resolves: BZ # 24013 Use <U202F> instead of a plain space because CLDR also uses that.	2024-01-02 21:30:42 +01:00
Mike FABIAN	67f371e882	localedata: convert uz_UZ and uz_UZ@cyrillic to UTF-8	2024-01-02 16:36:43 +01:00
Mike FABIAN	cdce63a767	localedata: uz_UZ and uz_UZ@cyrillic: Fix decimal point and thousands separator Resolves: BZ # 31204	2024-01-02 16:36:43 +01:00
Paul Eggert	dff8da6b3e	Update copyright dates with scripts/update-copyrights	2024-01-01 10:53:40 -08:00
Mike FABIAN	fce5528fcb	localedata: yo_NT: remove redundant comments See: https://sourceware.org/pipermail/libc-alpha/2023-December/153538.html	2023-12-26 13:27:07 +01:00
Mike FABIAN	6b3ace3a1d	localedata: convert en_AU, en_NZ, mi_NZ, niu_NZ to UTF-8	2023-12-26 10:05:50 +01:00
Mike FABIAN	89d727efd7	localedata: First day of the week in AU is Monday, LC_TIME in en_NZ is identical to LC_TIME in en_AU then Resolves: BZ # 24877	2023-12-26 09:59:10 +01:00
Mike FABIAN	e65ca11515	localedata: convert yo_NG to UTF-8, check that language name in Yoruba agrees with CLDR Related: BZ # 24878	2023-12-25 21:04:38 +01:00
Mike FABIAN	1e70252508	localedata: id_ID: change first weekday to Sunday Resolves: BZ # 30412 See: https://sourceware.org/bugzilla/show_bug.cgi?id=30412#c7 CLDR also has ID in the list of territories which have Sunday as the first day of the week.	2023-12-19 11:23:19 +01:00
RushingAlien	12ab77e893	id_ID: Update Time Locales Hello! I am Indonesian, was born and raised in Indonesia and still do live in Indonesia. This patch brings a few changes to the time locales of id_ID, which includes : \- Defining am_pm and time_fmpt_ampm \- Changing time_fmt and d_t_fmt to use the 24-hour format \- Changing first_weekday to Monday This is a squashed version of what is previously a 5 patch set Here are reasons and details of the changes : Change 1 part 1 id_ID: Define `am_pm` string Current formatting does not define am_pm string, leading to AM and PM not being specified in 12 H time format. This change defines the string by changing it from an empty string to "AM";"PM". output of `date +%r`: before commit: 01:23 after commit: 01:23 PM Change 1 part 2 id_ID: Define time_fmt_ampm, change from an empty string Currently, time_fmpt_ampm is set to an empty string, causing some programs to not be able to display time in the 12-hour format, for example, glib: https://gitlab.gnome.org/GNOME/glib/-/issues/2967. This commit changes it from an empty string to "%I:%M:%S %p" Change 2 part 1 id_ID: Use 24-hour format for time_fmt Indonesian standard and formal time format uses the 24-hour format inst- ead of the 12-hour format. This commit aims to change the id_ID locale's time_fmt to match that accordingly. Change 2 part 2 id_ID: Use 24-hour format for d_t_fmt. Indonesian standard and formal time format uses the 24-hour format inst- ead of the 12-hour format. This commit aims to change the id_ID locale's d_t_fmt to match that accordingly. Change 3 id_ID: Change first_weekday to monday Indonesian calendar starts of the week with Monday, let's comply Message-ID: <20230821035530.9075-1-rushing27alien@gmail.com> Resolves: BZ # 30412 Reviewed-by: Mike Fabian <mfabian@redhat.com>	2023-12-18 09:57:33 +01:00
Mike FABIAN	73d92c4b73	localedata: Convert el_GR and el_CY locales to UTF-8	2023-12-15 21:08:44 +01:00
Mike FABIAN	14a94f2e35	localedata: el_GR: Greece now uses the 24h format for time Resolves: BZ # 23012	2023-12-15 21:08:44 +01:00
Mike FABIAN	958478889c	localedata: Convert day names in nn_NO locale to UTF-8	2023-12-07 08:28:25 +01:00
Mike FABIAN	ff25f355af	localedata: Remove trailing whitespace in weekday names in nn_NO locale Resolves: BZ # 25868	2023-12-07 08:28:25 +01:00
Mike FABIAN	dae3cf4134	localedata: Convert oc_FR locale to UTF-8	2023-11-16 23:58:17 +01:00
Mike FABIAN	70246b8495	localedata: Add information for Occitan Resolves: BZ # 28787	2023-11-16 23:58:17 +01:00
Mike FABIAN	3fddfe3c5d	New Zealand locales (en_NZ & mi_NZ) first day of week should be Monday Resolves: BZ #29486	2023-11-16 13:59:00 +01:00
Mike FABIAN	d2d797a49b	Remove unused localedata/th_TH.in	2023-09-21 10:34:35 +02:00
Mike FABIAN	aceda10bd5	Adapt collation in th_TH locale to use the iso14651_t1_common file and sync the collation with CLDR I made it to agree as much as possible with the rules from CLDR (see: https://github.com/unicode-org/cldr/blob/main/common/collation/th.xml). It seems to be impossible to follow the CLDR rules &[before 1]๚<ฯ # should be "variable" and &๛<ๆ # should be "variable" exactly though. These ask for a primary difference in punctuation characters whose primary weight should be "IGNORE". But using a secondary differnence instead still sorts the test data correctly and the previously used collation in th_TH used tertiary differences for these characters. There was old localedata/th_TH.in test data in TIS-620 encoding which was not used (it was not in the localedata/Makefile). I converted this to UTF-8 and moved it to localedata/th_TH.UTF-8.in and added it to localedata/Makefile. Using the existing collation rules in the th_TH locale did not sort that test file completely correct, I think my new collation rules based on iso14651_t1 are better.	2023-09-21 10:34:35 +02:00
Mike FABIAN	bb5bbc2070	Update to Unicode 15.1.0 [BZ #30854 ] Unicode 15.1.0 Support: Character encoding, character type info, and transliteration tables are all updated to Unicode 15.1.0, using the generator scripts contributed by Mike FABIAN (Red Hat). Total removed characters in newly generated CHARMAP: 0 Total changed characters in newly generated CHARMAP: 0 Total added characters in newly generated CHARMAP: 627 Total removed characters in newly generated WIDTH: 0 Total changed characters in newly generated WIDTH: 0 Total added characters in newly generated WIDTH: 627 alpha: Added 622 characters in new ctype which were not in old ctype graph: Added 627 characters in new ctype which were not in old ctype print: Added 627 characters in new ctype which were not in old ctype punct: Added 5 characters in new ctype which were not in old ctype The five characters added to punct are: 2FFC;IDEOGRAPHIC DESCRIPTION CHARACTER SURROUND FROM RIGHT;So;0;ON;;;;;N;;;;; 2FFD;IDEOGRAPHIC DESCRIPTION CHARACTER SURROUND FROM LOWER RIGHT;So;0;ON;;;;;N;;;;; 2FFE;IDEOGRAPHIC DESCRIPTION CHARACTER HORIZONTAL REFLECTION;So;0;ON;;;;;N;;;;; 2FFF;IDEOGRAPHIC DESCRIPTION CHARACTER ROTATION;So;0;ON;;;;;N;;;;; 31EF;IDEOGRAPHIC DESCRIPTION CHARACTER SUBTRACTION;So;0;ON;;;;;N;;;;; The Unicode announcement blog entry says "[...] adds 627 characters, [...] additions include 622 CJK unified ideographs in a new block, [...]", so that looks OK. The Unicode blog mentions "six completely new emoji" but they don't appear here as they are all sequences and not single code points. Resolves: BZ #30854 Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2023-09-16 08:37:03 +02:00
Mike FABIAN	71de3aead9	localedata/unicode-gen/utf8_gen.py: adapt regexp to get relevant lines from EastAsianWidth.txt Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2023-09-16 08:37:02 +02:00
Mike FABIAN	ba017b4f9d	Fix regexp syntax warnings in localedata/unicode-gen/ctype_compatibility.py Fix these: $ python -m py_compile ./ctype_compatibility.py ./ctype_compatibility.py:146: SyntaxWarning: invalid escape sequence '\)' Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2023-09-16 08:37:02 +02:00
lijianglin	e1d3312015	add GB18030-2022 charmap and test the entire GB18030 charmap [BZ #30243 ] support GB18030-2022 after add and change some transcoding relationship of GB18030-2022.Details are as follows: add 25 transcoding relationship UE81E 0x82359037 UE826 0x82359038 UE82B 0x82359039 UE82C 0x82359130 UE832 0x82359131 UE843 0x82359132 UE854 0x82359133 UE864 0x82359134 UE78D 0x84318236 UE78F 0x84318237 UE78E 0x84318238 UE790 0x84318239 UE791 0x84318330 UE792 0x84318331 UE793 0x84318332 UE794 0x84318333 UE795 0x84318334 UE796 0x84318335 UE816 0xfe51 UE817 0xfe52 UE818 0xfe53 UE831 0xfe6c UE83B 0xfe76 UE855 0xfe91 change 6 transcoding relationship U20087 0x95329031 U20089 0x95329033 U200CC 0x95329730 U215D7 0x9536b937 U2298F 0x9630ba35 U241FE 0x9635b630 Test the entire GB18030 charmap, not only the Unicode BMP part. Co-authored-by: yangyanchao <yangyanchao6@huawei.com> Co-authored-by: liqingqing <liqingqing3@huawei.com> Co-authored-by: Bruno Haible <bruno@clisp.org> Reviewed-by: Andreas Schwab <schwab@suse.de> Reviewed-by: Mike FABIAN <mfabian@redhat.com>	2023-08-29 19:02:30 +02:00
Colin Leroy-Mira	dfe8c44588	localedata: Translit common emojis to smileys [BZ #30649 ] Add common emojis to the translit-able characters (mostly faces and hearts), and translit them to old-fashioned smileys. Signed-off-by: Colin Leroy-Mira <colin@colino.net> Reviewed-by: Florian Weimer <fweimer@redhat.com>	2023-08-29 09:31:23 +02:00
Florian Weimer	4dc6b2dfb0	localedata: de_DE should not use Fräulein This honorific has fallen out of use quite some time ago.	2023-02-27 16:54:22 +01:00
Joseph Myers	6d7e8eda9b	Update copyright dates with scripts/update-copyrights	2023-01-06 21:14:39 +00:00
Mike FABIAN	7fe6734d28	Update to Unicode 15.0.0 [BZ #29604 ] Unicode 15.0.0 Support: Character encoding, character type info, and transliteration tables are all updated to Unicode 15.0.0, using the generator scripts contributed by Mike FABIAN (Red Hat). Total added characters in newly generated CHARMAP: 4489 Total removed characters in newly generated WIDTH: 0 Total changed characters in newly generated WIDTH: 0 Total added characters in newly generated WIDTH: 4257 alpha: Added 4389 characters in new ctype which were not in old ctype combining: Added 42 characters in new ctype which were not in old ctype combining_level3: Added 34 characters in new ctype which were not in old ctype graph: Added 4489 characters in new ctype which were not in old ctype lower: Added 73 characters in new ctype which were not in old ctype print: Added 4489 characters in new ctype which were not in old ctype punct: Missing 5 characters of old ctype in new ctype punct: Missing: ఄ 0xc04 TELUGU SIGN COMBINING ANUSVARA ABOVE punct: Missing: ྂ 0xf82 TIBETAN SIGN NYI ZLA NAA DA punct: Missing: ྃ 0xf83 TIBETAN SIGN SNA LDAN punct: Missing: 𑂀 0x11080 KAITHI SIGN CANDRABINDU punct: Missing: 𑂁 0x11081 KAITHI SIGN ANUSVARA That’s OK, because these are now Alphabetic in DerivedCoreProperties.txt punct: Added 105 characters in new ctype which were not in old ctype Resolves: BZ #29604 Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2022-10-06 08:58:33 +02:00
Adhemerval Zanella Netto	de477abcaa	Use '%z' instead of '%Z' on printf functions The Z modifier is a nonstandard synonymn for z (that predates z itself) and compiler might issue an warning for in invalid conversion specifier. Reviewed-by: Florian Weimer <fweimer@redhat.com>	2022-09-22 08:48:04 -03:00
Florian Weimer	1d78299911	localedata: Convert French language locales (fr_*) to UTF-8	2022-08-17 11:07:00 +02:00
Florian Weimer	01441ae333	de_DE: Convert to UTF-8 Reviewed-by: Carlos O'Donell <carlos@redhat.com> Tested-by: Carlos O'Donell <carlos@redhat.com>	2022-07-05 09:07:02 +02:00
Emil Soleyman-Zomalan	3e29dc5233	Add locale for syr_SY	2022-04-21 13:05:40 +02:00
Ilyahoo Proshel	189906b687	Add rif_MA locale [BZ #27781 ] Resolves: BZ #27781	2022-04-07 14:59:41 +02:00
Adhemerval Zanella	74942fd273	localedate: Fix printf type on tst_mbrtowc Checked on x86_64-linux-gnu and i686-linux-gnu.	2022-03-31 08:49:55 -03:00
Adhemerval Zanella	d1eefcb2a0	localedata: Remove unused variables in tests Checked on x86_64-linux-gnu and i686-linux-gnu.	2022-03-31 08:38:35 -03:00
Carlos O'Donell	1c7a34567d	localedata: Do not generate output if warnings were present. With LC_MONETARY parsing fixed we can now generate locales without forcing output with '-c'. Removing '-c' from localedef invocation is the equivalent of using -Werror for localedef. The glibc locale sources should always be clean and free from warnings. We remove '-c' from both test locale generation and the targets used for installing locales e.g. install-locale-archive, and install-locale-files. Tested on x86_64 and i686 without regressions. Tested with install-locale-archive target. Tested with install-locale-files target. Reviewed-by: DJ Delorie <dj@redhat.com>	2022-02-25 07:31:27 -05:00
Carlos O'Donell	7e0ad15c0f	localedata: Adjust C.UTF-8 to align with C/POSIX. We have had one downstream report from Canonical [1] that an rrdtool test was broken by the differences in LC_TIME that we had in the non-builtin C locale (C.UTF-8). If one application has an issue there are going to be others, and so with this commit we review and fix all the issues that cause the builtin C locale to be different from C.UTF-8, which includes: * mon_decimal_point should be empty e.g. "" - Depends on mon_decimal_point_wc fix. * negative_sign should be empty e.g. "" * week should be aligned with the builtin C/POSIX locale * d_fmt corrected with escaped slashes e.g. "%m//%d//%y" * yesstr and nostr should be empty e.g. "" * country_ab2 and country_ab3 should be empty e.g. "" We bump LC_IDENTIFICATION version and adjust the date to indicate the change in the locale. A new tst-c-utf8-consistency test is added to ensure consistency between C/POSIX and C.UTF-8. Tested on x86_64 and i686 without regression. [1] https://sourceware.org/pipermail/libc-alpha/2022-January/135703.html Co-authored-by: Florian Weimer <fweimer@redhat.com> Reviewed-by: Florian Weimer <fweimer@redhat.com>	2022-02-01 11:12:36 -05:00
Paul Eggert	581c785bf3	Update copyright dates with scripts/update-copyrights I used these shell commands: ../glibc/scripts/update-copyrights $PWD/../gnulib/build-aux/update-copyright (cd ../glibc && git commit -am"[this commit message]") and then ignored the output, which consisted lines saying "FOO: warning: copyright statement not found" for each of 7061 files FOO. I then removed trailing white space from math/tgmath.h, support/tst-support-open-dev-null-range.c, and sysdeps/x86_64/multiarch/strlen-vec.S, to work around the following obscure pre-commit check failure diagnostics from Savannah. I don't know why I run into these diagnostics whereas others evidently do not. remote: * 912-#endif remote: * 913: remote: * 914- remote: * error: lines with trailing whitespace found ... remote: *** error: sysdeps/unix/sysv/linux/statx_cp.c: trailing lines	2022-01-01 11:40:24 -08:00
Maxim Kuvyrkov	c16dc431c8	Update copyright header in recently merged ab_GE locale ab_GE locale was committed under DCO and this header proposed in [1] suits it better. [1] https://sourceware.org/pipermail/libc-alpha/2021-September/130692.html Signed-off-by: Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org> Signed-off-by: Nart Tlisha <daniel.abzakh@gmail.com>	2021-12-17 18:22:21 +00:00
Nart Tlisha	a16c5ab139	localedata: add new locale ab_GE Add the Abkhazian language in the Georgia territory The ab_GE was just recently added to CLDR, it should be available in CLDR v41, https://github.com/unicode-org/cldr/pull/1402 The Abkhazian language has been added to Gnome for localization The locale has been tested on Ubuntu 20.04, Mint 20.2 and Fedora 35 Beta Signed-off-by: Nart Tlisha <daniel.abzakh@gmail.com> Reviewed-by: Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>	2021-12-16 14:37:14 +00:00
Adhemerval Zanella	3a523ccd78	locale: Fix localedata/sort-test undefined behavior The collate-test.c triggers UB with an signed integer overflow, which results in an error on some architectures (powerpc32). Checked on x86_64, i686, and powerpc.	2021-11-08 15:28:48 -03:00
Mike FABIAN	b517256015	Update to Unicode 14.0.0 [BZ #28390 ] Unicode 14.0.0 Support: Character encoding, character type info, and transliteration tables are all updated to Unicode 14.0.0, using the generator scripts contributed by Mike FABIAN (Red Hat). Total added characters in newly generated CHARMAP: 838 Total removed characters in newly generated WIDTH: 1 (Characters not in WIDTH get width 1 by default, i.e. these have width 1 now.) removed: <U1734> 0 : eaw=N category=Mc bidi=L name=HANUNOO SIGN PAMUDPOD That seems intentional, the character had category Mn (Mark, nonspacing) before and now has Mc (Mark, spacing combining) Total changed characters in newly generated WIDTH: 0 Total added characters in newly generated WIDTH: 175	2021-10-04 08:54:27 +02:00
Carlos O'Donell	466f2be6c0	Add generic C.UTF-8 locale (Bug 17318) We add a new C.UTF-8 locale. This locale is not builtin to glibc, but is provided as a distinct locale. The locale provides full support for UTF-8 and this includes full code point sorting via STRCMP-based collation (strcmp or wcscmp). The collation uses a new keyword 'codepoint_collation' which drops all collation rules and generates an empty zero rules collation to enable STRCMP usage in collation. This ensures that we get full code point sorting for C.UTF-8 with a minimal 1406 bytes of overhead (LC_COLLATE structure information and ASCII collating tables). The new locale is added to SUPPORTED. Minimal test data for specific code points (minus those not supported by collate-test) is provided in C.UTF-8.in, and this verifies code point sorting is working reasonably across the range. The locale was tested manually with the full set of code points without failure. The locale is harmonized with locales already shipping in various downstream distributions. A new tst-iconv9 test is added which verifies the C.UTF-8 locale is generally usable. Testing for fnmatch, regexec, and recomp is provided by extending bug-regex1, bugregex19, bug-regex4, bug-regex6, transbug, tst-fnmatch, tst-regcomp-truncated, and tst-regex to use C.UTF-8. Tested on x86_64 or i686 without regression. Reviewed-by: Florian Weimer <fweimer@redhat.com>	2021-09-06 11:30:28 -04:00
Siddhesh Poyarekar	30891f35fa	Remove "Contributed by" lines We stopped adding "Contributed by" or similar lines in sources in 2012 in favour of git logs and keeping the Contributors section of the glibc manual up to date. Removing these lines makes the license header a bit more consistent across files and also removes the possibility of error in attribution when license blocks or files are copied across since the contributed-by lines don't actually reflect reality in those cases. Move all "Contributed by" and similar lines (Written by, Test by, etc.) into a new file CONTRIBUTED-BY to retain record of these contributions. These contributors are also mentioned in manual/contrib.texi, so we just maintain this additional record as a courtesy to the earlier developers. The following scripts were used to filter a list of files to edit in place and to clean up the CONTRIBUTED-BY file respectively. These were not added to the glibc sources because they're not expected to be of any use in future given that this is a one time task: https://gist.github.com/siddhesh/b5ecac94eabfd72ed2916d6d8157e7dc https://gist.github.com/siddhesh/15ea1f5e435ace9774f485030695ee02 Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2021-09-03 22:06:44 +05:30
Siddhesh Poyarekar	2d2d9f2b48	Move malloc hooks into a compat DSO Remove all malloc hook uses from core malloc functions and move it into a new library libc_malloc_debug.so. With this, the hooks now no longer have any effect on the core library. libc_malloc_debug.so is a malloc interposer that needs to be preloaded to get hooks functionality back so that the debugging features that depend on the hooks, i.e. malloc-check, mcheck and mtrace work again. Without the preloaded DSO these debugging features will be nops. These features will be ported away from hooks in subsequent patches. Similarly, legacy applications that need hooks functionality need to preload libc_malloc_debug.so. The symbols exported by libc_malloc_debug.so are maintained at exactly the same version as libc.so. Finally, static binaries will no longer be able to use malloc debugging features since they cannot preload the debugging DSO. Reviewed-by: Carlos O'Donell <carlos@redhat.com> Tested-by: Carlos O'Donell <carlos@redhat.com>	2021-07-22 18:37:59 +05:30
Siddhesh Poyarekar	06a1b79407	Reinstate gconv-modules as the default configuration file Reinstate gconv-modules as the main file so that the configuration files in gconv-modules.d/ become add-on configuration. With this, the effective user visible change is that GCONV_PATH can now have supplementary configuration in GCONV_PATH/gconv-modules.d/ in addition to the main GCONV_PATH/gconv-modules file.	2021-06-14 18:38:09 +05:30
Siddhesh Poyarekar	fc5bfade69	iconvdata: Move gconv-modules configuration to gconv-modules.conf Move all gconv-modules configuration files to gconv-modules.conf. That is, the S390 extensions now become gconv-modules-s390.conf. Move both configuration files into gconv-modules.d. Now GCONV_PATH/gconv-modules is read only for backward compatibility for third-party gconv modules directories. Reviewed-by: DJ Delorie <dj@redhat.com>	2021-06-09 09:47:16 +05:30
Florian Weimer	f17164bd51	localedata: Use U+00AF MACRON in more EBCDIC charsets [BZ #27882 ] This updates IBM256, IBM277, IBM278, IBM280, IBM284, IBM297, IBM424 in the same way that IBM273 was updated for bug 23290. IBM256 and IBM424 still have holes after this change, so HAS_HOLES is not updated. Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2021-05-18 07:21:45 +02:00
Sebastian Rasmussen	ebde2baeb5	Update sv_SE to treate 'W' as a distinct character (Bug 25036) The 13th edition of Svenska Akademiens ordlista lists 'W' as a distinct letter that sorts after 'V'. We adjust the sv_SE locale (and tests) to match this updated and "reformed" language change. This harmonizes us with CLDR 1.5.0 (2007) for sv_SE sorting of the letter 'W'. No regressions on x86_64, and locale sorting tests all pass. Co-authored-by: Carlos O'Donell <carlos@redhat.com>	2021-04-06 12:34:02 -04:00
Marc Aurèle La France	c6e2ca2c3f	POSIX locale: Fix typo in comment	2021-01-09 12:14:44 +01:00
Paul Eggert	2b778ceb40	Update copyright dates with scripts/update-copyrights I used these shell commands: ../glibc/scripts/update-copyrights $PWD/../gnulib/build-aux/update-copyright (cd ../glibc && git commit -am"[this commit message]") and then ignored the output, which consisted lines saying "FOO: warning: copyright statement not found" for each of 6694 files FOO. I then removed trailing white space from benchtests/bench-pthread-locks.c and iconvdata/tst-iconv-big5-hkscs-to-2ucs4.c, to work around this diagnostic from Savannah: remote: * pre-commit check failed ... remote: * error: lines with trailing whitespace found remote: error: hook declined to update refs/heads/master	2021-01-02 12:17:34 -08:00
Andreas Schwab	8f8052c2aa	Revert "Fix missing redirects in testsuite targets" This reverts commit `d5afb38503`. The log files are actually created by the various shell scripts that drive the tests.	2020-10-08 10:09:30 +02:00
Carlos O'Donell	8cde977077	en_US: Minimize changes to date_fmt (Bug 25923) In 2000 when date_fmt was originally added as an extension the en_US locale did not have a date_fmt specifier and so used the default which resulted in the abbreviated month name coming before the day of the month (as expected in the US and other locales). In commit `7395f3a0ef` the date_fmt was added to en_US with a 12H time to better align with US user expectations. Unfortunately the abbreviated month name and day were inverted during that transition, and that was seen as a regression and reported against Fedora 32: https://bugzilla.redhat.com/show_bug.cgi?id=1830623 The progression of date_fmt looks like this: "%a %b %e %H:%M:%S %Z %Y" <- Originally (2000) "%a %d %b %Y %I:%M:%S %p %Z" <- glibc 2.29 (2019) "%a %b %e %r %Z %Y" <- glibc 2.32 (2020) [this commit] Note: "%r" is "%I:%M:%S %p" in en_US and so shorter to write. Likewise the year is in the wrong place in commit `7395f3a0ef` and this is corrected in this patch. For reference d_t_fmt: "%a %d %b %Y %r %Z" <- d_t_fmt (1997) Yes, d_t_fmt and date_fmt are not the same, this is just the history of this locale. This commit does not change d_t_fmt to better align with date_fmt. No users have requested we change d_t_fmt or given any justification for such a change. The only goals of this change are to place the abbreviated month name before the day of the month as it has been printed since 2000, and place the year at the end. This minimizes the change from commit `7395f3a0ef` and makes good on changing only from 24H clock to 12H clock. Reviewed-by: Florian Weimer <fweimer@redhat.com>	2020-07-16 17:17:10 -04:00
Mike FABIAN	6e540caa21	Set width of JUNGSEONG/JONGSEONG characters from UD7B0 to UD7FB to 0 [BZ #26120 ] Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2020-06-26 09:54:43 +02:00
Florian Weimer	3404def00a	ckb_IQ, or_IN locales: Add missing reorder-end keywords This suppresses a non-fatal error during locale building. Reviewed-by: Rafał Lużyński <digitalfreak@lingonborough.com>	2020-05-08 10:52:22 +02:00
Carlos O'Donell	df6c63ebbc	localedef: Add tests-container test for --no-hard-links. The new tst-localedef-hardlinks verifies that when compiling two locales (with default output directory) one with --no-hard-links and one without the option, results in the expected behaviour. When --no-hard-links is used the link counts on LC_CTYPE is 1, indicating that even thoug the two locale are identical (though different named source files and output direcotry) the localedef did not carry out the hard link optimization. Then when --no-hard-links is omitted the localedef hard link optimization is correctly carried out and for 2 compiled locales the link count for LC_CTYPE is 2. Reviewed-by: DJ Delorie <dj@redhat.com>	2020-04-30 16:28:07 -04:00
Mike FABIAN	8645f62469	Bug 25819: Update to Unicode 13.0.0 Unicode 13.0.0 Support: Character encoding, character type info, and transliteration tables are all updated to Unicode 13.0.0, using the generator scripts contributed by Mike FABIAN (Red Hat). Total added characters in newly generated CHARMAP: 5930 Total added characters in newly generated WIDTH: 5536	2020-04-21 18:17:23 +02:00
kokoye2007	8a1d13d0c7	Updates to the shn_MM locale [BZ #25532 ]	2020-04-08 12:22:36 +02:00
Rafał Lużyński	10b2cdc3b3	oc_FR locale: Fix spelling of April (bug 25639) Confirmed by CLDR and a native speaker: "abril" is more often used even if "abrial" is also correct. Both nominative (alt_mon) and genitive (mon) cases are updated.	2020-04-07 00:20:53 +02:00
Rafał Lużyński	649fdf039b	oc_FR locale: Fix spelling of Thursday (bug 25639) As reported by a native speaker: Thursday: "dijóus" -> "dijòus" (also confirmed by CLDR)	2020-03-19 00:19:07 +01:00
Mike FABIAN	eb948facd8	Fix typo in the name for Wednesday in Kurdish [BZ #9809 ]	2020-02-11 10:18:45 +01:00
Mike FABIAN	cdeae33d71	Update or_IN collation [BZ #22525 ] - Add a test file or_IN.UTF-8.in. - Make the collation agree with CLDR.	2020-02-03 10:19:20 +01:00
Mike FABIAN	ae199e7d64	Fix ckb_IQ [BZ #9809 ] Add ckb_IQ to SUPPORTED file. Add ckb_IQ.UTF-8.in collation test file. Mention new ckb_IQ locale in NEWS.	2020-02-03 10:19:20 +01:00
Jwtiyar Nariman	4267522f5e	Add new locale: ckb_IQ (Kurdish/Sorani spoken in Iraq) [BZ #9809 ]	2020-02-03 10:19:20 +01:00
Rafał Lużyński	135540285c	sl_SI locale: Use "." as the thousands separator (bug 25233) This is correct according to CLDR [1] and Florian Weimer's quick research. [2] [1] https://st.unicode.org/cldr-apps/v#/sl/Symbols/ [2] https://sourceware.org/bugzilla/show_bug.cgi?id=25233#c0 Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2020-01-08 00:13:48 +01:00
Rafał Lużyński	75ba929987	Multiple locales: Add date_fmt (bug 24054) It is not specified what should be the content of d_t_fmt and date_fmt but in the built-in C locale those fields have only one difference: date_fmt contains "%Z" (the current time zone) while d_t_fmt does not. For most of the locales this commit does the following operation: copy d_t_fmt to date_fmt, and then remove "%Z" from d_t_fmt. If "%Z" was originally missing from d_t_fmt add it to date_fmt. It also corrects comments where necessary. Exceptions: * In bo_CN, dz_BT, and km_KH "%Z" has not been added to date_fmt because it was too difficult. In these locales date_fmt has been set to the copy of d_t_fmt. * In en_DK "%Z" has not been removed from d_t_fmt in order to preserve the conformance with the standard mentioned in the comment. The command to identify and initially edit the locales that need the update was: for i in `grep -lw d_t_fmt *` do if ! grep -qw date_fmt $i ; then awk '/d_t_fmt/ { print $0; gsub("d_t_fmt", "date_fmt"); } //{ print $0 }' < $i > $i.next mv $i.next $i fi done and then each file was further edited manually.	2020-01-02 11:45:45 +01:00
Joseph Myers	d614a75396	Update copyright dates with scripts/update-copyrights.	2020-01-01 00:14:33 +00:00
Rafał Lużyński	d99b500e3d	lv_LV locale: Correct the time part of d_t_fmt (bug 25324) Currently d_t_fmt formats time as "plkst. %H un %M". A quick Google search says that "plkst." means "o’clock" and "un" means "and". Also this format does not display seconds. CLDR does not mention anything like that. We have no reason to use anything different than "%H:%M:%S".	2019-12-30 11:48:20 +01:00
Rafał Lużyński	20a740b2b2	km_KH locale: Use "%M" instead of "m" in d_t_fmt (bug 25323) A quick analysis suggests that the original author meant "%M" (minutes format specifier) instead of "m" which is just a literal "m" letter.	2019-12-30 11:48:19 +01:00
Rafał Lużyński	b8c210bcc7	mnw_MM, my_MM, and shn_MM locales: Do not use %Op The "O" modifier does nothing when used with "%p" so let's better not use it at all and replace "%Op" with "%p".	2019-12-23 23:49:22 +01:00
Rafał Lużyński	c372d2e863	ru_UA locale: use copy "ru_RU" in LC_TIME (bug 25044) Replacing incorrect abbreviated weekday names "Пнд", "Вто", "Срд"... with correct ones "Пн", "Вт", "Ср"... makes the LC_TIME sections in those two locales almost identical. The only remaining difference was that ab_alt_mon elements in ru_UA were lowercase while in ru_RU they had the first letter uppercase, the latter was pointed as a better choice by a native speaker. This commit unifies LC_TIME between ru_RU and ru_UA.	2019-11-26 11:54:29 +01:00
Talachan Mon	c5fbd7c3ea	Add new locale: mnw_MM (Mon language spoken in Myanmar) [BZ #25139 ]	2019-11-06 08:15:16 +01:00
Arjun Shankar	513aaa0d78	Add Transliterations for Unicode Misc. Mathematical Symbols-A/B [BZ #23132 ] This commit adds previously missing transliterations for several code points in the Unicode blocks "Miscellaneous Mathematical Symbols-A/B" - transliterated to their approximate ASCII representations. It also adds a corresponding iconv transliteration test. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2019-10-25 19:45:55 +02:00
DJ Delorie	97476447ed	Install charmaps uncompressed in testroot The testroot does not have a gunzip command, so the charmap files should not be installed gzipped else they cannot be used (and thus tested). With this patch, installing with INSTALL_UNCOMPRESSED=yes installs uncompressed charmaps instead. Note that we must purge the $(symbolic_link_list) as it contains references to $(DESTDIR), which we change during the testroot installation. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2019-10-24 17:01:04 -04:00
Mike FABIAN	8e42fc6811	Sync "language", "lang_name", "territory", "country_name" with CLDR/langtable Sync these values with CLDR and langtable as much as possible. Add missing values. If possible, take the values from CLDR, if CLDR does not have it, take it from langtable. The values from langtable which are not from CLDR are from Wikipedia or native speakers.	2019-10-01 10:27:02 +02:00
Paul Eggert	5a82c74822	Prefer https to http for gnu.org and fsf.org URLs Also, change sources.redhat.com to sourceware.org. This patch was automatically generated by running the following shell script, which uses GNU sed, and which avoids modifying files imported from upstream: sed -ri ' s,(http\|ftp)(://(.\.)?(gnu\|fsf\|sourceware)\.org($\|[^.]\|\.[^a-z])),https\2,g s,(http\|ftp)(://(.\.)?)sources\.redhat\.com($\|[^.]\|\.[^a-z]),https\2sourceware.org\4,g ' \ $(find $(git ls-files) -prune -type f \ ! -name '.po' \ ! -name 'ChangeLog' \ ! -path COPYING ! -path COPYING.LIB \ ! -path manual/fdl-1.3.texi ! -path manual/lgpl-2.1.texi \ ! -path manual/texinfo.tex ! -path scripts/config.guess \ ! -path scripts/config.sub ! -path scripts/install-sh \ ! -path scripts/mkinstalldirs ! -path scripts/move-if-change \ ! -path INSTALL ! -path locale/programs/charmap-kw.h \ ! -path po/libc.pot ! -path sysdeps/gnu/errlist.c \ ! '(' -name configure \ -execdir test -f configure.ac -o -f configure.in ';' ')' \ ! '(' -name preconfigure \ -execdir test -f preconfigure.ac ';' ')' \ -print) and then by running 'make dist-prepare' to regenerate files built from the altered files, and then executing the following to cleanup: chmod a+x sysdeps/unix/sysv/linux/riscv/configure # Omit irrelevant whitespace and comment-only changes, # perhaps from a slightly-different Autoconf version. git checkout -f \ sysdeps/csky/configure \ sysdeps/hppa/configure \ sysdeps/riscv/configure \ sysdeps/unix/sysv/linux/csky/configure # Omit changes that caused a pre-commit check to fail like this: # remote: * error: sysdeps/powerpc/powerpc64/ppc-mcount.S: trailing lines git checkout -f \ sysdeps/powerpc/powerpc64/ppc-mcount.S \ sysdeps/unix/sysv/linux/s390/s390-64/syscall.S # Omit change that caused a pre-commit check to fail like this: # remote: * error: sysdeps/sparc/sparc64/multiarch/memcpy-ultra3.S: last line does not end in newline git checkout -f sysdeps/sparc/sparc64/multiarch/memcpy-ultra3.S	2019-09-07 02:43:31 -07:00
Rafal Luzynski	c0fd3244e7	Chinese locales: Set first_weekday to 2 (bug 24682). The first day of the week in China (Mainland) should be Monday according to the national standard GB/T 7408-2005. References: * https://www.doc88.com/p-1166696540287.html * https://unicode-org.atlassian.net/browse/CLDR-11510 [BZ #24682] * localedata/locales/bo_CN (first_weekday): Add, set to 2 (Monday). * localedata/locales/ug_CN (first_weekday): Likewise. * localedata/locales/zh_CN (first_weekday): Likewise.	2019-08-23 00:07:06 +02:00
Rafal Luzynski	9208c3b804	Afar locales: Months and days updated from CLDR (bug 21897). This commit updates month and weekday names (full and abbreviated) from CLDR 35.1 with the following exceptions. It was not clear why the full name of February in aa_DJ and aa_ER was "Kudo" while the abbreviated version is "Nah" but some additional sources [1] [2] as well as the content of aa_ER and aa_ER@saaho suggest it should be "Naharsi Kudo". This commit consequently sets the translation of February to "Naharsi Kudo" in aa_DJ and aa_ET. aa_ER@saaho is not supported by CLDR but since the month names were identical to aa_ER before this commit, the same values have been copied from aa_ER. Links: [1] https://fr.wiktionary.org/wiki/naharsi_kudo [2] http://www.mcit.gov.et/web/guest/-/localization-standard-for-afaraf [BZ #21897] * localedata/locales/aa_DJ (abday): Update from CLDR, all words begin with an uppercase letter now. (abmon): Likewise. (mon): Update from CLDR, reword February from "Kudo" to "Naharsi Kudo", April from "Agda Baxisso" to "Agda Baxis", and August from "Liiqen" to "Leqeeni". * localedata/locales/aa_ER (mon): Update from CLDR, reword April from "Agda Baxisso" to "Agda Baxis" and August from "Leqeeni" to "Liiqen". * localedata/locales/aa_ER@saaho (mon): Likewise. * localedata/locales/aa_ET (abmon): Update from CLDR, reword abbreviated February from "Kud" to "Nah". (mon): Update from CLDR, reword February from "Kudo" to "Naharsi Kudo" and April from "Agda Baxisso" to "Agda Baxis".	2019-07-17 11:58:21 +02:00
Rafal Luzynski	fba6d4bbce	nl_BE locale: Use "copy "nl_NL"" in LC_NAME (bug 23996). The content of the section is identical in both languages. [BZ #23996] * localedata/locales/nl_BE (LC_NAME): Replace with “copy "nl_NL"”.	2019-07-17 11:53:08 +02:00
PanderMusubi	3cc7c9c5f1	nl_BE and nl_NL locales: Dutch salutations (bug 23996). [BZ #23996] * localedata/locales/nl_BE (LC_NAME): Add name_gen, name_mr, name_mrs, name_miss, and name_ms. * localedata/locales/nl_NL (LC_NAME): Likewise.	2019-07-17 11:50:42 +02:00
Daniil Zhilin	cce7b6a578	ga_IE and en_IE locales: Revert first_weekday removal (bug 24200). These values were removed by the commit `0a410e76f5`. [BZ #24200] * localedata/locales/ga_IE (first_weekday): Add, set to 2 (Monday). * localedata/locales/en_IE (first_weekday): Likewise.	2019-07-17 11:41:24 +02:00
Rafal Luzynski	a55541fd1c	szl_PL locale: Fix a typo in the previous commit (bug 24652). The Unicode sequences in the format <Uxxxx> should be used instead of non-ASCII characters. Reported by Piotr Drąg: https://sourceware.org/bugzilla/show_bug.cgi?id=24652#c8 [BZ #24652] * localedata/locales/szl_PL (day): Use the correct Unicode sequences instead of non-ASCII characters.	2019-06-24 22:17:58 +02:00
Grzegorz Kulik	2bd81b60d6	szl_PL locale: Spelling corrections (bug 24652). This commit also provides the correct month names in both nominative and genitive case for Silesian language, as required by the fix for the bug 10871. [BZ #24652] * localedata/locales/szl_PL (abday): Spelling corrections. (day): Likewise. (abmon): Likewise. (mon): Rename to... (alt_mon): This, then apply spelling corrections. (mon): New entry, month names in the genitive case.	2019-06-24 10:59:11 +02:00
Rafal Luzynski	fefa21790b	nl_{AW,NL}: Correct the thousands separator and grouping (bug 23831). According to CLDR 35.1 and the bug report the thousands grouping separator should be always "." (a single dot) and digits should be grouped by 3. [BZ #23831] * localedata/locales/nl_AW (mon_thousands_sep): Set to ".". * localedata/locales/nl_NL (mon_thousands_sep): Likewise. (thousands_sep): Likewise. (grouping): Set to 3;3.	2019-06-21 20:48:35 +02:00
Rafal Luzynski	f59a54ab0c	nl_AW locale: Correct the negative monetary format (bug 24614). Follow the same changes as made in the commit `02d8b5ab1c` because the respective entries in nl_NL and nl_AW had been the same before the change so they should be the same after. CLDR does not provide complete data for nl_AW, it says it is missing and displays a copy of nl_NL. [BZ #24614] * localedata/locales/nl_AW (n_sep_by_space): Set to 2 (a space between the currency symbol and the minus sign). (n_sign_posn): Set to 4 (the minus sign after the currency symbol).	2019-06-19 23:44:47 +02:00
Rafal Luzynski	02d8b5ab1c	nl_NL locale: Correct the negative monetary format (bug 24614). According to CLDR 35.1 and the bug report the correct monetary format for negative amounts should be "EUR -1 234,56" while previously it was "EUR 1 234,56-". This patch does not change the thousands (grouping) separator. [BZ #24614] * localedata/Makefile (LOCALES): Add nl_NL.UTF-8. * localedata/locales/nl_NL (n_sep_by_space): Set to 2 (a space between the currency symbol and the minus sign). (n_sign_posn): Set to 4 (the minus sign after the currency symbol). * localedata/tst-strfmon1.c (tests): Add test data for nl_NL.UTF-8.	2019-06-17 23:42:06 +02:00
mansayk	157cda1ff0	tt_RU: Add lang_name [BZ #24370 ] This commit adds a lang_name according to CLDR-35.1. [BZ #24370] * localedata/locales/tt_RU (lang_name): Add from CLDR-35.1.	2019-05-28 22:13:32 +02:00
mansayk	182a3746b8	tt_RU: Fix orthographic mistakes in mon and abmon sections [BZ #24369 ] This commit fixes some errors and converts all month names to lowercase. The content is synchronized with CLDR-35.1 now but trailing dots are removed from abmon values in order to maintain consistency with the previous values and with many other locales which do the same. [BZ #24369] * localedata/locales/tt_RU (mon): Update from CLDR-35.1, fix errors. (abmon): Likewise, but remove the trailing dots.	2019-05-28 22:11:22 +02:00
Mike FABIAN	f6efec90c8	Bug 24535: Update to Unicode 12.1.0 Unicode 12.1.0 Support: Character encoding, character type info, and transliteration tables are all updated to Unicode 12.1.0, using the generator scripts contributed by Mike FABIAN (Red Hat). Some info about the number of characters added or changed: Total added characters in newly generated CHARMAP: 1 added: <U32FF> /xe3/x8b/xbf SQUARE ERA NAME REIWA Total added characters in newly generated WIDTH: 1 added: <U32FF> 2 : eaw=W category=So bidi=L name=SQUARE ERA NAME REIWA graph: Added 1 characters in new ctype which were not in old ctype graph: Added: ㋿ U+32FF SQUARE ERA NAME REIWA print: Added 1 characters in new ctype which were not in old ctype print: Added: ㋿ U+32FF SQUARE ERA NAME REIWA punct: Added 1 characters in new ctype which were not in old ctype punct: Added: ㋿ U+32FF SQUARE ERA NAME REIWA	2019-05-13 17:25:03 +02:00
TAMUKI Shoichi	466afec308	ja_JP locale: Add entry for the new Japanese era [BZ #22964 ] The Japanese era name will be changed on May 1, 2019. The Japanese government made a preliminary announcement on April 1, 2019. The glibc ja_JP locale must be updated to include the new era name for strftime's alternative year format support. Checked on x86_64-linux-gnu. Reviewed-by: Carlos O'Donell <carlos@redhat.com> ChangeLog: [BZ #22964] * localedata/locales/ja_JP (LC_TIME): Add entry for the new Japanese era. * time/tst-strftime2.c (dates): Add 2019-04-30 and 2019-05-01. (mkreftable): Add rules for the new Japanese era and the new dates.	2019-04-02 16:46:55 +09:00
Carlos O'Donell	62449176e0	Add verbose comments to 'era' in ja_JP locale. Reviewed-by: Rafal Luzynski <digitalfreak@lingonborough.com> Reviewed-by: TAMUKI Shoichi <tamuki@linet.gr.jp>	2019-04-01 15:14:16 -04:00
mansayk	57ada43c90	tt_RU: Fix orthographic mistakes in day and abday sections [BZ #24296 ] This commit fixes some errors and converts all weekday names to lowercase. The content is synchronized with CLDR-34 now, but trailing dots are removed from abday values in order to maintain consistency with the previous values and with many other locales which do the same. [BZ #24296] * localedata/locales/tt_RU (day): Update from CLDR-34, fix errors. (abday): Likewise, but remove the trailing dots.	2019-03-20 22:00:00 +01:00

1 2 3 4 5 ...

1751 Commits