Commit Graph

1462 Commits

Author SHA1 Message Date
Mike FABIAN
96480903ab Finish change from “Bengali” to “Bangla”
“Bengali” still remained in some comments in the bn_BD locale file,
in iso-639.def and in a test input file. Change it there as well.
“Bangla” is now used as the English name for this language in CLDR.

	[BZ #14925]
	* libio/tst-widetext.input: Change “Bengali” to “Bangla”.
	* locale/iso-639.def: Change “Bengali” to “Bangla”.
	* localedata/locales/bn_BD: “Bengali” was still used in some
	comments. Change it to “Bangla”.
2017-09-09 12:41:54 +02:00
Mike FABIAN
7c9c5be96f Use ASCII in LC_TIME of om_ET for better readability
* localedata/locales/om_ET (LC_TIME): use ASCII for better readability.
2017-09-07 09:44:50 +02:00
Akhilesh Kumar
ae214523ee Remove redundant LC_TIME data in om_KE which is identical to the data in om_ET
[BZ #22100]
	* locales/om_KE (LC_TIME): copy "om_ET".
2017-09-07 09:44:49 +02:00
Mike FABIAN
2ae5be041d Improve utf8_gen.py to set the width for characters with Prepended_Concatenation_Mark property to 1
[BZ #22070]
	* localedata/unicode-gen/utf8_gen.py: Set the width for
	characters with Prepended_Concatenation_Mark property to 1
	* localedata/charmaps/UTF-8: Updated using the improved script.
2017-09-06 12:39:49 +02:00
Mike FABIAN
af83ed5c46 Write all ranges of neighbouring characters with the same width using the range notation in charmaps/UTF-8
Writing ranges of neighbouring characters with the same with like this

    <U000E0100>...<U000E01EF>	0

in charmaps/UTF-8 is more efficient than writing many single character lines
like:

    <U000E0100>	0
    <U000E0101>	0
    ...

	[BZ #21750]
	* unicode-gen/utf8_gen.py: Write all ranges of neighbouring characters
	with the same width using the range notation in charmaps/UTF-8.
2017-09-06 12:37:49 +02:00
Mike FABIAN
3f802aeb34 Add miq_NI locale for Miskito
[BZ #20498]
	* locale/iso-639.def: add Miskito.
	* localedata/SUPPORTED: Add miq_NI/UTF-8.
	* localedata/locales/miq_NI: New file.
2017-09-05 15:04:23 +02:00
Zack Weinberg
ce870587d5 Move all old ChangeLogs to a top-level ChangeLog.old directory. 2017-09-01 09:31:43 -04:00
Mike FABIAN
8715f25630 Fix decimal point and thousands separator in es_CU locale to agree with CLDR.
[BZ #15332]
	* locales/es_CU (LC_MONETARY): use “,” for mon_decimal_point
	and “.” for mon_thousands_sep (to agree with CLDR)
	* locales/es_CU (LC_NUMERIC): Likewise.
2017-08-31 12:57:26 +02:00
Akhilesh Kumar
d6e290a6ca Remove redundant data in li_BE which is identical to the data in li_NL
[BZ #22044]
	* locales/li_BE (LC_TIME): copy "li_NL".
2017-08-31 08:22:55 +02:00
Mike FABIAN
8a0e619652 Use “.” in thousands_sep in the ca_ES locale
[BZ #16148]
	* locales/ca_ES (LC_NUMERIC): Use “.” in thousands_sep.
	And make it more readable by using ASCII.
2017-08-30 15:19:39 +02:00
Mike FABIAN
5f9409b787 Add country_name for mni_IN
[BZ #22022]
	* locales/mni_IN (LC_ADDRESS): Add country_name.
2017-08-30 12:27:46 +02:00
Akhilesh Kumar
232d919061 Fix abbreviated weeks and months for Somali to agree with CLDR
[BZ #22038]
	* locales/so_DJ (LC_TIME): Fix abday, abmon and
	make t_fmt in the comment agree with the value of t_fmt.
	* locales/so_ET (LC_TIME): Fix abday (From Axa to Axd)
	* locales/so_KE (LC_TIME): Fix abday (From Axa to Axd)
	* locales/so_SO (LC_TIME): Fix abday (From Axa to Axd)
2017-08-30 11:54:15 +02:00
Wei-Lun Chao
bdbf14d12e Update hanzi collation by stroke
[BZ #21951]
        * locales/cns11643_stroke: Update hanzi collation by stroke
        (Add license declaration, use strokes by component).
2017-08-29 23:34:48 +02:00
Mike FABIAN
f11f2f6e14 Use “,” as mon_decimal_point for ru_RU and os_RU locales (follow CLDR)
[BZ #13805]
	* locales/ru_RU (LC_MONETARY): Use “,” for mon_decimal_point
	(to agree with CLDR).
	* locales/ru_RU (LC_NUMERIC): Write mon_decimal_point in ASCII
	for readability.
	* locales/os_RU (LC_MONETARY): Copy from ru_RU,
	makes it agree with CLDR.
2017-08-29 13:56:41 +02:00
Akhilesh Kumar
f8de9568d8 Add locale for mfe_MU
Add locale for “Morisyen” which is also called “Mauritian Creole”
and is spoken in Mauritius.

	[BZ #21971]
	* localedata/SUPPORTED: Add mfe_MU/UTF-8.
	* localedata/locales/mfe_MU: New File.

	[BZ #21971]
	* locale/iso-639.def: add Morisyen.
2017-08-29 13:56:41 +02:00
Mike FABIAN
15cf43a999 Use NFC for the strings in “day”, “yesstr”, and “nostr” in the sgs_LT locale
* locales/sgs_LT (LC_TIME): Use NFC for the strings in “day”.
	* locales/sgs_LT (LC_MESSAGES): Use NFC for “yesstr” and “nostr”.
2017-08-29 13:55:42 +02:00
Akhilesh Kumar
56bc7f4360 Fix country_name in zu_ZA to agree with CLDR
[BZ #21959]
	* locales/zu_ZA (LC_ADDRESS): Fix country name to agree with CLDR.
2017-08-28 17:51:08 +02:00
Mike FABIAN
486afa6d27 Use the range notation in charmaps/UTF-8 for all ranges of neighbouring characters with the same width
[BZ #21750]
	* charmaps/UTF-8: Use the range notation for all ranges
	of neighbouring characters with the same width.
2017-08-18 13:44:40 +02:00
Akhilesh Kumar
bb6274ee12 Fix abmon for bem_ZM
Until now the abbreviated month names were in English.

	[BZ #21960]
	* locales/bem_ZM (LC_TIME): Fix abmon, make it agree with CLDR.
2017-08-17 11:06:08 +02:00
Akhilesh Kumar
c14b84baae Fix country name for xh_ZA
[BZ #21959]
	* locales/xh_ZA (LC_ADDRESS): Fix country name.
2017-08-17 11:06:08 +02:00
Thorsten Glaser
7a79e321c6 Refresh generated charmap data and ChangeLog
[BZ #21750]
	* charmaps/UTF-8: Refresh.
2017-08-17 11:06:08 +02:00
Thorsten Glaser
267ee5d7ab Resolve some historically special cases of ambiguous width
[BZ #21750]
* unicode-gen/utf8_gen.py (U+00AD): Set width to 1.
* unicode-gen/utf8_gen.py (U+1160..U+11FF): Set width to 0.
* unicode-gen/utf8_gen.py (U+3248..U+324F): Set width to 2.
* unicode-gen/utf8_gen.py (U+4DC0..U+4DFF): Likewise.
2017-08-17 11:06:08 +02:00
Thorsten Glaser
41b6f0ce85 Handle more cases of combining characters
[BZ #21750]
* unicode-gen/utf8_gen.py: Treat category Me and Mn as combining.
2017-08-17 11:06:08 +02:00
Thorsten Glaser
580be3035d UnicodeData has precedence over EastAsianWidth
[BZ #19852]
[BZ #21750]
* unicode-gen/utf8_gen.py: Process EastAsianWidth lines before
  UnicodeData lines so the latter have precedence; remove hack
  to group output by EastAsianWidth ranges.
2017-08-17 11:06:08 +02:00
Mike FABIAN
54d68f5f61 Change language name in LC_IDENTIFICATION of bn_BD and bn_IN from “Bengali” to “Bangla”
[BZ #14925]
	* locales/bn_BD (LC_IDENTIFICATION): Change language name in
	“title” and “language” from Bengali to Bangla.
	* locales/bn_IN (LC_IDENTIFICATION): Likewise.
2017-08-14 19:48:27 +02:00
Mike FABIAN
dbb488ff46 Use “copy "i18n"” in km_KH locale
The custom stuff which was in LC_CTYPE of the km_KH locale seems
to be a very incomplete subset of what one gets by using
“copy "i18n"”. I cannot find anything special there which is not
in “copy "i18n"”, only lots of stuff which is missing.

	[BZ #20008]
	* locales/km_KH (LC_CTYPE): Use “copy "i18n"”.
2017-08-14 19:14:38 +02:00
Mike FABIAN
2a124c6163 Use two letter abbreviations in abday in all German locales.
[BZ #20482]
	* locales/de_AT (LC_TIME): Use 2 letter abbreviations in abday.
	* locales/de_BE (LC_TIME): Use 2 letter abbreviations in abday.
	* locales/de_CH (LC_TIME): Use 2 letter abbreviations in abday.
	* locales/de_DE (LC_TIME): Use readable ASCII in abday.
	* locales/de_IT (LC_TIME): Use readable ASCII in abday.
	* locales/de_LU (LC_TIME): Use 2 letter abbreviations in abday.
2017-08-14 17:12:37 +02:00
Julen Ruiz Aizpuru
66ba2e4ae5 Fix thousands_sep and grouping and use readable ASCII for decimal_point in eu_ES locale
[BZ #12349]
	* locales/eu_ES (LC_NUMERIC): Fix thousands_sep and grouping
	and use readable ASCII for decimal_point.
2017-08-14 15:58:32 +02:00
Mike FABIAN
a6bd872286 Adapt test case data to the changes in the thousands separators
[BZ #20756]
	* localedata/tst-langinfo.sh: Adapt test case data.
	* stdlib/tst-strfmon_l.c: Likewise.
	* stdlib/tst-strtod4.c: Likewise.
	* stdlib/tst-strtod5i.c: Likewise.
2017-08-12 17:20:24 +02:00
Mike FABIAN
d68b451903 Use U+202F instead of U+2009 as thousands_sep for es_MX
See also [BZ #20756].

U+202F NARROW NO-BREAK SPACE: a narrow form of a no-break space,
typically the width of a thin space or a mid space.

U+2009 THIN SPACE.
2017-08-12 16:03:49 +02:00
Stanislav Brabec
70a6707fa1 Locales: Use Unicode wise thousands separator
Many languages use small gap as thousands separator.

Thousands separator should not be a plain space, but a narrow space.
And additionally, it is not allowed to wrap line in the middle of the
number.

Locale data were created in a deep age of 8-bit encodings, so most of
them use space (incorrect: it allows wrapping the line in the middle
of the number), or NBSP (better, but typographically incorrect: space
between groups is too wide).

Now UNICODE is widely supported, so we should leave legacy characters
in favor of correct UNICODE character.

UNICODE has a dedicated character for this purpose:

NNBSP
U+202F NARROW NO-BREAK SPACE: a narrow form of a no-break space,
typically the width of a thin space or a mid space

The NNBSP exists since Unicode 3.0.

Use of NNBSP will prevent line wrapping in the midle of number and
improve readability of numbers.

	[BZ #20756]
	* locales/aa_DJ (LC_MONETARY): Replace space by NNBSP as thousands separator.
	* locales/az_AZ (LC_MONETARY): Likewise.
	* locales/be_BY (LC_MONETARY): Likewise.
	* locales/be_BY@latin (LC_MONETARY): Likewise.
	* locales/bg_BG (LC_MONETARY): Likewise.
	* locales/bs_BA (LC_MONETARY): Likewise.
	* locales/ce_RU (LC_MONETARY): Likewise.
	* locales/crh_UA (LC_MONETARY): Likewise.
	* locales/cs_CZ (LC_MONETARY): Likewise.
	* locales/cs_CZ (LC_NUMERIC): Likewise.
	* locales/cv_RU (LC_MONETARY): Likewise.
	* locales/de_AT (LC_MONETARY): Likewise.
	* locales/eo (LC_MONETARY): Likewise.
	* locales/es_CR (LC_MONETARY): Likewise.
	* locales/es_CR (LC_NUMERIC): Likewise.
	* locales/es_CU (LC_MONETARY): Likewise.
	* locales/et_EE (LC_MONETARY): Likewise.
	* locales/et_EE (LC_NUMERIC): Likewise.
	* locales/fi_FI (LC_MONETARY): Likewise.
	* locales/fi_FI (LC_NUMERIC): Likewise.
	* locales/fr_CA (LC_MONETARY): Likewise.
	* locales/fr_FR (LC_MONETARY): Likewise.
	* locales/fr_FR (LC_NUMERIC): Likewise.
	* locales/fr_LU (LC_MONETARY): Likewise.
	* locales/fr_LU (LC_NUMERIC): Likewise.
	* locales/hr_HR (LC_MONETARY): Likewise.
	* locales/ht_HT (LC_NUMERIC): Likewise.
	* locales/kk_KZ (LC_MONETARY): Likewise.
	* locales/kk_KZ (LC_NUMERIC): Likewise.
	* locales/ky_KG (LC_MONETARY): Likewise.
	* locales/ky_KG (LC_NUMERIC): Likewise.
	* locales/lv_LV (LC_MONETARY): Likewise.
	* locales/lv_LV (LC_NUMERIC): Likewise.
	* locales/mg_MG (LC_MONETARY): Likewise.
	* locales/mhr_RU (LC_MONETARY): Likewise.
	* locales/mk_MK (LC_MONETARY): Likewise.
	* locales/mk_MK (LC_NUMERIC): Likewise.
	* locales/mn_MN (LC_MONETARY): Likewise.
	* locales/nb_NO (LC_MONETARY): Likewise.
	* locales/nb_NO (LC_NUMERIC): Likewise.
	* locales/nl_AW (LC_MONETARY): Likewise.
	* locales/nl_NL (LC_MONETARY): Likewise.
	* locales/nn_NO (LC_MONETARY): Likewise.
	* locales/os_RU (LC_MONETARY): Likewise.
	* locales/pap_AW (LC_MONETARY): Likewise.
	* locales/pap_CW (LC_MONETARY): Likewise.
	* locales/ru_RU (LC_MONETARY): Likewise.
	* locales/ru_RU (LC_NUMERIC): Likewise.
	* locales/ru_UA (LC_MONETARY): Likewise.
	* locales/sk_SK (LC_MONETARY): Likewise.
	* locales/sk_SK (LC_NUMERIC): Likewise.
	* locales/sl_SI (LC_MONETARY): Likewise.
	* locales/sl_SI (LC_NUMERIC): Likewise.
	* locales/sq_MK (LC_MONETARY): Likewise.
	* locales/sv_SE (LC_MONETARY): Likewise.
	* locales/sv_SE (LC_NUMERIC): Likewise.
	* locales/tg_TJ (LC_MONETARY): Likewise.
	* locales/tt_RU (LC_MONETARY): Likewise.
	* locales/tt_RU@iqtelif (LC_MONETARY): Likewise.
	* locales/uk_UA (LC_MONETARY): Likewise.
	* locales/uk_UA (LC_NUMERIC): Likewise.
	* locales/unm_US (LC_MONETARY): Likewise.
	* locales/unm_US (LC_NUMERIC): Likewise.
	* locales/wo_SN (LC_MONETARY): Likewise.
2017-08-12 16:03:49 +02:00
Akhilesh Kumar
4169825556 Remove redundant data for LC_MONETARY for Indian locales
Reference is taken from
	https://en.wikipedia.org/wiki/Indian_numbering_system
	https://en.wikipedia.org/wiki/Indian_rupee

	CLDR has the currency format pattern “¤#,##,##0.00”.

	[BZ #21836]
	* locales/ar_IN (LC_MONETARY) : copy "hi_IN"
	* locales/as_IN (LC_MONETARY) : copy "hi_IN"
	* locales/bhb_IN (LC_MONETARY): copy "hi_IN"
	* locales/bn_IN (LC_MONETARY) : copy "hi_IN"
	* locales/en_IN (LC_MONETARY) : copy "hi_IN"
	* locales/gu_IN (LC_MONETARY) : copy "hi_IN"
	* locales/hi_IN (LC_MONETARY) : Fix mon_grouping,
	p_sep_by_space and n_sep_by_space
	* locales/kn_IN (LC_MONETARY) : copy "hi_IN"
	* locales/kok_IN(LC_MONETARY) : copy "hi_IN"
	* locales/ks_IN (LC_MONETARY) : copy "hi_IN"
	* locales/ml_IN (LC_MONETARY) : copy "hi_IN"
	* locales/mr_IN (LC_MONETARY) : copy "hi_IN"
	* locales/or_IN (LC_MONETARY) : copy "hi_IN"
	* locales/pa_IN (LC_MONETARY) : copy "hi_IN"
	* locales/sa_IN (LC_MONETARY) : copy "hi_IN"
	* locales/sd_IN (LC_MONETARY) : copy "hi_IN"
	* locales/ta_IN (LC_MONETARY) : copy "hi_IN"
	* locales/tcy_IN(LC_MONETARY) : copy "hi_IN"
	* locales/te_IN (LC_MONETARY) : copy "hi_IN"
	* locales/ur_IN (LC_MONETARY) : copy "hi_IN"
2017-08-10 13:00:31 +02:00
Wei-Lun Chao
38dbcacb60 cmn_TW: add hanzi collation
[BZ #17563]
	[BZ #16905]
	* locales/cmn_TW (LC_COLLATE): Use cns11643_stroke file for sorting.
	* locales/cmn_TW (LC_TIME): Improve time and date formats.
	* locales/cmn_TW (LC_MESSAGES): Add  yesstr and nostr.
	* locales/cns11643_stroke: New file, stroke count collation for
	traditional Chinese.
2017-08-10 12:18:11 +02:00
Akhilesh Kumar
756d169989 Fix p_cs_precedes, n_cs_precedes, and n_sign_posn for mt_MT
Positive Currency Format: €500.00
Negative Currency Format: -€500.00
References:
http://www.kunsilltalmalti.gov.mt/filebank/documents/rapportdwarlismijiettalmunitaewropea.pdf, page 7, top right.
CLDR has “¤#,##0.00” as the currency format pattern.

	[BZ #21920]
	* locales/mt_MT (LC_MONETARY): Fix p_cs_precedes/n_cs_precedes.
	* locales/mt_MT (LC_MONETARY): Fix n_sign_posn.
2017-08-09 11:02:03 +02:00
Mike FABIAN
39b20aae21 Remove “% Charset: ...” comments from locale sources
These comments are useless and only confusing.  The encodings used to
create binary locales from source locales are listed in the
localedata/SUPPORTED file.  The source files itself are ASCII or UTF-8
encoded where non-ASCII UTF-8 is currently only used in comments. If
all locale source files are UTF-8 anyway, there is no need to specify
that in a special comment.
2017-08-09 11:02:03 +02:00
Akhilesh Kumar
e119dfcfd6 Added new locale en_SC
New locale is added for the Seychelles which is a member of the African
Union. English is an offical language for the Seychelles.

	[BZ #21854]
	* locales/en_SC: New file.
	* localedata/SUPPORTED : Add en_SC/UTF-8.
2017-08-09 11:02:03 +02:00
Andreas Schwab
d5afb38503 Fix missing redirects in testsuite targets 2017-08-07 18:13:36 +02:00
Mike FABIAN
08897a57a8 Add yesstr and nostr for doi_IN, kok_IN, and sat_IN
For the locales doi_IN,  kok_IN, and sat_IN, the words for
	“yes” and “no” were apparently in yesexpr and noexpr.
	Copy them from there to add yesstr and nostr.
	Also make yesexpr and noexpr more readable by using
	the POSIX portable character set.

	* locales/doi_IN (LC_MESSAGES): Add yesstr and nostr.
	* locales/kok_IN (LC_MESSAGES): Add yesstr and nostr.
	* locales/sat_IN (LC_MESSAGES): Add yesstr and nostr.
2017-08-03 17:30:11 +02:00
Mike FABIAN
2aad4b04ad Revert “Fix yesexpr in en_DK locale”
This reverts commit 8f75515080

	Revert “Fix yesexpr in en_DK locale”.

	* locales/en_DK (LC_MESSAGES): Restore original yesexpr, noexpr,
	yesstr, nostr. Convert them to ASCII and add a comment why
	we want to have them like this.
2017-08-03 13:39:54 +02:00
Mike FABIAN
1f22702ea2 drop redundant ([eE][sS])? and [oO]? from yesexpr and noexpr in km_KH locale
* locales/km_KH (LC_MESSAGES): drop redundant ([eE][sS])? and [oO]?
	from yesexpr and noexpr
2017-08-03 12:11:35 +02:00
Mike FABIAN
a0b7f996b7 Drop redundant .* from yesexpr and noexpr in agr_PE locale
And make the expressions more readable by using the POSIX portable character set
instead of Unicode code points.

	* locales/agr_PE (LC_MESSAGES): drop .* from yesexpr and noexpr
2017-08-03 12:11:35 +02:00
Akhilesh Kumar
6244070d41 Added new Locale bho_NP
Bhojpuri is one of the most common languages in Nepal and India.
http://codefornepal.org/en/2014/02/top-10-languages-spoken-as-mother-tongues-in-nepal/
https://en.wikipedia.org/wiki/Bhojpuri_language

Added "bho_NP" locale and Reference is taken form "bho_IN".

	[BZ #21845]
	* locales/bho_NP: New file, Bhojpuri locale for Nepal.
	* SUPPORTED: Add bho_NP/UTF-8
2017-08-03 12:11:35 +02:00
Akhilesh Kumar
8f75515080 Fix yesexpr in en_DK locale
[BZ #21867]
	* locales/en_DK (LC_MESSAGES): Fix yesexpr
2017-08-03 12:11:35 +02:00
Akhilesh Kumar
1b2be2732f Fix abday in zh_SG
Make it the same as in zh_CN and zh_TW which agrees with "narrow" in CLDR.

	[BZ #21853]
	* locales/zh_SG (LC_TIME): Fix abday
2017-08-03 12:11:35 +02:00
Steve Ellcey
b7f95f493b Remove CFLAG settings that turn off warning (no longer needed).
2017-08-02  Steve Ellcey  <sellcey@cavium.com>

	* localedata/Makefile (CFLAGS-tst_iswalnum.c, CFLAGS-tst_iswalpha.c
	CFLAGS-tst_iswcntrl.c, CFLAGS-tst_iswdigit.c, CFLAGS-tst_iswgraph.c,
	CFLAGS-tst_iswlower.c, CFLAGS-tst_iswprint.c, CFLAGS-tst_iswpunct.c,
	CFLAGS-tst_iswspace.c, CFLAGS-tst_iswupper.c, CFLAGS-tst_iswxdigit.c,
	CFLAGS-tst_towlower.c, CFLAGS-tst_towupper.c): Remove.
2017-08-02 12:57:58 -07:00
Akhilesh Kumar
10b8eb8fcc Fix country name in title of mai_NP locale
[BZ #21848]
	* locales/mai_NP (LC_IDENTIFICATION): Fix country name in title.
2017-07-27 16:24:07 +02:00
Ihar Hrachyshka
551d64b2d7 Improve country_name in be_BY@latin 2017-07-27 16:11:04 +02:00
Mike FABIAN
c5527d7ddc Minor improvements to new az_IR locale
* locales/az_IR (LC_MESSAGES): Improve yesexpr and noexpr.
	* locales/az_IR (LC_ADDRESS): Fix typo in comment  and
	use the individual iso-639-3 code for South Azerbaijani
	"azb" in lang_term.
	* locales/az_IR (LC_NAME): Improve readability of name_fmt in source.
2017-07-27 16:11:04 +02:00
Mousa Moradi
30c0c707f7 Add new az_IR locale
[BZ #14172]
	* locales/az_IR: New file.
2017-07-27 16:10:12 +02:00
Akhilesh Kumar
9e17c7aee3 Fix wrong monetary system used in ta_LK locale
[BZ #21839]
	* locales/ta_LK (LC_MONETARY): copy "si_LK"
2017-07-27 16:10:12 +02:00