Commit Graph

32 Commits

Author SHA1 Message Date
Mike FABIAN
5176a830e7 localedata: Use consistent values for grouping and mon_grouping
Resolves: BZ # 31205

Adapt test cases in test-grouping_iterator.c
2024-01-25 11:41:02 +01:00
Mike FABIAN
c0c259c3bd localedata: revert all the remaining locale sources to UTF-8 2024-01-11 15:04:25 +01:00
Rafał Lużyński
75ba929987 Multiple locales: Add date_fmt (bug 24054)
It is not specified what should be the content of d_t_fmt and date_fmt
but in the built-in C locale those fields have only one difference:
date_fmt contains "%Z" (the current time zone) while d_t_fmt does not.

For most of the locales this commit does the following operation:
copy d_t_fmt to date_fmt, and then remove "%Z" from d_t_fmt.
If "%Z" was originally missing from d_t_fmt add it to date_fmt.
It also corrects comments where necessary.

Exceptions:

* In bo_CN, dz_BT, and km_KH "%Z" has not been added to date_fmt because
  it was too difficult.  In these locales date_fmt has been set to the
  copy of d_t_fmt.
* In en_DK "%Z" has not been removed from d_t_fmt in order to preserve
  the conformance with the standard mentioned in the comment.

The command to identify and initially edit the locales that need the
update was:

    for i in `grep -lw d_t_fmt *`
    do
        if ! grep -qw date_fmt $i ; then
            awk '/d_t_fmt/ { print $0; gsub("d_t_fmt", "date_fmt"); } //{ print $0 }' < $i > $i.next
            mv $i.next $i
        fi
    done

and then each file was further edited manually.
2020-01-02 11:45:45 +01:00
Rafał Lużyński
20a740b2b2 km_KH locale: Use "%M" instead of "m" in d_t_fmt (bug 25323)
A quick analysis suggests that the original author meant "%M" (minutes
format specifier) instead of "m" which is just a literal "m" letter.
2019-12-30 11:48:19 +01:00
Mike FABIAN
b110af8278 km_KH locale: remove tags which were using Lao characters [BZ #13953]
[BZ #13953]
	* localedata/locales/km_KH: Use ASCII as much
	as possible for better readability of the source and
	remove useless comments.
	* localedata/locales/km_KH (LC_TIME): Remove era stuff, it
	was commented out and apparently wrong anyway because it was
	using Lao characters. If Buddhist era should be used
	for km_KH, a native speaker should write the correct formaat
	for Khmer.
	* localedata/locales/km_KH (LC_TIME): Add first_weekday 1
	(According to CLDR, the first weekday for Cambodia is Sunday).
	* localedata/locales/km_KH (LC_NAME): Remove name_mr and name_mrs
	(These were using Lao characters which must be wrong. If we get
	the correct data from a native speaker, we could add it back, until
	then it is better not to have name_mr and name_mrs at all than
	having it wrong).
2017-10-30 11:46:05 +01:00
Mike FABIAN
1e5971ec14 Fix all remaining issues of [BZ #15260]
[BZ #15260]
	* localedata/locales/doi_IN (LC_MESSAGES): Match only for the
	first letters of yesstr and nostr in yesexpr and noexpr,
	not for the full words.
	* localedata/locales/hne_IN (LC_MESSAGES): Likewise.
	* localedata/locales/kok_IN (LC_MESSAGES): Likewise.
	* localedata/locales/mr_IN (LC_MESSAGES): Likewise.
	* localedata/locales/sat_IN (LC_MESSAGES): Likewise.
	* localedata/locales/km_KH (LC_MESSAGES): Match also for the
	first letters of yesstr and nostr in yesexpr and noexpr,
	until now only English was matched in yesexpr and noexpr.
	* localedata/locales/tl_PH (LC_MESSAGES): Use “copy "fil_PH"”
	instead of “copy "en_US"”. CLDR has yesstr and nostr data for
	fil but not for tl. As tl and fil are very similar, using fil
	is probably better than using English.
2017-10-27 10:53:35 +02:00
Mike FABIAN
fe043d8af2 km_KH locale: Fix yesstr and nostr.
* localedata/locales/km_KH (LC_MESSAGES): Fix yestr and nostr.
	The yesstr and nostr apparently	came from CLDR. And CLDR has a bug there:
	these strings contain a U+17D6 (which somewhat looks like a colon)
	instead of a real colon to separate the full words for “yes”
	and “no” from the single letter responses.
2017-10-25 10:49:36 +02:00
Mike FABIAN
dbb488ff46 Use “copy "i18n"” in km_KH locale
The custom stuff which was in LC_CTYPE of the km_KH locale seems
to be a very incomplete subset of what one gets by using
“copy "i18n"”. I cannot find anything special there which is not
in “copy "i18n"”, only lots of stuff which is missing.

	[BZ #20008]
	* locales/km_KH (LC_CTYPE): Use “copy "i18n"”.
2017-08-14 19:14:38 +02:00
Mike FABIAN
39b20aae21 Remove “% Charset: ...” comments from locale sources
These comments are useless and only confusing.  The encodings used to
create binary locales from source locales are listed in the
localedata/SUPPORTED file.  The source files itself are ASCII or UTF-8
encoded where non-ASCII UTF-8 is currently only used in comments. If
all locale source files are UTF-8 anyway, there is no need to specify
that in a special comment.
2017-08-09 11:02:03 +02:00
Mike FABIAN
1f22702ea2 drop redundant ([eE][sS])? and [oO]? from yesexpr and noexpr in km_KH locale
* locales/km_KH (LC_MESSAGES): drop redundant ([eE][sS])? and [oO]?
	from yesexpr and noexpr
2017-08-03 12:11:35 +02:00
Mike FABIAN
ec1cff5a6b Remove erroneous spaces from some strings in locale files
* locales/hif_FJ (LC_TIME): Remove erroneous spaces from d_t_fmt.
	* locales/km_KH (LC_TIME): Remove erroneous space from d_t_fmt.
	* locales/sm_WS (LC_TIME): Remove erroneous spaces from day.
2017-07-26 16:47:00 +02:00
Mike Frysinger
867c5c33e2 localedata: CLDRv29: update LC_ADDRESS.lang_name translations
This updates a bunch of locales based on CLDR v29 data:
  az_AZ: changing Azərbaycanca to azərbaycan dili
  be_BY: changing беларуская мова to беларуская
  bem_ZM: changing iciBemba to Ichibemba
  bg_BG: changing български език to български
  bo_CN: changing པོད་སྐད་ to བོད་སྐད་
  bo_IN: changing པོད་སྐད་ to བོད་སྐད་
  br_FR: changing Brezhoneg to brezhoneg
  brx_IN: lang_name: setting to बड़ो
  ce_RU: changing нохчийн мотт to нохчийн
  cs_CZ: changing Čeština to čeština
  dz_BT: changing (རྫོང་ཁ to རྫོང་ཁ
  el_CY: changing ελληνικά to Ελληνικά
  el_GR: changing ελληνικά to Ελληνικά
  es_AR: changing Español to español
  es_BO: changing Español to español
  es_CL: changing Español to español
  es_CO: changing Español to español
  es_CR: changing Español to español
  es_CU: changing Español to español
  es_DO: changing Español to español
  es_EC: changing Español to español
  es_ES: changing Español to español
  es_GT: changing Español to español
  es_HN: changing Español to español
  es_MX: changing Español to español
  es_NI: changing Español to español
  es_PA: changing Español to español
  es_PE: changing Español to español
  es_PR: changing Español to español
  es_PY: changing Español to español
  es_SV: changing Español to español
  es_US: changing Español to español
  es_UY: changing Español to español
  es_VE: changing Español to español
  et_EE: changing eesti keel to eesti
  eu_ES: changing Euskara to euskara
  fr_BE: changing Français to français
  fr_CA: changing Français to français
  fr_CH: changing Français to français
  fr_FR: changing Français to français
  fr_LU: changing Français to français
  fur_IT: changing Furlan to furlan
  fy_NL: changing Frysk to West-Frysk
  gl_ES: changing Galego to galego
  gv_GB: changing y Ghaelg to Gaelg
  he_IL: lang_name: setting to עברית
  hsb_DE: changing Hornjoserbšćina to hornjoserbšćina
  hy_AM: changing Հայերեն to հայերեն
  id_ID: changing Bahasa Indonesia to Bahasa Indonesia
  it_CH: changing Italiano to italiano
  it_IT: changing Italiano to italiano
  kl_GL: changing Kalaallisut to kalaallisut
  km_KH: changing ភាសាខ្មែរ to ខ្មែរ
  ko_KR: changing 한국말 to 한국어
  ks_IN: changing kạ̄šur to کٲشُر
  kw_GB: changing Kernowek to kernewek
  ky_KG: changing Кыргызча to кыргызча
  lg_UG: changing Oluganda to Luganda
  lt_LT: changing lietuvių kalba to lietuvių
  lv_LV: changing latviešu valoda to latviešu
  mk_MK: changing македонск/и јазик to македонски
  mn_MN: changing Монгол хэл to монгол
  nb_NO: changing Bokmål to norsk bokmål
  nn_NO: changing Nynorsk to nynorsk
  os_RU: lang_name: setting to ирон
  ru_RU: lang_name: setting to русский
  ru_UA: lang_name: setting to русский
  se_NO: changing Davvisámegiella to davvisámegiella
  sk_SK: lang_name: setting to slovenčina
  ta_IN: lang_name: setting to தமிழ்
  ta_LK: lang_name: setting to தமிழ்
  tk_TM: changing Türkmençe to türkmençe
  tr_CY: changing Turkish to Türkçe
  tr_TR: changing Turkish to Türkçe
  ur_IN: lang_name: setting to {اردو}
  ur_PK: lang_name: setting to {اردو}
  vi_VN: changing Việt ngữ to Tiếng Việt
  yo_NG: changing Yorùbá to Èdè Yorùbá
  zu_ZA: changing IsiZulu to isiZulu

Most of these are simple case changes, but they match the CLDR db.
A search for a few of the others suggests they're also correct.
2017-06-30 06:41:49 +02:00
Rafal Luzynski
d4e0331b0a localedata: Months updated from CLDR - other scripts [BZ #21217]
* localedata/locales/am_ET (mon): Updated from CLDR.
* localedata/locales/km_KH (mon): Likewise.
* localedata/locales/lo_LA (mon): Likewise.
* localedata/locales/si_LK (mon): Likewise.
* localedata/locales/yi_US (mon): Likewise.
* localedata/locales/yue_HK (mon): Likewise.
2017-06-27 16:50:33 +02:00
Mike Frysinger
00277a3f81 localedata: CLDRv29: update LC_MESSAGES yes/no strings [BZ #15264] [BZ #16975]
The yes/no strings should be based on the dictionary words.  That means
they are capitalized based on the dictionary rather than position in the
sentence (e.g. the first word).
  bo_CN: nostr: changing མེན to མིན།
  bo_CN: yesstr: changing ཨིན to ཡིན།
  dz_BT: nostr: changing མེན to མེན་
  dz_BT: yesstr: changing ཨིན to ཨིན་
  en_CA: yesstr: changing Yes to yes
  en_CA: nostr: changing No to no
  en_US: yesstr: changing Yes to yes
  en_US: nostr: changing No to no
  es_ES: nostr: changing No to no
  es_ES: yesstr: changing Si to sí
  fi_FI: nostr: changing Ei to ei
  fi_FI: yesstr: changing Kyllä to kyllä
  ig_NG: yesstr: changing Ee to Eye
  ko_KR: nostr: changing 아니오 to 아니요
  ky_KG: nostr: changing Жок to жок
  ky_KG: yesstr: changing Ооба to ооба
  ms_MY: nostr: changing Tidak to tidak
  ms_MY: yesstr: changing Ya to ya
  te_IN: nostr: changing కాదు to వద్దు
  te_IN: yesstr: changing అవను to అవును
  ur_PK: nostr: changing نهيں to نہیں
  ur_PK: yesstr: changing بلكل to ہاں
  uz_UZ: nostr: changing Yo'q to yo‘q
  uz_UZ: yesstr: changing Ha to ha
  uz_UZ@cyrillic: nostr: changing Йўқ to йўқ
  uz_UZ@cyrillic: yesstr: changing Ҳа to ҳа
  wae_CH: nostr: changing Nei to nei
  wae_CH: yesstr: changing Ja to ja
  yo_NG: nostr: changing Bẹ́ẹ̀ kọ́ to Bẹ́ẹ̀kọ́
  yo_NG: yesstr: changing Bẹ́ẹ̀ ni to Bẹ́ẹ̀ni

Some of the translations were just wrong.
  el_GR: nostr: changing no to όχι
  el_GR: yesstr: changing yes to ναι
  km_KH: nostr: changing no:NO:n:N to ទេ​៖ n
  km_KH: yesstr: changing yes:YES:y:Y to បាទ/ចាស​៖ y
  ug_CN: nostr: changing No to ياق
  ug_CN: yesstr: changing Yes to ھەئە

Add missing translations for a number of locales:
  af_ZA: nostr: setting to nee
  af_ZA: yesstr: setting to ja
  am_ET: nostr: setting to አይ
  am_ET: yesstr: setting to አዎን
  ast_ES: nostr: setting to non
  ast_ES: yesstr: setting to sí
  be_BY: nostr: setting to не
  be_BY: yesstr: setting to так
  bem_ZM: nostr: setting to Awe
  bem_ZM: yesstr: setting to Ee
  bg_BG: nostr: setting to не
  bg_BG: yesstr: setting to да
  brx_IN: nostr: setting to नहीं
  brx_IN: yesstr: setting to हाँ
  bs_BA: nostr: setting to ne
  bs_BA: yesstr: setting to da
  ca_ES: nostr: setting to no
  ca_ES: yesstr: setting to sí
  da_DK: nostr: setting to nej
  da_DK: yesstr: setting to ja
  de_DE: nostr: setting to nein
  de_DE: yesstr: setting to ja
  en_DK: nostr: setting to yes
  en_DK: yesstr: setting to no
  et_EE: nostr: setting to ei
  et_EE: yesstr: setting to jah
  eu_ES: nostr: setting to ez
  eu_ES: yesstr: setting to bai
  fa_IR: nostr: setting to نه
  fa_IR: yesstr: setting to بله
  ff_SN: nostr: setting to Alaa
  ff_SN: yesstr: setting to Eey
  fo_FO: nostr: setting to nei
  fo_FO: yesstr: setting to já
  fr_BE: nostr: setting to non
  fr_BE: yesstr: setting to oui
  fr_CH: nostr: setting to non
  fr_CH: yesstr: setting to oui
  fr_FR: nostr: setting to non
  fr_FR: yesstr: setting to oui
  fr_LU: nostr: setting to non
  fr_LU: yesstr: setting to oui
  fur_IT: nostr: setting to no
  fur_IT: yesstr: setting to sì
  fy_DE: nostr: setting to nee
  fy_DE: yesstr: setting to ja
  ga_IE: nostr: setting to níl
  ga_IE: yesstr: setting to tá
  gd_GB: nostr: setting to chan eil
  gd_GB: yesstr: setting to tha
  gl_ES: nostr: setting to non
  gl_ES: yesstr: setting to si
  gu_IN: nostr: setting to નહીં
  gu_IN: yesstr: setting to હા
  he_IL: nostr: setting to לא
  he_IL: yesstr: setting to כן
  hi_IN: nostr: setting to नहीं
  hi_IN: yesstr: setting to हाँ
  hr_HR: nostr: setting to ne
  hr_HR: yesstr: setting to da
  hu_HU: nostr: setting to nem
  hu_HU: yesstr: setting to igen
  id_ID: nostr: setting to tidak
  id_ID: yesstr: setting to ya
  is_IS: nostr: setting to nei
  is_IS: yesstr: setting to já
  it_CH: nostr: setting to no
  it_CH: yesstr: setting to sì
  it_IT: nostr: setting to no
  it_IT: yesstr: setting to sì
  ka_GE: nostr: setting to არა
  ka_GE: yesstr: setting to კი
  kk_KZ: nostr: setting to жоқ
  kk_KZ: yesstr: setting to иә
  kl_GL: nostr: setting to naagga
  kl_GL: yesstr: setting to aap
  kn_IN: nostr: setting to ಇಲ್ಲ
  kn_IN: yesstr: setting to ಹೌದು
  ko_KR: yesstr: setting to 예
  lb_LU: nostr: setting to nee
  lb_LU: yesstr: setting to jo
  lg_UG: nostr: setting to Nedda
  lg_UG: yesstr: setting to Ye
  lt_LT: nostr: setting to ne
  lt_LT: yesstr: setting to taip
  lv_LV: nostr: setting to nē
  lv_LV: yesstr: setting to jā
  mg_MG: nostr: setting to Tsia
  mg_MG: yesstr: setting to Eny
  mn_MN: nostr: setting to үгүй
  mn_MN: yesstr: setting to тийм
  mr_IN: nostr: setting to नाहीःना
  mr_IN: yesstr: setting to होयःहो
  mt_MT: nostr: setting to le
  mt_MT: yesstr: setting to iva
  nb_NO: nostr: setting to nei
  nb_NO: yesstr: setting to ja
  ne_NP: nostr: setting to होइन
  ne_NP: yesstr: setting to हो
  nl_NL: nostr: setting to nee
  nl_NL: yesstr: setting to ja
  nn_NO: nostr: setting to nei
  nn_NO: yesstr: setting to ja
  or_IN: nostr: setting to ନା
  or_IN: yesstr: setting to ହଁ
  os_RU: nostr: setting to нӕйы
  os_RU: yesstr: setting to уойы
  pa_IN: nostr: setting to ਨਹੀਂ
  pa_IN: yesstr: setting to ਹਾਂ
  pl_PL: nostr: setting to nie
  pl_PL: yesstr: setting to tak
  pt_BR: nostr: setting to não
  pt_BR: yesstr: setting to sim
  pt_PT: nostr: setting to não
  pt_PT: yesstr: setting to sim
  ro_RO: nostr: setting to nu
  ro_RO: yesstr: setting to da
  ru_RU: nostr: setting to нет
  ru_RU: yesstr: setting to да
  ru_UA: nostr: setting to нет
  ru_UA: yesstr: setting to да
  se_NO: nostr: setting to ii
  se_NO: yesstr: setting to jo
  sl_SI: nostr: setting to ne
  sl_SI: yesstr: setting to da
  so_DJ: nostr: setting to maya
  so_DJ: yesstr: setting to haa
  so_SO: nostr: setting to maya
  so_SO: yesstr: setting to haa
  sq_AL: nostr: setting to jo
  sq_AL: yesstr: setting to po
  sr_RS@latin: nostr: setting to ne
  sr_RS@latin: yesstr: setting to da
  sr_RS: nostr: setting to не
  sr_RS: yesstr: setting to да
  sv_SE: nostr: setting to nej
  sv_SE: yesstr: setting to ja
  sw_KE: nostr: setting to Hapana
  sw_KE: yesstr: setting to Ndiyo
  yue_HK: nostr: setting to 唔係
  yue_HK: yesstr: setting to 係
  zu_ZA: nostr: setting to cha
  zu_ZA: yesstr: setting to yebo
2016-04-23 03:59:08 -04:00
Mike Frysinger
f982160ead localedata: LC_MESSAGES.{yes,no}expr: add +1/-0 to all regexes [BZ #15263]
A handful of regexes were allowing +1 for yesexpr and -0 for noexpr,
and it's the i18n definition.  Standardize all locales by allowing
these language-independent values in them.

Example change for en_US goes from ^[yY] to ^[+1yY], and from ^[nN]
to ^[-0nN].
2016-04-23 03:41:56 -04:00
Mike Frysinger
0a410e76f5 localedata: CLDRv29: update LC_TIME week/first_week,workday fields
I've spot checked a number of these, including some that were def
wrong (like ff_SN).  It also fixes all open week-related bugs.

Since ff_SN is the only one that changes its base date, I also made
sure that its ordering of day translations were correct.  Looks like
another case Petr brought up where the week field was not actually
checked against the day arrays.

I also took the opportunity to drop first_weekday/first_workday when
the value aligned with the defaults (1 & 2 respectively).  This didn't
impact too many locales In practice because the majority omitted them
already.

A few locales were defining some values incorrectly for their region:
  ak_GH: week: changing [7, 19971130, 7] to [7, 19971130, 1]
  ak_GH: first_weekday: changing 1 to 2
  ayc_PE: week: changing [7, 19971130, 7] to [7, 19971130, 1]
  bem_ZM: week: changing [7, 19971130, 4] to [7, 19971130, 1]
  bem_ZM: first_weekday: changing 1 to 2
  en_IE: first_weekday: changing 2 to 1
  en_US: week: changing [7, 19971130, 7] to [7, 19971130, 1]
  es_CO: first_weekday: changing 2 to 1
  es_ES: week: changing [7, 19971130, 5] to [7, 19971130, 4]
  ff_SN: week: changing [7, 19971129, 1] to [7, 19971130, 1]
  ff_SN: first_weekday: changing 1 to 2
  ga_IE: first_weekday: changing 2 to 1
  ht_HT: week: changing [7, 19971130, 7] to [7, 19971130, 1]
  ht_HT: first_weekday: changing 1 to 2
  mk_MK: week: changing [7, 19971130, 4] to [7, 19971130, 1]
  mt_MT: first_weekday: changing 2 to 1
  quz_PE: week: changing [7, 19971130, 7] to [7, 19971130, 1]
  sr_ME: week: changing [7, 19971130, 4] to [7, 19971130, 1]
  sr_RS: week: changing [7, 19971130, 4] to [7, 19971130, 1]
  sr_RS@latin: week: changing [7, 19971130, 4] to [7, 19971130, 1]
  sw_KE: week: changing [7, 19971130, 4] to [7, 19971130, 1]
  sw_KE: first_weekday: changing 2 to 1
  uk_UA: week: changing [7, 19971130, 4] to [7, 19971130, 1]
  unm_US: week: changing [7, 19971130, 4] to [7, 19971130, 1]

Some locales were copying locales that had the wrong week settings, so
that content had to be duplicated so the values could be adjusted:
  el_CY: week: setting to [7, 19971130, 1]
  en_AG: week: setting to [7, 19971130, 1]
  en_AG: first_weekday: changing 2 to 1
  en_ZM: week: setting to [7, 19971130, 1]
  es_CU: week: setting to [7, 19971130, 1]
  nl_AW: week: setting to [7, 19971130, 1]
  sw_TZ: first_weekday: setting to 2
  ta_LK: first_weekday: setting to 2

The majority of locales were omitting the week field thus getting the
default [7, 19971130, 0 (localedef) / 7 (ISO standard)].  Unfortunately,
neither of those are used by any locales, so we end up having to define
the field just to se the ndays field.  In practice, this rarely matters
due to it usage, and the first two fields match the defaults.
  aa_DJ: setting to [7, 19971130, 1]
  aa_ER: setting to [7, 19971130, 1]
  aa_ER@saaho: setting to [7, 19971130, 1]
  aa_ET: setting to [7, 19971130, 1]
  af_ZA: setting to [7, 19971130, 1]
  am_ET: setting to [7, 19971130, 1]
  an_ES: setting to [7, 19971130, 4]
  anp_IN: setting to [7, 19971130, 1]
  ar_AE: setting to [7, 19971130, 1]
  ar_BH: setting to [7, 19971130, 1]
  ar_DZ: setting to [7, 19971130, 1]
  ar_EG: setting to [7, 19971130, 1]
  ar_IN: setting to [7, 19971130, 1]
  ar_IQ: setting to [7, 19971130, 1]
  ar_JO: setting to [7, 19971130, 1]
  ar_KW: setting to [7, 19971130, 1]
  ar_LB: setting to [7, 19971130, 1]
  ar_LY: setting to [7, 19971130, 1]
  ar_MA: setting to [7, 19971130, 1]
  ar_OM: setting to [7, 19971130, 1]
  ar_QA: setting to [7, 19971130, 1]
  ar_SA: setting to [7, 19971130, 1]
  ar_SD: setting to [7, 19971130, 1]
  ar_SS: setting to [7, 19971130, 1]
  ar_SY: setting to [7, 19971130, 1]
  ar_TN: setting to [7, 19971130, 1]
  ar_YE: setting to [7, 19971130, 1]
  as_IN: setting to [7, 19971130, 1]
  ast_ES: setting to [7, 19971130, 4]
  az_AZ: setting to [7, 19971130, 1]
  be_BY: setting to [7, 19971130, 1]
  be_BY@latin: setting to [7, 19971130, 1]
  ber_DZ: setting to [7, 19971130, 1]
  ber_MA: setting to [7, 19971130, 1]
  bg_BG: setting to [7, 19971130, 4]
  bhb_IN: setting to [7, 19971130, 1]
  bho_IN: setting to [7, 19971130, 1]
  bn_BD: setting to [7, 19971130, 1]
  bn_IN: setting to [7, 19971130, 1]
  bo_CN: setting to [7, 19971130, 1]
  br_FR: setting to [7, 19971130, 4]
  brx_IN: setting to [7, 19971130, 1]
  bs_BA: setting to [7, 19971130, 1]
  byn_ER: setting to [7, 19971130, 1]
  ca_AD: setting to [7, 19971130, 4]
  ca_ES: setting to [7, 19971130, 4]
  ca_ES@euro: setting to [7, 19971130, 4]
  ca_FR: setting to [7, 19971130, 4]
  ca_IT: setting to [7, 19971130, 4]
  ce_RU: setting to [7, 19971130, 1]
  cmn_TW: setting to [7, 19971130, 1]
  crh_UA: setting to [7, 19971130, 1]
  cv_RU: setting to [7, 19971130, 1]
  cy_GB: setting to [7, 19971130, 4]
  de_BE: setting to [7, 19971130, 4]
  de_LU: setting to [7, 19971130, 4]
  doi_IN: setting to [7, 19971130, 1]
  dv_MV: setting to [7, 19971130, 1]
  dz_BT: setting to [7, 19971130, 1]
  el_GR: setting to [7, 19971130, 4]
  el_GR@euro: setting to [7, 19971130, 4]
  en_AU: setting to [7, 19971130, 1]
  en_BW: setting to [7, 19971130, 1]
  en_CA: setting to [7, 19971130, 1]
  en_HK: setting to [7, 19971130, 1]
  en_IE: setting to [7, 19971130, 4]
  en_IN: setting to [7, 19971130, 1]
  en_NG: setting to [7, 19971130, 1]
  en_NZ: setting to [7, 19971130, 1]
  en_PH: setting to [7, 19971130, 1]
  en_SG: setting to [7, 19971130, 1]
  en_ZA: setting to [7, 19971130, 1]
  en_ZW: setting to [7, 19971130, 1]
  es_AR: setting to [7, 19971130, 1]
  es_BO: setting to [7, 19971130, 1]
  es_CL: setting to [7, 19971130, 1]
  es_CO: setting to [7, 19971130, 1]
  es_CR: setting to [7, 19971130, 1]
  es_DO: setting to [7, 19971130, 1]
  es_EC: setting to [7, 19971130, 1]
  es_ES@euro: setting to [7, 19971130, 4]
  es_GT: setting to [7, 19971130, 1]
  es_HN: setting to [7, 19971130, 1]
  es_MX: setting to [7, 19971130, 1]
  es_NI: setting to [7, 19971130, 1]
  es_PA: setting to [7, 19971130, 1]
  es_PE: setting to [7, 19971130, 1]
  es_PR: setting to [7, 19971130, 1]
  es_PY: setting to [7, 19971130, 1]
  es_SV: setting to [7, 19971130, 1]
  es_US: setting to [7, 19971130, 1]
  es_UY: setting to [7, 19971130, 1]
  es_VE: setting to [7, 19971130, 1]
  eu_ES: setting to [7, 19971130, 4]
  fa_IR: setting to [7, 19971130, 1]
  fil_PH: setting to [7, 19971130, 1]
  fo_FO: setting to [7, 19971130, 4]
  fr_CA: setting to [7, 19971130, 1]
  fr_CH: setting to [7, 19971130, 4]
  fr_LU: setting to [7, 19971130, 4]
  fy_NL: setting to [7, 19971130, 4]
  ga_IE: setting to [7, 19971130, 4]
  gd_GB: setting to [7, 19971130, 4]
  gez_ER: setting to [7, 19971130, 1]
  gez_ET: setting to [7, 19971130, 1]
  gl_ES: setting to [7, 19971130, 4]
  gu_IN: setting to [7, 19971130, 1]
  gv_GB: setting to [7, 19971130, 4]
  hak_TW: setting to [7, 19971130, 1]
  ha_NG: setting to [7, 19971130, 1]
  he_IL: setting to [7, 19971130, 1]
  hi_IN: setting to [7, 19971130, 1]
  hne_IN: setting to [7, 19971130, 1]
  hr_HR: setting to [7, 19971130, 1]
  hy_AM: setting to [7, 19971130, 1]
  id_ID: setting to [7, 19971130, 1]
  ig_NG: setting to [7, 19971130, 1]
  ik_CA: setting to [7, 19971130, 1]
  is_IS: setting to [7, 19971130, 4]
  it_CH: setting to [7, 19971130, 4]
  it_IT: setting to [7, 19971130, 4]
  it_IT@euro: setting to [7, 19971130, 4]
  iu_CA: setting to [7, 19971130, 1]
  ja_JP: setting to [7, 19971130, 1]
  ka_GE: setting to [7, 19971130, 1]
  kk_KZ: setting to [7, 19971130, 1]
  kl_GL: setting to [7, 19971130, 1]
  km_KH: setting to [7, 19971130, 1]
  kn_IN: setting to [7, 19971130, 1]
  kok_IN: setting to [7, 19971130, 1]
  ko_KR: setting to [7, 19971130, 1]
  ks_IN: setting to [7, 19971130, 1]
  ks_IN@devanagari: setting to [7, 19971130, 1]
  ku_TR: setting to [7, 19971130, 1]
  kw_GB: setting to [7, 19971130, 4]
  ky_KG: setting to [7, 19971130, 1]
  lg_UG: setting to [7, 19971130, 1]
  lij_IT: setting to [7, 19971130, 4]
  lo_LA: setting to [7, 19971130, 1]
  lt_LT: setting to [7, 19971130, 4]
  lv_LV: setting to [7, 19971130, 1]
  lzh_TW: setting to [7, 19971130, 1]
  mag_IN: setting to [7, 19971130, 1]
  mai_IN: setting to [7, 19971130, 1]
  mg_MG: setting to [7, 19971130, 1]
  mhr_RU: setting to [7, 19971130, 1]
  mi_NZ: setting to [7, 19971130, 1]
  ml_IN: setting to [7, 19971130, 1]
  mni_IN: setting to [7, 19971130, 1]
  mn_MN: setting to [7, 19971130, 1]
  mr_IN: setting to [7, 19971130, 1]
  ms_MY: setting to [7, 19971130, 1]
  mt_MT: setting to [7, 19971130, 1]
  my_MM: setting to [7, 19971130, 1]
  nan_TW: setting to [7, 19971130, 1]
  nan_TW@latin: setting to [7, 19971130, 1]
  ne_NP: setting to [7, 19971130, 1]
  nhn_MX: setting to [7, 19971130, 1]
  niu_NU: setting to [7, 19971130, 1]
  niu_NZ: setting to [7, 19971130, 1]
  nl_BE: setting to [7, 19971130, 4]
  nl_BE@euro: setting to [7, 19971130, 4]
  nr_ZA: setting to [7, 19971130, 1]
  nso_ZA: setting to [7, 19971130, 1]
  oc_FR: setting to [7, 19971130, 4]
  om_ET: setting to [7, 19971130, 1]
  om_KE: setting to [7, 19971130, 1]
  or_IN: setting to [7, 19971130, 1]
  os_RU: setting to [7, 19971130, 1]
  pa_IN: setting to [7, 19971130, 1]
  pap_AW: setting to [7, 19971130, 1]
  pap_CW: setting to [7, 19971130, 1]
  pa_PK: setting to [7, 19971130, 1]
  ps_AF: setting to [7, 19971130, 1]
  pt_BR: setting to [7, 19971130, 1]
  pt_PT: setting to [7, 19971130, 4]
  pt_PT@euro: setting to [7, 19971130, 4]
  raj_IN: setting to [7, 19971130, 1]
  ro_RO: setting to [7, 19971130, 1]
  ru_RU: setting to [7, 19971130, 1]
  ru_UA: setting to [7, 19971130, 1]
  rw_RW: setting to [7, 19971130, 1]
  sa_IN: setting to [7, 19971130, 1]
  sat_IN: setting to [7, 19971130, 1]
  sd_IN: setting to [7, 19971130, 1]
  sd_IN@devanagari: setting to [7, 19971130, 1]
  se_NO: setting to [7, 19971130, 4]
  shs_CA: setting to [7, 19971130, 1]
  sid_ET: setting to [7, 19971130, 1]
  si_LK: setting to [7, 19971130, 1]
  sl_SI: setting to [7, 19971130, 1]
  so_DJ: setting to [7, 19971130, 1]
  so_ET: setting to [7, 19971130, 1]
  so_KE: setting to [7, 19971130, 1]
  so_SO: setting to [7, 19971130, 1]
  sq_AL: setting to [7, 19971130, 1]
  ss_ZA: setting to [7, 19971130, 1]
  st_ZA: setting to [7, 19971130, 1]
  sv_FI: setting to [7, 19971130, 4]
  sv_SE: setting to [7, 19971130, 4]
  ta_IN: setting to [7, 19971130, 1]
  tcy_IN: setting to [7, 19971130, 1]
  te_IN: setting to [7, 19971130, 1]
  tg_TJ: setting to [7, 19971130, 1]
  the_NP: setting to [7, 19971130, 1]
  th_TH: setting to [7, 19971130, 1]
  ti_ER: setting to [7, 19971130, 1]
  ti_ET: setting to [7, 19971130, 1]
  tig_ER: setting to [7, 19971130, 1]
  tk_TM: setting to [7, 19971130, 1]
  tl_PH: setting to [7, 19971130, 1]
  tn_ZA: setting to [7, 19971130, 1]
  tr_CY: setting to [7, 19971130, 1]
  tr_TR: setting to [7, 19971130, 1]
  ts_ZA: setting to [7, 19971130, 1]
  tt_RU: setting to [7, 19971130, 1]
  tt_RU@iqtelif: setting to [7, 19971130, 1]
  ug_CN: setting to [7, 19971130, 1]
  ur_IN: setting to [7, 19971130, 1]
  ur_PK: setting to [7, 19971130, 1]
  uz_UZ: setting to [7, 19971130, 1]
  uz_UZ@cyrillic: setting to [7, 19971130, 1]
  ve_ZA: setting to [7, 19971130, 1]
  vi_VN: setting to [7, 19971130, 1]
  wa_BE: setting to [7, 19971130, 4]
  wal_ET: setting to [7, 19971130, 1]
  wo_SN: setting to [7, 19971130, 1]
  xh_ZA: setting to [7, 19971130, 1]
  yi_US: setting to [7, 19971130, 1]
  yo_NG: setting to [7, 19971130, 1]
  yue_HK: setting to [7, 19971130, 1]
  zh_CN: setting to [7, 19971130, 1]
  zh_HK: setting to [7, 19971130, 1]
  zh_SG: setting to [7, 19971130, 1]
  zh_TW: setting to [7, 19971130, 1]
  zu_ZA: setting to [7, 19971130, 1]

Finally, set first_weekday in all the locales that were omitting it
and wanted something other than the default of 1.
  aa_DJ: setting to 7
  aa_ER: setting to 2
  aa_ER@saaho: setting to 2
  ar_AE: setting to 7
  ar_BH: setting to 7
  ar_DZ: setting to 7
  ar_EG: setting to 7
  ar_IQ: setting to 7
  ar_JO: setting to 7
  ar_KW: setting to 7
  ar_LB: setting to 2
  ar_LY: setting to 7
  ar_MA: setting to 7
  ar_OM: setting to 7
  ar_QA: setting to 7
  ar_SD: setting to 7
  ar_SS: setting to 2
  ar_SY: setting to 7
  az_AZ: setting to 2
  be_BY: setting to 2
  be_BY@latin: setting to 2
  ber_DZ: setting to 7
  ber_MA: setting to 7
  bn_BD: setting to 6
  bs_BA: setting to 2
  byn_ER: setting to 2
  dv_MV: setting to 6
  en_NG: setting to 2
  es_BO: setting to 2
  es_CL: setting to 2
  es_EC: setting to 2
  es_UY: setting to 2
  fo_FO: setting to 2
  fr_CH: setting to 2
  gd_GB: setting to 2
  gez_ER: setting to 2
  ha_NG: setting to 2
  hr_HR: setting to 2
  hy_AM: setting to 2
  ig_NG: setting to 2
  is_IS: setting to 2
  it_CH: setting to 2
  ka_GE: setting to 2
  kk_KZ: setting to 2
  kl_GL: setting to 2
  ku_TR: setting to 2
  ky_KG: setting to 2
  lg_UG: setting to 2
  mg_MG: setting to 2
  mn_MN: setting to 2
  ms_MY: setting to 2
  niu_NU: setting to 2
  pap_AW: setting to 2
  pap_CW: setting to 2
  pt_PT: setting to 2
  pt_PT@euro: setting to 2
  rw_RW: setting to 2
  se_NO: setting to 2
  si_LK: setting to 2
  so_DJ: setting to 7
  so_SO: setting to 2
  sq_AL: setting to 2
  tg_TJ: setting to 2
  ti_ER: setting to 2
  tig_ER: setting to 2
  tk_TM: setting to 2
  tt_RU: setting to 2
  tt_RU@iqtelif: setting to 2
  uz_UZ: setting to 2
  uz_UZ@cyrillic: setting to 2
  vi_VN: setting to 2
  wo_SN: setting to 2
  yo_NG: setting to 2
2016-04-23 03:01:45 -04:00
Mike Frysinger
1a06eee869 localedata: LC_IDENTIFICATION.category: set to ISO 30112 2014 standard
The ISO 30112 standard defines the valid values for the category
keyword as only a few options:
	posix:1993
	i18n:2004
	i18n:2012

The vast majority of locales had changed the "i18n" string to the
name of its own locale (e.g. "ak_GH:2013") as well as tweaking the
date (presumably thinking it should be the date of submission).

Convert all of them to "i18n:2012" for consistency.  A follow up
change will update localedef to actually check/validate the field.
2016-04-15 12:37:38 -04:00
Mike Frysinger
0e8120ae53 localedata: fix LC_ADDRESS.country_car entries
These entries have been checked mostly against Wikipedia, but also using
the sources it cites (like the UN and other treaty sources).

Fix incorrect values:
  en_BW:  changing RB to BW
  kl_GL:  changing GRO to KN
  km_KH:  changing LAO to KH
  my_MM:  changing BA to MYA
  oc_FR:  changing F to F
  tr_CY:  changing TR to CY
  wae_CH: changing DH to CH

Add missing entries:
  aa_DJ:  changing to DJI
  ak_GH:  changing to GH
  ar_OM:  changing to OM
  ar_SS:  changing to SUD
  ar_YE:  changing to YAR
  bo_CN:  changing to CHN
  cmn_TW: changing to RC
  dv_MV:  changing to MV
  dz_BT:  changing to BHT
  en_AG:  changing to AG
  es_HN:  changing to HN
  es_PR:  changing to PR
  hak_TW: changing to RC
  lzh_TW: changing to RC
  nan_TW: changing to RC
  nan_TW@latin: changing to RC
  nl_AW:  changing to AUA
  pap_AW: changing to AUA
  so_DJ:  changing to DJI
  the_NP: changing to NEP
  ug_CN:  changing to CHN
  yue_HK: changing to HK
  zh_CN:  changing to CHN
  zh_HK:  changing to HK
  zh_TW:  changing to RC
2016-04-15 00:41:22 -04:00
Mike Frysinger
76c3498180 localedata: CLDRv29: update LC_ADDRESS.country_num values
This updates a few locales based on CLDR v29 data.

Add missing fields:
  as_IN: changing to 356
  dv_MV: changing to 462
  kk_KZ: changing to 398
  my_MM: changing to 104
  rw_RW: changing to 646
  tt_RU: changing to 643

Update ones that are wrong:
  dz_BT: changing BHU to 064
  en_PH: changing 360 to 608
  km_KH: changing 418 to 116
  ky_KG: changing 643 to 417
  tr_CY: changing 792 to 196
  wo_SN: changing 450 to 686

As a result of fixing these, I had to update country_ab[23]:
  dz_BT: changing BHU to BTN
  en_PH: changing ID/IDN to PH/PHL
  km_KH: changing LA/LAO to KH/KHM
  ky_KG: changing KY/KYR to KG/KGZ
  tr_CY: changing TR/TUR to CY/CYP
  wo_SN: changing MG/MDG to SN/SEN

Pad with leading zeros to match the standard and other locales:
  ber_DZ: changing 12 to 012
  ca_AD: changing 20 to 020
  en_AG: changing 28 to 028
  hy_AM: changing 51 to 051
  li_BE: changing 56 to 056
  wa_BE: changing 56 to 056

I hand checked the first two sets against ISO 3166-1 directly.
2016-04-15 00:27:11 -04:00
Mike Frysinger
fe1f87ac12 localedata: LC_PAPER: use copy directives everywhere
There are only two page sizes that locales use: US-Letter and A4.
For the former, move to copying the en_US locale, while for the
latter, move to copying the i18n locale.  This lets us clean up
all the stray comments like FIXME.

There should be no functional differences here.
2016-04-15 00:15:14 -04:00
Mike Frysinger
1c20cb2098 localedata: LC_MEASUREMENT: use copy directives everywhere
There are only two measurement systems that locales use: US and metric.
For the former, move to copying the en_US locale, while for the latter,
move to copying the i18n locale.  This lets us clean up all the stray
comments like FIXME.

There should be no functional differences here.
2016-04-12 16:47:52 -04:00
Mike Frysinger
a4cea54b12 localedata: standardize copyright/license information [BZ #11213]
Use the language from the FSF in all locale files to disclaim any
license/copyright on locale data.

See https://sourceware.org/ml/libc-locales/2013-q1/msg00048.html
2016-03-21 02:29:56 -04:00
Mike Frysinger
6bc81cf205 localedata: standardize first few lines
Purely a style touchup to make sure the headers all look the same.
2016-03-21 02:00:09 -04:00
Mike Frysinger
d3362b1e3c localedata: CLDRv28: update LC_TELEPHONE.int_prefix
This updates a bunch of locales based on CLDR v28 data:
  ar_SS: int_prefix: changing 249 to 211
  bn_BD: int_prefix: changing 88 to 880
  dz_BT: int_prefix: changing 66 to 975
  en_HK: int_prefix: changing  to 852
  en_PH: int_prefix: changing  to 63
  en_SG: int_prefix: changing  to 65
  es_DO: int_prefix: changing 1809 to 1
  es_PA: int_prefix: changing 502 to 507
  es_PR: int_prefix: changing 1787 to 1
  km_KH: int_prefix: changing 856 to 855
  mt_MT: int_prefix: changing  to 356
  ne_NP: int_prefix: changing 91 to 977
  pap_AW: int_prefix: changing 599 to 297
  the_NP: int_prefix: changing 91 to 977
  tk_TM: int_prefix: changing  to 993
  uz_UZ: int_prefix: changing 27 to 998
  zh_SG: int_prefix: changing  to 65

I've also checked these against https://countrycode.org/.

Note: the Dominican Republic (DO) and Puerto Rico (PR) updates are
correct: they both use +1.  Historically, DO had one area code of
809 and PR of 787 which is why they were listed as such, but they
have both expanded into 829 and 989 respectively, so using the four
digit value is def incorrect now.
2016-02-19 12:46:14 -05:00
Marko Myllynen
441c3b59d1 Fix lang_lib/lang_term as per ISO 639-2 [BZ #16973]
lang_lib (which reflects ISO 639-2/B (bibliographic) codes) and
lang_term (which reflects ISO 639-2/T (terminology) codes) should be
identical except for those languages for which ISO 639-2 specifies
separate bibliographic/terminology values.

I used this Library of Congress page as the source:
	http://www.loc.gov/standards/iso639-2/php/code_list.php
2015-08-18 10:15:04 -04:00
Marko Myllynen
42eaa27fac localedata: remove timezone information [BZ #18525]
as discussed in the thread starting at

https://sourceware.org/ml/libc-alpha/2015-06/msg00098.html

it looks like the best options is to remove locale timezone information
from locales which currently provide it (in incomplete or incorrect
fashion) rather than to start duplicating tzdata info in glibc.
2015-08-05 05:02:18 -04:00
Chris Leonard
8f18faed2e [BZ #16103] LC_MEASUREMENT review and standardization 2013-11-14 17:19:35 -05:00
Chris Leonard
9f9082ffa0 Correct L10n of country-name and lang-name in km_KH
[BZ #13952]
	* locales/km_KH: Correct country-name and lang-name.
2012-11-15 19:37:48 +01:00
Ulrich Drepper
c10d32c834 * locale/programs/localedef.c (add_to_readlist): Rename local
variables to avoid confusion.

	* locale/programs/charmap.c (charmap_read): Emit error message if
	charmap couldn't be found or read.
2006-07-30 22:24:49 +00:00
Ulrich Drepper
2cf8a390dd [BZ #714]
2005-09-25  Ulrich Drepper  <drepper@redhat.com>
	[BZ #714]
	* iconv/loop.c [!STORE_REST] (SINGLE): Correctly record number of
	left-over bytes and store them correctly.
	* iconvdata/tst-iconv6.c: New file.
	* iconvdata/Makefile (tests): Add tst-iconv6.
2005-09-25 17:12:25 +00:00
Ulrich Drepper
a334319f65 (CFLAGS-tst-align.c): Add -mpreferred-stack-boundary=4. 2004-12-22 20:10:10 +00:00
Jakub Jelinek
0ecb606cb6 2.5-18.1 2007-07-12 18:26:36 +00:00