1335 Commits

Author SHA1 Message Date
Rafal Luzynski
95b38bbb03 localedata: fur_IT: Fix spelling of Wednesday (Miercus)
* localedata/locales/fur_IT (day, abday): reworded "Miarcus" to
  "Miercus" and abbreviated "Mia" to "Mie".
2017-06-22 17:44:03 +02:00
Mike FABIAN
925fac7793 Bug 21533: Update to Unicode 10.0.0
* Unicode 10.0.0 Support: Character encoding, character type info, and
  transliteration tables are all updated to Unicode 10.0.0, using
  generator scripts contributed by Mike FABIAN (Red Hat).
2017-06-22 17:02:55 +02:00
Zack Weinberg
af85385f31 Use locale_t, not __locale_t, throughout glibc
<locale.h> is specified to define locale_t in POSIX.1-2008, and so are
all of the headers that define functions that take locale_t arguments.
Under _GNU_SOURCE, the additional headers that define such functions
have also always defined locale_t.  Therefore, there is no need to use
__locale_t in public function prototypes, nor in any internal code.

	* ctype/ctype-c99_l.c, ctype/ctype.h, ctype/ctype_l.c
	* include/monetary.h, include/stdlib.h, include/time.h
	* include/wchar.h, locale/duplocale.c, locale/freelocale.c
	* locale/global-locale.c, locale/langinfo.h, locale/locale.h
	* locale/localeinfo.h, locale/newlocale.c
	* locale/nl_langinfo_l.c, locale/uselocale.c
	* localedata/bug-usesetlocale.c, localedata/tst-xlocale2.c
	* stdio-common/vfscanf.c, stdlib/monetary.h, stdlib/stdlib.h
	* stdlib/strfmon_l.c, stdlib/strtod_l.c, stdlib/strtof_l.c
	* stdlib/strtol.c, stdlib/strtol_l.c, stdlib/strtold_l.c
	* stdlib/strtoll_l.c, stdlib/strtoul_l.c, stdlib/strtoull_l.c
	* string/strcasecmp.c, string/strcoll_l.c, string/string.h
	* string/strings.h, string/strncase.c, string/strxfrm_l.c
	* sysdeps/ieee754/float128/strtof128_l.c
	* sysdeps/ieee754/float128/wcstof128.c
	* sysdeps/ieee754/float128/wcstof128_l.c
	* sysdeps/ieee754/ldbl-128ibm/strtold_l.c
	* sysdeps/ieee754/ldbl-64-128/strtold_l.c
	* sysdeps/ieee754/ldbl-opt/nldbl-compat.c
	* sysdeps/ieee754/ldbl-opt/nldbl-strfmon_l.c
	* sysdeps/ieee754/ldbl-opt/nldbl-strtold_l.c
	* sysdeps/ieee754/ldbl-opt/nldbl-wcstold_l.c
	* sysdeps/powerpc/powerpc32/power7/strcasecmp.S
	* sysdeps/powerpc/powerpc64/power7/strcasecmp.S
	* sysdeps/x86_64/strcasecmp_l-nonascii.c
	* sysdeps/x86_64/strncase_l-nonascii.c, time/strftime_l.c
	* time/strptime_l.c, time/time.h, wcsmbs/mbsrtowcs_l.c
	* wcsmbs/wchar.h, wcsmbs/wcscasecmp.c, wcsmbs/wcsncase.c
	* wcsmbs/wcstod.c, wcsmbs/wcstod_l.c, wcsmbs/wcstof.c
	* wcsmbs/wcstof_l.c, wcsmbs/wcstol_l.c, wcsmbs/wcstold.c
	* wcsmbs/wcstold_l.c, wcsmbs/wcstoll_l.c, wcsmbs/wcstoul_l.c
	* wcsmbs/wcstoull_l.c, wctype/iswctype_l.c
	* wctype/towctrans_l.c, wctype/wcfuncs_l.c
	* wctype/wctrans_l.c, wctype/wctype.h, wctype/wctype_l.c:
	Change all uses of __locale_t to locale_t.
2017-06-20 20:30:06 -04:00
Rafal Luzynski
70ec56e9f2 localedata: ce_RU: update weekdays from CLDR [BZ #21207]
[BZ #21207]
        * locales/ce_RU (day): Updated (imported) from CLDR.  Uppercase letters
        left unchanged.
        * locales/ce_RU (abday): Minor updates to match (day): Latin uppercase
        "I" replaced with Cyrillic "Ӏ" ("Palochka", Unicode: U04C0).  Trailing
        spaces removed.
2017-06-13 07:20:16 -04:00
Rafal Luzynski
dfe82f3faa localedata: Remove trailing spaces [BZ #20275]
Many locales have strings that end with an unnecessary trailing space.
Remove them.  Also corrects some notational errors in passing.

        * localedata/locales/mag_IN (abday, day): Consistently use
	uppercase hexadecimal in <Uxxxx> notation.
        * localedata/locales/so_DJ (mon): Use "<U0020>" instead of " ".
        * localedata/locales/so_ET (mon): Likewise.
        * localedata/locales/so_KE (mon): Likewise.
        * localedata/locales/so_SO (mon): Likewise.

	[BZ #20275]
	* localedata/locales/anp_IN (abday, day): Remove trailing spaces.
        * localedata/locales/bho_IN (abday, day): Likewise.
        * localedata/locales/doi_IN (abday, day): Likewise.
        * localedata/locales/hi_IN (day, abday): Likewise.
        * localedata/locales/hne_IN (abday, day): Likewise.
        * localedata/locales/mag_IN (abday, day): Likewise.
        * localedata/locales/ne_NP (abday, day): Likewise.
        * localedata/locales/pa_IN (abday, day): Likewise.
        * localedata/locales/raj_IN (abday, day): Likewise.
        * localedata/locales/the_NP (abday, day): Likewise.
        * localedata/locales/sq_AL (abday, day): Likewise.
        * localedata/locales/ln_CD (name_ms): Likewise.
        * localedata/locales/quz_PE (lang_name): Likewise.
        * localedata/locales/ta_IN (name_mr, name_mrs, name_miss): Likewise.
        * localedata/locales/te_IN (name_mr, name_mrs, name_miss): Likewise.

        * localedata/locales/ht_HT (abday, day): Remove trailing space
        after "len" and "lendi".

        * localedata/locales/mt_MT (mon): Remove trailing space after
        "Diċembru".
2017-06-13 07:16:00 -04:00
Santhosh Thottingal
b05eca0e1d Correct collation rules for Malayalam.
[BZ #19922]
	* locales/iso14651_t1_common: Add collation rules for U+07DA to U+07DF.

	[BZ #19919]
	* locales/iso14651_t1_common: Correct collation of U+0D36 and U+0D37.
2017-06-11 10:08:37 -04:00
Carlos O'Donell
8ba5c9b8d6 Bug 20686: Add el_GR@euro support.
Although el_GR uses ISO-8859-7:2003, which contains the euro
symbol, it is not possible to know this a priori when selecting the el_GR
locale. Therefore you cannot know whether el_GR includes the 2003
amendments that add the euro symbol. This is resolved by creating
an el_GR@euro locale similar to all the other @euro locales for non-UTF8
charsets.
2017-05-03 15:37:04 -04:00
Christopher Chittleborough
3e2cf872a5 Bug 21399: Fix CP1254 comment for U+00EC
2017-04-19 08:10:35 -04:00
Egmont Koblinger
ea1898dded localedata: hu_HU: fix multiple sorting bugs (bug 18934)
Fix the incorrect sorting order of a digraph and its geminated variant,
regression introduced by a faulty fix to bug 13547 in commit
b008d4c856.

Fix two inconsistencies in sorting unusual capitalization of digraphs
(bug #18587).

Enable DIACRIT_FORWARD to work around bug #17750.

Sort foreign accents after the Hungarian ones.

Add extensive unit tests containing all the examples from The Rules of
Hungarian Orthography and many more, including explanatory comments.
2017-03-28 10:35:58 -04:00
Mike FABIAN
0b38d66a4e Bug 20313: Update to Unicode 9.0.0
* Unicode 9.0.0 Support: Character encoding, character type info, and
  transliteration tables are all updated to Unicode 9.0.0, using
  generator scripts contributed by Mike FABIAN (Red Hat).
2017-02-21 06:30:38 -05:00
Joseph Myers
bfff8b1bec Update copyright dates with scripts/update-copyrights.
2017-01-01 00:14:16 +00:00
Andreas Schwab
c0ff3befa9 Revert "Fix ChangeLog typo"
This reverts commit d2d43afa11.
2016-12-31 08:53:42 +01:00
Andreas Schwab
d2d43afa11 Fix ChangeLog typo
2016-12-30 10:34:27 +01:00
Mike Frysinger
a035eb6928 localedata: bs_BA: fix yesexpr/noexpr [BZ #20974]
Both regexes end with a "*." which means the previous match can be
omitted, and then the . allows them to match any input at all.

This means tools like coreutils' `rm -i` will always delete things
when prompted because the yesexpr regex matches all inputs (even
the negative ones).
2016-12-30 01:49:53 -05:00
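The failure mode described in the bs_BA yesexpr/noexpr fix above (a6 "*." tail that makes the expression match anything) can be reproduced in a few lines. This is a minimal sketch: the patterns are hypothetical and only share the shape of the bug, since the commit message does not quote the actual bs_BA expressions, and Python's `re` module stands in for the POSIX regex matching that tools such as `rm -i` rely on.

```python
import re

# Hypothetical patterns with the same shape as the bug described above;
# the real bs_BA expressions are not quoted in the commit message.
BROKEN_YESEXPR = r"^[dDyY]*."  # "*" allows zero matches, then "." accepts any character
FIXED_YESEXPR = r"^[dDyY]"     # the first character must be an affirmative letter


def is_yes(pattern: str, answer: str) -> bool:
    """Return True if the answer matches the given yes-expression."""
    return re.search(pattern, answer) is not None


for answer in ("da", "yes", "ne", "no"):
    print(answer, is_yes(BROKEN_YESEXPR, answer), is_yes(FIXED_YESEXPR, answer))
```

With the broken shape every non-empty answer counts as "yes", which is why a negative reply would still confirm a deletion prompt.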
Mike Frysinger
aa4d00ca39 localedata: GBK: add mapping for 0x80->Euro sign [BZ #20864]
Microsoft long ago added a mapping for 0x80 to the Euro sign to their
CP936.  While GBK 1.0 doesn't include this mapping, it is compatible,
and Microsoft and glibc alias the two codepages.  We could split them
apart so GBK wouldn't include the mapping, but that seems like a lot
of work for little gain.
2016-11-26 17:20:22 -05:00
Ernestas Kulik
7ed2b54451 localedata: lt_LT: use hyphens in d_fmt [BZ #20497]
The standard currently in effect (LST ISO 8601:1997) mandates the use
of hyphens (as opposed to full stops, currently) in date formats.  It
also matches current CLDR data (v29), Wikipedia's & Wikia's settings,
and Microsoft's Lithuanian Style Guide.
2016-08-24 16:07:02 -04:00
Märt Põder
89b7dd27ea locales: et_EE: locale has wrong {p,n}_cs_precedes value [BZ #20459]
According to "Requirements of information technology in Estonian
language and cultural environment" the monetary symbol should be
written after the amount number:

  https://www.evs.ee/products/evs-8-2008
2016-08-15 13:20:04 +02:00
Martin Pitt
71b67af1ff locales: en_CA: update d_fmt [BZ #9842]
en_CA/LC_TIME currently specifies the date format as "%d/%m/%y".
However, it should be "%Y-%m-%d".  This is the standard date format in
Canada as specified by the Canadian Standards Association in CSA Z234.5:1989,
which adopts the ISO 8601 standard.

Here's the web page from the National Research Council of Canada
citing ISO 8601 as the standard date/time format in Canada:
http://www.nrc-cnrc.gc.ca/eng/services/time/faq/#Q8
 International Standard ISO 8601 specifies numeric representations
 of date and time. The recommended full format is of the form
 2001-12-31 23:59:28.73 UTC. The intent of this standard is to avoid
 confusion in international communications which can arise with the
 many different national notations. This format has the advantage
 that it permits dates to be readily sorted in chronological order
 by computer systems.

Windows 8+ and OS X also switched to this format.
2016-08-10 18:02:27 +08:00
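The sorting advantage quoted in the en_CA d_fmt commit above can be checked directly. The sketch below formats a few arbitrary sample dates (chosen for illustration, not taken from the commit) with the old and new format strings from the message and sorts the results as plain text.

```python
from datetime import date

OLD_D_FMT = "%d/%m/%y"  # en_CA d_fmt before the change
NEW_D_FMT = "%Y-%m-%d"  # ISO 8601 form the commit switches to

# Arbitrary sample dates, used only for illustration.
sample_dates = [date(2001, 12, 31), date(2002, 1, 15), date(1999, 6, 1)]

old_strings = [d.strftime(OLD_D_FMT) for d in sample_dates]
new_strings = [d.strftime(NEW_D_FMT) for d in sample_dates]

# Sorting the ISO strings as text gives chronological order;
# sorting the day-first strings does not.
old_sorted = sorted(old_strings)
iso_sorted = sorted(new_strings)

print(old_sorted)
print(iso_sorted)
```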
Mike Frysinger
65f07b8e1f localedata: change M$ to Microsoft
2016-08-10 00:49:14 +08:00
Aurelien Jarno
d461c9682d localedata: fix de_LI locale
Fix the postal_fmt and country_name entries to continue on the following
line without indentation.

localedata/Changelog:
	* locales/de_LI (postal_fmt): Fix indentation.
	(country_name): Likewise.
2016-07-07 14:49:20 +02:00
Aurelien Jarno
f3ea66b37f New locale de_LI
The Principality of Liechtenstein currently does not have a corresponding
locale. Given the links with Switzerland, the best approach is to base the
locale on the de_CH one (German is the official language) and only change the
country-related categories: LC_ADDRESS and LC_TELEPHONE.

localedata/Changelog:
	* locales/de_LI: New locale.
	* SUPPORTED: Add de_LI.
2016-07-06 18:18:10 +02:00
Gunnar Hjalmarsson
f175e37513 localedata: id_ID: Februari instead of Pebruari [BZ #20316]
2016-07-03 22:24:03 -04:00
Simion Onea
4457bc6350 localedata: ro_RO: update Tuesday translation [BZ #18911]
Enough fonts support ț now that we can change the Tuesday translation
to be what it should rather than use the incorrect ţ.
2016-06-16 01:28:42 -04:00
Paras pradhan
bc24924027 localedata: ne_NP: misc updates [BZ #1170]
This locale was originally copied from ne_IN and it shows: many
fields are incorrect for the Nepal territory, and many fields are
missing translations.  I've vetted most of these against CLDR as
not all fields are covered by it.

LC_TIME
  abday
    tuesday:    मगल -> मङगल
    thursday:   बिहि -> बिही
  day
    tuesday:    मगलबार -> मङगलबार
    thursday:   बिहिबार -> बिहीबार
  abmon:
    january:    जनवरी  -> जन
    february:   फरवरी  -> फब
    april:      अपरल  -> अपरि
    may:        मई    -> म
    july:       जलाई  -> जला
    august:     अगसत  -> अग
    september:  सितमबर -> सपट
    october:    अकटबर -> अकट
    november:   नवमबर  -> नोभ
    december:   दिसमबर -> डिस
  mon:
    february:   फरवरी  -> फबरअरी
    april:      अपरल  -> अपरिल
    may:        मई    -> म
    september:  सितमबर -> सपटमबर
    october:    अकटबर -> अकटोबर
    november:   नवमबर -> नोभमबर
    december:   दिसमबर -> डिसमबर
  d_t_fmt:      %A %d %b %Y %I:%M:%S %p %Z -> %Y %B %d %I:%M:%S %p
  d_fmt:        %A %d %b %Y                -> %Y %B %d %A
  t_fmt:        %I:%M:%S  %Z               -> %H:%M:%S
  t_fmt_ampm:   %I:%M:%S %p %Z             -> %I:%M:%S %p

LC_NAME:
  name_fmt:     %p%t%f%t%g -> %p%t%g%t%m%t%f
  name_gen:     setting to ज्यू
  name_mr:      setting to श्रीमान्
  name_mrs:     setting to श्रीमती
  name_miss:    setting to सुश्री

LC_ADDRESS:
  postal_fmt:   %z%c%T%s%b%e%r -> %f%N%h%s%N%T

LC_TELEPHONE:
  tel_int_fmt:  +%c ;%a ;%l -> +%c %a%t%l
2016-06-11 02:15:09 -04:00
Mike Frysinger
277da2ab88 unicode-gen: include standard comment file header
We deployed this header to all the locale files, so make sure
we include it in the generated ones too so we don't lose it.
2016-06-11 02:10:52 -04:00
Mike Frysinger
686db256f6 localedata: pt_BR/pt_PT: make days/months lowercase [BZ #19133]
2016-06-11 01:30:07 -04:00
Eduardo Trápani
39e050698f localedata: eo: new Esperanto locale [BZ #16190]
2016-06-11 01:24:26 -04:00
Neskie Manuel
b86970392c localedata: chr_US: new Cherokee locale [BZ #12143]
2016-05-07 20:01:55 -04:00
Dimitris Pappas
5301c98776 charmaps: IBM875: fix mapping of iota/upsilon variants [BZ #18453]
The letters ΐ/ϊ and ΰ/ϋ are swapped in the EBCDIC 875 map.
Verified against the original IBM spec.
2016-05-07 19:55:55 -04:00
Arnas Udovičius
c2fc6747ec localedata: sgs_LT: new locale [BZ #12450]
Need to also update the database to include the new code.
2016-05-01 15:18:04 -04:00
Mike Frysinger
045cd3190a localedata: add more translit entries
Some of the newer symbols we're using are missing translit entries, which
causes trouble when generating the locales with older encodings.

tr_TR: ₺ -> "TL"
uz_UZ: ʻ -> "'"
common:
  ֏ -> "AMD"
  ₪ -> "ILS"
  ₱ -> "PHP"
  ₸ -> "KZT"
  ₾ -> "GEL"
2016-04-27 01:09:20 -04:00
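The purpose of the translit entries in the commit above can be illustrated with a small fallback encoder. This is only a sketch of the idea, not glibc's mechanism: the table is copied from the commit message, while the helper name `encode_with_translit` and the choice of target charsets are assumptions made for the example.

```python
# Fallback table copied from the commit message above.
TRANSLIT = {
    "₺": "TL",   # tr_TR
    "ʻ": "'",    # uz_UZ
    "֏": "AMD",  # common
    "₪": "ILS",  # common
    "₱": "PHP",  # common
    "₸": "KZT",  # common
    "₾": "GEL",  # common
}


def encode_with_translit(text: str, charset: str) -> bytes:
    """Encode text, replacing unencodable characters with their fallback.

    Hypothetical helper for illustration. A character with no table
    entry raises KeyError instead of being dropped silently.
    """
    out = []
    for ch in text:
        try:
            out.append(ch.encode(charset))
        except UnicodeEncodeError:
            out.append(TRANSLIT[ch].encode(charset))
    return b"".join(out)


print(encode_with_translit("5 ₺", "iso8859_9"))
print(encode_with_translit("₾", "ascii"))
```

Without a table entry, a legacy charset such as ISO-8859-9 has no way to represent the newer currency signs at all, which is the trouble the commit describes.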
Mike Frysinger
b2cae5d3f5 tst-fmon/tst-numeric: switch malloc to static stack space [BZ #19671]
The current test code doesn't check the return value of malloc.
This should rarely (if ever) cause a problem, but rather than add
some return value checks, just statically allocate the buffer on
the stack.  This will never fail (or if it does, we've got much
bigger problems that don't matter to the test).
2016-04-23 16:14:01 -04:00
Mike Frysinger
4964bb4591 tst-langinfo: update yesexpr/noexpr baselines
2016-04-23 14:14:55 -04:00
Mike Frysinger
00277a3f81 localedata: CLDRv29: update LC_MESSAGES yes/no strings [BZ #15264] [BZ #16975]
The yes/no strings should be based on the dictionary words.  That means
they are capitalized based on the dictionary rather than position in the
sentence (e.g. the first word).
  bo_CN: nostr: changing མེན to མིན།
  bo_CN: yesstr: changing ཨིན to ཡིན།
  dz_BT: nostr: changing མེན to མེན་
  dz_BT: yesstr: changing ཨིན to ཨིན་
  en_CA: yesstr: changing Yes to yes
  en_CA: nostr: changing No to no
  en_US: yesstr: changing Yes to yes
  en_US: nostr: changing No to no
  es_ES: nostr: changing No to no
  es_ES: yesstr: changing Si to sí
  fi_FI: nostr: changing Ei to ei
  fi_FI: yesstr: changing Kyllä to kyllä
  ig_NG: yesstr: changing Ee to Eye
  ko_KR: nostr: changing 아니오 to 아니요
  ky_KG: nostr: changing Жок to жок
  ky_KG: yesstr: changing Ооба to ооба
  ms_MY: nostr: changing Tidak to tidak
  ms_MY: yesstr: changing Ya to ya
  te_IN: nostr: changing కాదు to వద్దు
  te_IN: yesstr: changing అవను to అవును
  ur_PK: nostr: changing نهيں to نہیں
  ur_PK: yesstr: changing بلكل to ہاں
  uz_UZ: nostr: changing Yo'q to yo‘q
  uz_UZ: yesstr: changing Ha to ha
  uz_UZ@cyrillic: nostr: changing Йўқ to йўқ
  uz_UZ@cyrillic: yesstr: changing Ҳа to ҳа
  wae_CH: nostr: changing Nei to nei
  wae_CH: yesstr: changing Ja to ja
  yo_NG: nostr: changing Bẹ́ẹ̀ kọ́ to Bẹ́ẹ̀kọ́
  yo_NG: yesstr: changing Bẹ́ẹ̀ ni to Bẹ́ẹ̀ni

Some of the translations were just wrong.
  el_GR: nostr: changing no to όχι
  el_GR: yesstr: changing yes to ναι
  km_KH: nostr: changing no:NO:n:N to ទេ​៖ n
  km_KH: yesstr: changing yes:YES:y:Y to បាទ/ចាស​៖ y
  ug_CN: nostr: changing No to ياق
  ug_CN: yesstr: changing Yes to ھەئە

Add missing translations for a number of locales:
  af_ZA: nostr: setting to nee
  af_ZA: yesstr: setting to ja
  am_ET: nostr: setting to አይ
  am_ET: yesstr: setting to አዎን
  ast_ES: nostr: setting to non
  ast_ES: yesstr: setting to sí
  be_BY: nostr: setting to не
  be_BY: yesstr: setting to так
  bem_ZM: nostr: setting to Awe
  bem_ZM: yesstr: setting to Ee
  bg_BG: nostr: setting to не
  bg_BG: yesstr: setting to да
  brx_IN: nostr: setting to नहीं
  brx_IN: yesstr: setting to हाँ
  bs_BA: nostr: setting to ne
  bs_BA: yesstr: setting to da
  ca_ES: nostr: setting to no
  ca_ES: yesstr: setting to sí
  da_DK: nostr: setting to nej
  da_DK: yesstr: setting to ja
  de_DE: nostr: setting to nein
  de_DE: yesstr: setting to ja
  en_DK: nostr: setting to yes
  en_DK: yesstr: setting to no
  et_EE: nostr: setting to ei
  et_EE: yesstr: setting to jah
  eu_ES: nostr: setting to ez
  eu_ES: yesstr: setting to bai
  fa_IR: nostr: setting to نه
  fa_IR: yesstr: setting to بله
  ff_SN: nostr: setting to Alaa
  ff_SN: yesstr: setting to Eey
  fo_FO: nostr: setting to nei
  fo_FO: yesstr: setting to já
  fr_BE: nostr: setting to non
  fr_BE: yesstr: setting to oui
  fr_CH: nostr: setting to non
  fr_CH: yesstr: setting to oui
  fr_FR: nostr: setting to non
  fr_FR: yesstr: setting to oui
  fr_LU: nostr: setting to non
  fr_LU: yesstr: setting to oui
  fur_IT: nostr: setting to no
  fur_IT: yesstr: setting to sì
  fy_DE: nostr: setting to nee
  fy_DE: yesstr: setting to ja
  ga_IE: nostr: setting to níl
  ga_IE: yesstr: setting to tá
  gd_GB: nostr: setting to chan eil
  gd_GB: yesstr: setting to tha
  gl_ES: nostr: setting to non
  gl_ES: yesstr: setting to si
  gu_IN: nostr: setting to નહીં
  gu_IN: yesstr: setting to હા
  he_IL: nostr: setting to לא
  he_IL: yesstr: setting to כן
  hi_IN: nostr: setting to नहीं
  hi_IN: yesstr: setting to हाँ
  hr_HR: nostr: setting to ne
  hr_HR: yesstr: setting to da
  hu_HU: nostr: setting to nem
  hu_HU: yesstr: setting to igen
  id_ID: nostr: setting to tidak
  id_ID: yesstr: setting to ya
  is_IS: nostr: setting to nei
  is_IS: yesstr: setting to já
  it_CH: nostr: setting to no
  it_CH: yesstr: setting to sì
  it_IT: nostr: setting to no
  it_IT: yesstr: setting to sì
  ka_GE: nostr: setting to არა
  ka_GE: yesstr: setting to კი
  kk_KZ: nostr: setting to жоқ
  kk_KZ: yesstr: setting to иә
  kl_GL: nostr: setting to naagga
  kl_GL: yesstr: setting to aap
  kn_IN: nostr: setting to ಇಲ್ಲ
  kn_IN: yesstr: setting to ಹೌದು
  ko_KR: yesstr: setting to 예
  lb_LU: nostr: setting to nee
  lb_LU: yesstr: setting to jo
  lg_UG: nostr: setting to Nedda
  lg_UG: yesstr: setting to Ye
  lt_LT: nostr: setting to ne
  lt_LT: yesstr: setting to taip
  lv_LV: nostr: setting to nē
  lv_LV: yesstr: setting to jā
  mg_MG: nostr: setting to Tsia
  mg_MG: yesstr: setting to Eny
  mn_MN: nostr: setting to үгүй
  mn_MN: yesstr: setting to тийм
  mr_IN: nostr: setting to नाहीःना
  mr_IN: yesstr: setting to होयःहो
  mt_MT: nostr: setting to le
  mt_MT: yesstr: setting to iva
  nb_NO: nostr: setting to nei
  nb_NO: yesstr: setting to ja
  ne_NP: nostr: setting to होइन
  ne_NP: yesstr: setting to हो
  nl_NL: nostr: setting to nee
  nl_NL: yesstr: setting to ja
  nn_NO: nostr: setting to nei
  nn_NO: yesstr: setting to ja
  or_IN: nostr: setting to ନା
  or_IN: yesstr: setting to ହଁ
  os_RU: nostr: setting to нӕйы
  os_RU: yesstr: setting to уойы
  pa_IN: nostr: setting to ਨਹੀਂ
  pa_IN: yesstr: setting to ਹਾਂ
  pl_PL: nostr: setting to nie
  pl_PL: yesstr: setting to tak
  pt_BR: nostr: setting to não
  pt_BR: yesstr: setting to sim
  pt_PT: nostr: setting to não
  pt_PT: yesstr: setting to sim
  ro_RO: nostr: setting to nu
  ro_RO: yesstr: setting to da
  ru_RU: nostr: setting to нет
  ru_RU: yesstr: setting to да
  ru_UA: nostr: setting to нет
  ru_UA: yesstr: setting to да
  se_NO: nostr: setting to ii
  se_NO: yesstr: setting to jo
  sl_SI: nostr: setting to ne
  sl_SI: yesstr: setting to da
  so_DJ: nostr: setting to maya
  so_DJ: yesstr: setting to haa
  so_SO: nostr: setting to maya
  so_SO: yesstr: setting to haa
  sq_AL: nostr: setting to jo
  sq_AL: yesstr: setting to po
  sr_RS@latin: nostr: setting to ne
  sr_RS@latin: yesstr: setting to da
  sr_RS: nostr: setting to не
  sr_RS: yesstr: setting to да
  sv_SE: nostr: setting to nej
  sv_SE: yesstr: setting to ja
  sw_KE: nostr: setting to Hapana
  sw_KE: yesstr: setting to Ndiyo
  yue_HK: nostr: setting to 唔係
  yue_HK: yesstr: setting to 係
  zu_ZA: nostr: setting to cha
  zu_ZA: yesstr: setting to yebo
2016-04-23 03:59:08 -04:00
Mike Frysinger
06f61f191b localedata: LC_MESSAGES.{yes,no}expr: standardize yY/nN [BZ #15262]
The vast majority of languages include yY/nN in their yes/no regexes.
Standardize the few that were missing them.
  ms_MY: noexpr: add nN
  nan_TW@latin: yesexpr: add yY
  nan_TW@latin: noexpr: add nN
  se_NO: noexpr: add nN

This also highlighted a few that were incorrectly using yY/nN because
they clashed with their localized messages:
  uz_UZ: yesexpr: change ^[+1YyHh] to ^[+1ҲҳHh]
  uz_UZ: noexpr: change ^[-0JjNn] to ^[-0ЙйNnYyJj]
  uz_UZ@cyrillic: yesexpr: change ^[+1ҲҳYy] to ^[+1ҲҳHh]
  uz_UZ@cyrillic: noexpr: change ^[-0ЙйNn] to [-0ЙйNnYyJj]
  yo_NG: move nN (short for Bẹ́ẹ̀ni) from noexpr to yesexpr
2016-04-23 03:54:27 -04:00
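The uz_UZ clash fixed in the yY/nN commit above is easy to see with the expressions quoted in its message. In this sketch the sample answer "yo'q" (Uzbek "no", written with an ASCII apostrophe) is an illustrative input, and Python's `re` is assumed to behave like POSIX matching for these simple bracket expressions.

```python
import re

# Expressions quoted in the commit message for uz_UZ.
OLD_YESEXPR = r"^[+1YyHh]"
NEW_YESEXPR = r"^[+1ҲҳHh]"
NEW_NOEXPR = r"^[-0ЙйNnYyJj]"

answer = "yo'q"  # Uzbek "no"; illustrative input, ASCII apostrophe

old_says_yes = re.search(OLD_YESEXPR, answer) is not None
new_says_yes = re.search(NEW_YESEXPR, answer) is not None
new_says_no = re.search(NEW_NOEXPR, answer) is not None

print(old_says_yes, new_says_yes, new_says_no)
```

Because the localized word for "no" starts with "y", the old yesexpr treated it as an affirmative answer; the new pair classifies it as negative.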
Mike Frysinger
f982160ead localedata: LC_MESSAGES.{yes,no}expr: add +1/-0 to all regexes [BZ #15263]
A handful of regexes were allowing +1 for yesexpr and -0 for noexpr,
which is what the i18n definition specifies.  Standardize all locales by
allowing these language-independent values in them.

Example change for en_US goes from ^[yY] to ^[+1yY], and from ^[nN]
to ^[-0nN].
2016-04-23 03:41:56 -04:00
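A short sketch of the en_US expressions named in the +1/-0 commit above. The `classify` helper is hypothetical; the point it shows is that the leading "-" in `^[-0nN]` is a literal hyphen because it comes first inside the bracket expression, so it does not form a range.

```python
import re

# en_US expressions after the change, as given in the commit message.
YESEXPR = r"^[+1yY]"
NOEXPR = r"^[-0nN]"  # "-" is first in the brackets, so it is a literal hyphen


def classify(answer: str):
    """Hypothetical helper: return "yes", "no", or None for an answer."""
    if re.search(YESEXPR, answer):
        return "yes"
    if re.search(NOEXPR, answer):
        return "no"
    return None


for answer in ("y", "1", "+", "n", "0", "-", "maybe"):
    print(repr(answer), classify(answer))
```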
Mike Frysinger
ebf01e8334 localedata: standardize LC_MESSAGES string regexes a bit
A few locales add .* to the end of regexes, but this isn't necessary.
Drop it to simplify things.  For example, ^[yY].* becomes ^[yY].  This
was done for yesexpr & noexpr in all of these locales:
  aa_ET ak_GH am_ET an_ES ar_EG as_IN ast_ES ayc_PE az_AZ be_BY
  be_BY@latin bem_ZM ber_DZ ber_MA bg_BG bhb_IN bo_IN br_FR ca_ES
  ce_RU csb_PL cs_CZ cv_RU cy_GB da_DK de_DE dv_MV dz_BT el_GR en_CA
  en_DK en_US es_ES et_EE eu_ES fa_IR ff_SN fi_FI fo_FO fr_BE fr_CH
  fr_FR fr_LU fur_IT fy_DE ga_IE gd_GB gl_ES gv_GB ha_NG he_IL hi_IN
  hr_HR hsb_DE ht_HT hu_HU ia_FR id_ID ig_NG is_IS it_CH it_IT ka_GE
  kk_KZ kl_GL kn_IN ks_IN ks_IN@devanagari ku_TR kw_GB ky_KG li_NL
  lij_IT lt_LT lv_LV mg_MG mhr_RU mi_NZ mk_MK mn_MN my_MM nan_TW@latin
  nb_NO nds_DE nds_NL ne_NP nhn_MX niu_NU niu_NZ nl_NL nn_NO oc_FR
  or_IN os_RU pa_IN pa_PK pap_AW pap_CW pl_PL ps_AF pt_BR pt_PT quz_PE
  raj_IN ro_RO ru_RU ru_UA sa_IN sc_IT sd_IN@devanagari se_NO sk_SK
  sl_SI so_DJ so_SO sq_AL sv_SE sw_KE szl_PL tcy_IN te_IN tg_TJ the_NP
  tk_TM ti_ER ti_ET tt_RU tt_RU@iqtelif ug_CN unm_US ur_IN ur_PK uz_UZ
  uz_UZ@cyrillic vi_VN wa_BE wae_CH wo_SN yi_US yo_NG

A few locales were missing ^ anchors, so add them.  For example,
[oOyY].* becomes ^[oOyY].
  fil_PH ik_CA iu_CA

Some locales are defining a subset of values when there are fuller ones
available (according to the language).  Switch them to copy instead.
  aa_DJ: copy aa_ET
  en_AU: copy en_US
  en_GB: copy en_US
  en_HK: copy en_US
  en_IN: copy en_US
  en_NG: copy en_US
  en_NZ: copy en_US
  en_PH: copy en_US
  en_SG: copy en_US
  en_ZA: copy en_US
  es_US: copy es_ES
  li_BE: copy nl_BE
2016-04-23 03:22:21 -04:00
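Both regex cleanups in the commit above can be checked with a small experiment. The sketch assumes search-style matching (the expression may match anywhere unless anchored), uses Python's `re` in place of POSIX regexes, and takes its sample answers from nowhere in particular; they are illustrative only.

```python
import re


def accepts(pattern: str, answer: str) -> bool:
    """Search-style match: succeeds if the pattern matches anywhere."""
    return re.search(pattern, answer) is not None


# Illustrative inputs, not taken from the commit.
samples = ["y", "Yes", "yes please", "no", "", "oui"]

# 1. A trailing ".*" never changes whether an anchored expression matches.
same_verdict = all(
    accepts(r"^[yY].*", s) == accepts(r"^[yY]", s) for s in samples
)

# 2. Without "^", the bracket can match in the middle of the answer,
#    so "no" is accepted because it contains an "o".
unanchored_accepts_no = accepts(r"[oOyY].*", "no")
anchored_accepts_no = accepts(r"^[oOyY]", "no")

print(same_verdict, unanchored_accepts_no, anchored_accepts_no)
```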
Timur Birsh
f3d92ffb18 localedata: kk_KZ: various updates [BZ #15578]
Tweak some of the collation settings for a few characters.

Add/update various fields:
  LC_MESSAGES
    yesstr: set to иә
    nostr: set to жоқ
  LC_MONETARY
    mon_decimal_point: change . to ,
    mon_thousands_sep: change to a non-breaking space
    p_sep_by_space: change 1 to 2
    set int_{p,n}_* fields
  LC_NUMERIC
    thousands_sep: change , to a non-breaking space
  LC_TIME
    abday: change saturday from Сн to Сб
  LC_TELEPHONE
    tel_dom_fmt: set to (%A) %l
    int_select: set to 8~10
  LC_ADDRESS:
    country_post: set to KAZ
    country_ab2: set to KZ
    country_ab3: set to KAZ
    country_isbn: set to 978-601
    lang_name: set to қазақ тілі
2016-04-23 03:13:24 -04:00
Mike Frysinger
0a410e76f5 localedata: CLDRv29: update LC_TIME week/first_week,workday fields
I've spot checked a number of these, including some that were definitely
wrong (like ff_SN).  It also fixes all open week-related bugs.

Since ff_SN is the only one that changes its base date, I also made
sure that its ordering of day translations was correct.  Looks like
another case Petr brought up where the week field was not actually
checked against the day arrays.

I also took the opportunity to drop first_weekday/first_workday when
the value aligned with the defaults (1 & 2 respectively).  This didn't
impact too many locales in practice because the majority omitted them
already.

A few locales were defining some values incorrectly for their region:
  ak_GH: week: changing [7, 19971130, 7] to [7, 19971130, 1]
  ak_GH: first_weekday: changing 1 to 2
  ayc_PE: week: changing [7, 19971130, 7] to [7, 19971130, 1]
  bem_ZM: week: changing [7, 19971130, 4] to [7, 19971130, 1]
  bem_ZM: first_weekday: changing 1 to 2
  en_IE: first_weekday: changing 2 to 1
  en_US: week: changing [7, 19971130, 7] to [7, 19971130, 1]
  es_CO: first_weekday: changing 2 to 1
  es_ES: week: changing [7, 19971130, 5] to [7, 19971130, 4]
  ff_SN: week: changing [7, 19971129, 1] to [7, 19971130, 1]
  ff_SN: first_weekday: changing 1 to 2
  ga_IE: first_weekday: changing 2 to 1
  ht_HT: week: changing [7, 19971130, 7] to [7, 19971130, 1]
  ht_HT: first_weekday: changing 1 to 2
  mk_MK: week: changing [7, 19971130, 4] to [7, 19971130, 1]
  mt_MT: first_weekday: changing 2 to 1
  quz_PE: week: changing [7, 19971130, 7] to [7, 19971130, 1]
  sr_ME: week: changing [7, 19971130, 4] to [7, 19971130, 1]
  sr_RS: week: changing [7, 19971130, 4] to [7, 19971130, 1]
  sr_RS@latin: week: changing [7, 19971130, 4] to [7, 19971130, 1]
  sw_KE: week: changing [7, 19971130, 4] to [7, 19971130, 1]
  sw_KE: first_weekday: changing 2 to 1
  uk_UA: week: changing [7, 19971130, 4] to [7, 19971130, 1]
  unm_US: week: changing [7, 19971130, 4] to [7, 19971130, 1]
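The week triples in the list above can be decoded with a few lines of code. This sketch assumes the usual reading of the field: the first value is the number of days in a week, the second is a reference date written as YYYYMMDD whose weekday corresponds to the first entry of the day list, and the third is the minimum number of days in the first week of the year. The helper names are hypothetical.

```python
from datetime import date

DAY_NAMES = ["Monday", "Tuesday", "Wednesday", "Thursday",
             "Friday", "Saturday", "Sunday"]


def parse_week(week):
    """Split a [ndays, YYYYMMDD, min_days] triple into typed parts."""
    ndays, base, min_days = week
    base_date = date(base // 10000, base // 100 % 100, base % 100)
    return ndays, base_date, min_days


def base_weekday(week) -> str:
    """Name of the weekday the reference date falls on."""
    _, base_date, _ = parse_week(week)
    return DAY_NAMES[base_date.weekday()]


# ff_SN before and after the change listed above.
print(base_weekday([7, 19971129, 1]))
print(base_weekday([7, 19971130, 1]))
```

Under that reading, the old ff_SN reference date fell on a Saturday while every other locale in the list uses a Sunday, which is why the ordering of its day names needed a separate check.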

Some locales were copying locales that had the wrong week settings, so
that content had to be duplicated so the values could be adjusted:
  el_CY: week: setting to [7, 19971130, 1]
  en_AG: week: setting to [7, 19971130, 1]
  en_AG: first_weekday: changing 2 to 1
  en_ZM: week: setting to [7, 19971130, 1]
  es_CU: week: setting to [7, 19971130, 1]
  nl_AW: week: setting to [7, 19971130, 1]
  sw_TZ: first_weekday: setting to 2
  ta_LK: first_weekday: setting to 2

The majority of locales were omitting the week field, thus getting the
default [7, 19971130, 0 (localedef) / 7 (ISO standard)].  Unfortunately,
neither of those is used by any locale, so we end up having to define
the field just to set the ndays field.  In practice, this rarely matters
due to its usage, and the first two fields match the defaults.
  aa_DJ: setting to [7, 19971130, 1]
  aa_ER: setting to [7, 19971130, 1]
  aa_ER@saaho: setting to [7, 19971130, 1]
  aa_ET: setting to [7, 19971130, 1]
  af_ZA: setting to [7, 19971130, 1]
  am_ET: setting to [7, 19971130, 1]
  an_ES: setting to [7, 19971130, 4]
  anp_IN: setting to [7, 19971130, 1]
  ar_AE: setting to [7, 19971130, 1]
  ar_BH: setting to [7, 19971130, 1]
  ar_DZ: setting to [7, 19971130, 1]
  ar_EG: setting to [7, 19971130, 1]
  ar_IN: setting to [7, 19971130, 1]
  ar_IQ: setting to [7, 19971130, 1]
  ar_JO: setting to [7, 19971130, 1]
  ar_KW: setting to [7, 19971130, 1]
  ar_LB: setting to [7, 19971130, 1]
  ar_LY: setting to [7, 19971130, 1]
  ar_MA: setting to [7, 19971130, 1]
  ar_OM: setting to [7, 19971130, 1]
  ar_QA: setting to [7, 19971130, 1]
  ar_SA: setting to [7, 19971130, 1]
  ar_SD: setting to [7, 19971130, 1]
  ar_SS: setting to [7, 19971130, 1]
  ar_SY: setting to [7, 19971130, 1]
  ar_TN: setting to [7, 19971130, 1]
  ar_YE: setting to [7, 19971130, 1]
  as_IN: setting to [7, 19971130, 1]
  ast_ES: setting to [7, 19971130, 4]
  az_AZ: setting to [7, 19971130, 1]
  be_BY: setting to [7, 19971130, 1]
  be_BY@latin: setting to [7, 19971130, 1]
  ber_DZ: setting to [7, 19971130, 1]
  ber_MA: setting to [7, 19971130, 1]
  bg_BG: setting to [7, 19971130, 4]
  bhb_IN: setting to [7, 19971130, 1]
  bho_IN: setting to [7, 19971130, 1]
  bn_BD: setting to [7, 19971130, 1]
  bn_IN: setting to [7, 19971130, 1]
  bo_CN: setting to [7, 19971130, 1]
  br_FR: setting to [7, 19971130, 4]
  brx_IN: setting to [7, 19971130, 1]
  bs_BA: setting to [7, 19971130, 1]
  byn_ER: setting to [7, 19971130, 1]
  ca_AD: setting to [7, 19971130, 4]
  ca_ES: setting to [7, 19971130, 4]
  ca_ES@euro: setting to [7, 19971130, 4]
  ca_FR: setting to [7, 19971130, 4]
  ca_IT: setting to [7, 19971130, 4]
  ce_RU: setting to [7, 19971130, 1]
  cmn_TW: setting to [7, 19971130, 1]
  crh_UA: setting to [7, 19971130, 1]
  cv_RU: setting to [7, 19971130, 1]
  cy_GB: setting to [7, 19971130, 4]
  de_BE: setting to [7, 19971130, 4]
  de_LU: setting to [7, 19971130, 4]
  doi_IN: setting to [7, 19971130, 1]
  dv_MV: setting to [7, 19971130, 1]
  dz_BT: setting to [7, 19971130, 1]
  el_GR: setting to [7, 19971130, 4]
  el_GR@euro: setting to [7, 19971130, 4]
  en_AU: setting to [7, 19971130, 1]
  en_BW: setting to [7, 19971130, 1]
  en_CA: setting to [7, 19971130, 1]
  en_HK: setting to [7, 19971130, 1]
  en_IE: setting to [7, 19971130, 4]
  en_IN: setting to [7, 19971130, 1]
  en_NG: setting to [7, 19971130, 1]
  en_NZ: setting to [7, 19971130, 1]
  en_PH: setting to [7, 19971130, 1]
  en_SG: setting to [7, 19971130, 1]
  en_ZA: setting to [7, 19971130, 1]
  en_ZW: setting to [7, 19971130, 1]
  es_AR: setting to [7, 19971130, 1]
  es_BO: setting to [7, 19971130, 1]
  es_CL: setting to [7, 19971130, 1]
  es_CO: setting to [7, 19971130, 1]
  es_CR: setting to [7, 19971130, 1]
  es_DO: setting to [7, 19971130, 1]
  es_EC: setting to [7, 19971130, 1]
  es_ES@euro: setting to [7, 19971130, 4]
  es_GT: setting to [7, 19971130, 1]
  es_HN: setting to [7, 19971130, 1]
  es_MX: setting to [7, 19971130, 1]
  es_NI: setting to [7, 19971130, 1]
  es_PA: setting to [7, 19971130, 1]
  es_PE: setting to [7, 19971130, 1]
  es_PR: setting to [7, 19971130, 1]
  es_PY: setting to [7, 19971130, 1]
  es_SV: setting to [7, 19971130, 1]
  es_US: setting to [7, 19971130, 1]
  es_UY: setting to [7, 19971130, 1]
  es_VE: setting to [7, 19971130, 1]
  eu_ES: setting to [7, 19971130, 4]
  fa_IR: setting to [7, 19971130, 1]
  fil_PH: setting to [7, 19971130, 1]
  fo_FO: setting to [7, 19971130, 4]
  fr_CA: setting to [7, 19971130, 1]
  fr_CH: setting to [7, 19971130, 4]
  fr_LU: setting to [7, 19971130, 4]
  fy_NL: setting to [7, 19971130, 4]
  ga_IE: setting to [7, 19971130, 4]
  gd_GB: setting to [7, 19971130, 4]
  gez_ER: setting to [7, 19971130, 1]
  gez_ET: setting to [7, 19971130, 1]
  gl_ES: setting to [7, 19971130, 4]
  gu_IN: setting to [7, 19971130, 1]
  gv_GB: setting to [7, 19971130, 4]
  hak_TW: setting to [7, 19971130, 1]
  ha_NG: setting to [7, 19971130, 1]
  he_IL: setting to [7, 19971130, 1]
  hi_IN: setting to [7, 19971130, 1]
  hne_IN: setting to [7, 19971130, 1]
  hr_HR: setting to [7, 19971130, 1]
  hy_AM: setting to [7, 19971130, 1]
  id_ID: setting to [7, 19971130, 1]
  ig_NG: setting to [7, 19971130, 1]
  ik_CA: setting to [7, 19971130, 1]
  is_IS: setting to [7, 19971130, 4]
  it_CH: setting to [7, 19971130, 4]
  it_IT: setting to [7, 19971130, 4]
  it_IT@euro: setting to [7, 19971130, 4]
  iu_CA: setting to [7, 19971130, 1]
  ja_JP: setting to [7, 19971130, 1]
  ka_GE: setting to [7, 19971130, 1]
  kk_KZ: setting to [7, 19971130, 1]
  kl_GL: setting to [7, 19971130, 1]
  km_KH: setting to [7, 19971130, 1]
  kn_IN: setting to [7, 19971130, 1]
  kok_IN: setting to [7, 19971130, 1]
  ko_KR: setting to [7, 19971130, 1]
  ks_IN: setting to [7, 19971130, 1]
  ks_IN@devanagari: setting to [7, 19971130, 1]
  ku_TR: setting to [7, 19971130, 1]
  kw_GB: setting to [7, 19971130, 4]
  ky_KG: setting to [7, 19971130, 1]
  lg_UG: setting to [7, 19971130, 1]
  lij_IT: setting to [7, 19971130, 4]
  lo_LA: setting to [7, 19971130, 1]
  lt_LT: setting to [7, 19971130, 4]
  lv_LV: setting to [7, 19971130, 1]
  lzh_TW: setting to [7, 19971130, 1]
  mag_IN: setting to [7, 19971130, 1]
  mai_IN: setting to [7, 19971130, 1]
  mg_MG: setting to [7, 19971130, 1]
  mhr_RU: setting to [7, 19971130, 1]
  mi_NZ: setting to [7, 19971130, 1]
  ml_IN: setting to [7, 19971130, 1]
  mni_IN: setting to [7, 19971130, 1]
  mn_MN: setting to [7, 19971130, 1]
  mr_IN: setting to [7, 19971130, 1]
  ms_MY: setting to [7, 19971130, 1]
  mt_MT: setting to [7, 19971130, 1]
  my_MM: setting to [7, 19971130, 1]
  nan_TW: setting to [7, 19971130, 1]
  nan_TW@latin: setting to [7, 19971130, 1]
  ne_NP: setting to [7, 19971130, 1]
  nhn_MX: setting to [7, 19971130, 1]
  niu_NU: setting to [7, 19971130, 1]
  niu_NZ: setting to [7, 19971130, 1]
  nl_BE: setting to [7, 19971130, 4]
  nl_BE@euro: setting to [7, 19971130, 4]
  nr_ZA: setting to [7, 19971130, 1]
  nso_ZA: setting to [7, 19971130, 1]
  oc_FR: setting to [7, 19971130, 4]
  om_ET: setting to [7, 19971130, 1]
  om_KE: setting to [7, 19971130, 1]
  or_IN: setting to [7, 19971130, 1]
  os_RU: setting to [7, 19971130, 1]
  pa_IN: setting to [7, 19971130, 1]
  pap_AW: setting to [7, 19971130, 1]
  pap_CW: setting to [7, 19971130, 1]
  pa_PK: setting to [7, 19971130, 1]
  ps_AF: setting to [7, 19971130, 1]
  pt_BR: setting to [7, 19971130, 1]
  pt_PT: setting to [7, 19971130, 4]
  pt_PT@euro: setting to [7, 19971130, 4]
  raj_IN: setting to [7, 19971130, 1]
  ro_RO: setting to [7, 19971130, 1]
  ru_RU: setting to [7, 19971130, 1]
  ru_UA: setting to [7, 19971130, 1]
  rw_RW: setting to [7, 19971130, 1]
  sa_IN: setting to [7, 19971130, 1]
  sat_IN: setting to [7, 19971130, 1]
  sd_IN: setting to [7, 19971130, 1]
  sd_IN@devanagari: setting to [7, 19971130, 1]
  se_NO: setting to [7, 19971130, 4]
  shs_CA: setting to [7, 19971130, 1]
  sid_ET: setting to [7, 19971130, 1]
  si_LK: setting to [7, 19971130, 1]
  sl_SI: setting to [7, 19971130, 1]
  so_DJ: setting to [7, 19971130, 1]
  so_ET: setting to [7, 19971130, 1]
  so_KE: setting to [7, 19971130, 1]
  so_SO: setting to [7, 19971130, 1]
  sq_AL: setting to [7, 19971130, 1]
  ss_ZA: setting to [7, 19971130, 1]
  st_ZA: setting to [7, 19971130, 1]
  sv_FI: setting to [7, 19971130, 4]
  sv_SE: setting to [7, 19971130, 4]
  ta_IN: setting to [7, 19971130, 1]
  tcy_IN: setting to [7, 19971130, 1]
  te_IN: setting to [7, 19971130, 1]
  tg_TJ: setting to [7, 19971130, 1]
  the_NP: setting to [7, 19971130, 1]
  th_TH: setting to [7, 19971130, 1]
  ti_ER: setting to [7, 19971130, 1]
  ti_ET: setting to [7, 19971130, 1]
  tig_ER: setting to [7, 19971130, 1]
  tk_TM: setting to [7, 19971130, 1]
  tl_PH: setting to [7, 19971130, 1]
  tn_ZA: setting to [7, 19971130, 1]
  tr_CY: setting to [7, 19971130, 1]
  tr_TR: setting to [7, 19971130, 1]
  ts_ZA: setting to [7, 19971130, 1]
  tt_RU: setting to [7, 19971130, 1]
  tt_RU@iqtelif: setting to [7, 19971130, 1]
  ug_CN: setting to [7, 19971130, 1]
  ur_IN: setting to [7, 19971130, 1]
  ur_PK: setting to [7, 19971130, 1]
  uz_UZ: setting to [7, 19971130, 1]
  uz_UZ@cyrillic: setting to [7, 19971130, 1]
  ve_ZA: setting to [7, 19971130, 1]
  vi_VN: setting to [7, 19971130, 1]
  wa_BE: setting to [7, 19971130, 4]
  wal_ET: setting to [7, 19971130, 1]
  wo_SN: setting to [7, 19971130, 1]
  xh_ZA: setting to [7, 19971130, 1]
  yi_US: setting to [7, 19971130, 1]
  yo_NG: setting to [7, 19971130, 1]
  yue_HK: setting to [7, 19971130, 1]
  zh_CN: setting to [7, 19971130, 1]
  zh_HK: setting to [7, 19971130, 1]
  zh_SG: setting to [7, 19971130, 1]
  zh_TW: setting to [7, 19971130, 1]
  zu_ZA: setting to [7, 19971130, 1]

Finally, set first_weekday in all the locales that were omitting it
and wanted something other than the default of 1.
  aa_DJ: setting to 7
  aa_ER: setting to 2
  aa_ER@saaho: setting to 2
  ar_AE: setting to 7
  ar_BH: setting to 7
  ar_DZ: setting to 7
  ar_EG: setting to 7
  ar_IQ: setting to 7
  ar_JO: setting to 7
  ar_KW: setting to 7
  ar_LB: setting to 2
  ar_LY: setting to 7
  ar_MA: setting to 7
  ar_OM: setting to 7
  ar_QA: setting to 7
  ar_SD: setting to 7
  ar_SS: setting to 2
  ar_SY: setting to 7
  az_AZ: setting to 2
  be_BY: setting to 2
  be_BY@latin: setting to 2
  ber_DZ: setting to 7
  ber_MA: setting to 7
  bn_BD: setting to 6
  bs_BA: setting to 2
  byn_ER: setting to 2
  dv_MV: setting to 6
  en_NG: setting to 2
  es_BO: setting to 2
  es_CL: setting to 2
  es_EC: setting to 2
  es_UY: setting to 2
  fo_FO: setting to 2
  fr_CH: setting to 2
  gd_GB: setting to 2
  gez_ER: setting to 2
  ha_NG: setting to 2
  hr_HR: setting to 2
  hy_AM: setting to 2
  ig_NG: setting to 2
  is_IS: setting to 2
  it_CH: setting to 2
  ka_GE: setting to 2
  kk_KZ: setting to 2
  kl_GL: setting to 2
  ku_TR: setting to 2
  ky_KG: setting to 2
  lg_UG: setting to 2
  mg_MG: setting to 2
  mn_MN: setting to 2
  ms_MY: setting to 2
  niu_NU: setting to 2
  pap_AW: setting to 2
  pap_CW: setting to 2
  pt_PT: setting to 2
  pt_PT@euro: setting to 2
  rw_RW: setting to 2
  se_NO: setting to 2
  si_LK: setting to 2
  so_DJ: setting to 7
  so_SO: setting to 2
  sq_AL: setting to 2
  tg_TJ: setting to 2
  ti_ER: setting to 2
  tig_ER: setting to 2
  tk_TM: setting to 2
  tt_RU: setting to 2
  tt_RU@iqtelif: setting to 2
  uz_UZ: setting to 2
  uz_UZ@cyrillic: setting to 2
  vi_VN: setting to 2
  wo_SN: setting to 2
  yo_NG: setting to 2
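
As a rough illustration of what these numbers mean, the sketch below maps a first_weekday value to a day name. It assumes that first_weekday is a 1-based offset from the reference date in the week triple (19971130 above), so 1 is the weekday of that date. The helper name first_weekday_name is hypothetical and is not part of glibc.

```python
from datetime import date, timedelta

# Reference date taken from the `week` triples in this log: [7, 19971130, N].
WEEK_REFERENCE = date(1997, 11, 30)

# Indexed by date.weekday(), where Monday is 0 and Sunday is 6.
DAY_NAMES = ["Monday", "Tuesday", "Wednesday", "Thursday",
             "Friday", "Saturday", "Sunday"]


def first_weekday_name(first_weekday: int, reference: date = WEEK_REFERENCE) -> str:
    """Return the English day name for a 1-based first_weekday value.

    Assumption: value 1 names the weekday of the reference date,
    value 2 the day after it, and so on up to 7.
    """
    if not 1 <= first_weekday <= 7:
        raise ValueError("first_weekday must be in 1..7")
    day = reference + timedelta(days=first_weekday - 1)
    return DAY_NAMES[day.weekday()]


if __name__ == "__main__":
    for value in (1, 2, 6, 7):
        print(value, first_weekday_name(value))
```

Under that assumption, the values 2, 6 and 7 in the list above would read as Monday, Friday and Saturday respectively.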
2016-04-23 03:01:45 -04:00
Guy Rutenberg
146ffc146f localedata: en_IL: new English locale [BZ #19963] 2016-04-21 22:49:53 -04:00
Claude Paroz
c33ab198fd localedata: ln_CD: new locale [BZ #12676] 2016-04-21 22:48:10 -04:00
Mike Frysinger
36f91a5db6 localedata: fix LC_TELEPHONE in a few locales
A bunch of locales were copying the wrong source locale -- looks like they
were basically TODOs from the original imports.  This led to bad int_prefix
values in those locales.
2016-04-16 03:41:35 -04:00
Mike Frysinger
3544cbc8f4 localedata: LC_IDENTIFICATION: delete uncommon fields
Very few locales set audience/application/abbreviation, and
even the ones that do, set them largely to default/useless
values.  Drop them from the few locales until we decide we
want to set these everywhere (to something useful).
2016-04-16 03:29:36 -04:00
Mike Frysinger
223f17cf7b localedata: CLDRv29: update LC_MONETARY int_curr_symbol & currency_symbol
This updates a few locales based on CLDR v29 data.  I've verified most by
hand while the rest I know are correct.

For int_curr_symbol, it should be 3 characters followed by a space:
  ar_SS: changing SDG  to SSP
  bem_ZM: changing ZMK  to ZMW
  dz_BT: changing BTN  to BTN   # Just changing " " to "<U0020>".
  en_ZW: changing ZWD  to USD
  es_SV: changing SVC  to USD
  lv_LV: changing LVL  to EUR
  ne_NP: changing INR  to NPR
  pap_AW: changing ANG  to AWG
  the_NP: changing INR  to NPR
Some of these require updates to iso-4217.def.
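
The "3 characters followed by a space" rule can be checked mechanically. The sketch below is only an illustration, not localedef's actual check: it assumes the three characters are uppercase ASCII letters, as in ISO 4217 codes, and the function name is hypothetical.

```python
import re

# Assumed shape: three uppercase ASCII letters plus exactly one trailing space.
INT_CURR_SYMBOL_RE = re.compile(r"[A-Z]{3} ")


def is_valid_int_curr_symbol(value: str) -> bool:
    """Return True if value looks like an int_curr_symbol such as 'SSP '."""
    return INT_CURR_SYMBOL_RE.fullmatch(value) is not None


if __name__ == "__main__":
    for candidate in ("SSP ", "SSP", "USD  ", "usd "):
        print(repr(candidate), is_valid_int_curr_symbol(candidate))
```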

For currency_symbol, it should be the standard/localized symbol name:
  aa_DJ: changing $ to Fdj
  ar_SA: changing ريال to ر.س
  ar_SS: changing ج.س. to £
  az_AZ: changing man. to ₼
  bg_BG: changing лв to лв.
  ce_RU: changing руб to ₽
  crh_UA: changing gr to ₴
  cv_RU: changing t to ₽
  de_CH: changing Fr. to CHF
  dz_BT: changing དངུལ་ཀྲམ་ to Nu.
  en_BW: changing Pu to P
  en_DK: changing ¤ to kr.
  en_PH: changing Php to ₱
  en_ZW: changing Z$ to $
  es_BO: changing $b to Bs
  es_DO: changing $ to RD$
  es_HN: changing L. to L
  es_PA: changing B/ to B/.
  es_SV: changing ₡ to $
  fil_PH: changing PhP to ₱
  he_IL: changing שח to ₪
  hy_AM: changing Դ to ֏
  ka_GE: changing ლ to ₾
  kk_KZ: changing тг to ₸
  ko_KR: changing ₩ to ₩
  lg_UG: changing /- to USh
  lv_LV: changing Ls to €
  mg_MG: changing AR to Ar
  mhr_RU: changing ТЕҤ to ₽
  my_MM: changing Ks to K
  os_RU: changing сом to ₽
  pap_AW: changing f to ƒ
  pap_CW: changing f to ƒ
  ps_AF: changing افغانۍ to ؋
  rw_RW: changing Frw to FRw
  ru_RU: changing руб to ₽
  ru_UA: changing гр to ₴
  sd_IN@devanagari: changing रु to ₹
  se_NO: changing  ru to kr
  si_LK: changing ₨ to රු
  so_SO: changing $ to S
  sq_AL: changing Lek to L
  ti_ER: changing $ to Nfk
  ti_ET: changing $ to Br
  tl_PH: changing PhP to ₱
  tr_TR: changing TL to ₺
  tt_RU: changing руб to ₽
  tt_RU@iqtelif: changing sum to ₽
  uz_UZ: changing so'm to soʻm
Note: Some of the characters might not render as they're still quite new
in the Unicode database.
2016-04-15 13:42:29 -04:00
Mike Frysinger
900f59f084 localedef: check LC_IDENTIFICATION.category values
Currently localedef accepts any value for the category keyword.  This has
allowed bad values to propagate to the vast majority of locales (~90%).
Add some logic to only accept a few standards.
2016-04-15 12:44:05 -04:00
Mike Frysinger
1a06eee869 localedata: LC_IDENTIFICATION.category: set to ISO 30112 2014 standard
The ISO 30112 standard defines the valid values for the category
keyword as only a few options:
	posix:1993
	i18n:2004
	i18n:2012

The vast majority of locales had changed the "i18n" string to the
name of its own locale (e.g. "ak_GH:2013") as well as tweaking the
date (presumably thinking it should be the date of submission).

Convert all of them to "i18n:2012" for consistency.  A follow-up
change will update localedef to actually check/validate the field.
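
A minimal sketch of the check and the conversion described here, written in Python for illustration only. localedef itself is written in C and its real validation logic is not shown in this log, so both function names are hypothetical.

```python
# The three category values listed in this commit message.
VALID_CATEGORIES = frozenset({"posix:1993", "i18n:2004", "i18n:2012"})


def is_valid_category(value: str) -> bool:
    """Return True if value is one of the accepted category strings."""
    return value in VALID_CATEGORIES


def normalize_category(value: str) -> str:
    """Keep an accepted value as-is; otherwise convert it to 'i18n:2012'."""
    return value if is_valid_category(value) else "i18n:2012"


if __name__ == "__main__":
    for candidate in ("ak_GH:2013", "posix:1993", "i18n:2012"):
        print(candidate, "->", normalize_category(candidate))
```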
2016-04-15 12:37:38 -04:00
Mike Frysinger
f8da6e93a6 localedata: CLDRv29: update LC_ADDRESS.country_name translations
This updates a bunch of locales based on CLDR v29 data:
  bg_BG: changing Bulgaria to България
  bo_CN: changing ཀྲུང་ཧྭ་མི་དམངས་སྤྱི་མཐུན་རྒྱལ་ཁབ། to རྒྱ་ནག
  bo_IN: changing རྒྱ་གར to རྒྱ་གར་
  cy_GB: changing Cymru to Y Deyrnas Unedig
  dz_BT: changing འབྲུག། to འབྲུག
  en_US: changing USA to United States
  es_US: changing USA to Estados Unidos
  gd_GB: changing Breatainn Mhòr to An Rìoghachd Aonaichte
  ha_NG: changing Nigeria to Najeriya
  mk_MK: changing Macedonia to Македонија
  mn_MN: changing Mongolia to Монгол
  sq_MK: changing Macedonia to Maqedoni
  sr_RS@latin: changing Srbija i Crna Gora to Srbija
  tr_CY: changing Northern Cyprus to Kıbrıs
  tr_TR: changing Turkey to Türkiye
  ug_CN: changing 中华人民共和国 to جۇڭگو
  uz_UZ: changing O'zbekistan to Oʻzbekiston
  vi_VN: changing Việt nam to Việt Nam
  wae_CH: changing Switzerland to Schwiz
  yi_US: changing די פֿאראײניקטע שטאַטן to פֿאַראייניגטע שטאַטן
  yo_NG: changing Nigeria to Orílẹ́ède Nàìjíríà
  yue_HK: changing 香港 to 中華人民共和國香港特別行政區
  zu_ZA: changing Mzansi Afrika to i-South Africa

These all look largely straightforward.  Many had English translations
instead of native, and a few have been updated.  I can't verify some of
them as I'm not personally familiar, but the CLDR data matches.

The USA->United States seems a little odd, but that is also what the
CLDR database uses everywhere (rather than "United States of America").

We can also fill in a country name where there wasn't one before.
Many look correct to me (mostly the English ones), but there are also
many I have no way to judge.  Still, it can't be worse than leaving it
blank? :)
  ar_AE: changing to الإمارات العربية المتحدة
  ar_BH: changing to البحرين
  ar_DZ: changing to الجزائر
  ar_EG: changing to مصر
  ar_IN: changing to الهند
  ar_IQ: changing to العراق
  ar_JO: changing to الأردن
  ar_KW: changing to الكويت
  ar_LB: changing to لبنان
  ar_LY: changing to ليبيا
  ar_MA: changing to المغرب
  ar_OM: changing to عُمان
  ar_QA: changing to قطر
  ar_SA: changing to المملكة العربية السعودية
  ar_SD: changing to السودان
  ar_SS: changing to جنوب السودان
  ar_SY: changing to سوريا
  ar_TN: changing to تونس
  ar_YE: changing to اليمن
  as_IN: changing to ভাৰত
  ast_ES: changing to España
  az_AZ: changing to Azərbaycan
  be_BY: changing to Беларусь
  bn_IN: changing to ভারত
  br_FR: changing to Frañs
  brx_IN: changing to भारत
  bs_BA: changing to Bosna i Hercegovina
  ca_AD: changing to Andorra
  ca_ES: changing to Espanya
  ca_FR: changing to França
  ca_IT: changing to Itàlia
  ce_RU: changing to Росси
  da_DK: changing to Danmark
  de_AT: changing to Österreich
  de_BE: changing to Belgien
  de_CH: changing to Schweiz
  de_LU: changing to Luxemburg
  el_CY: changing to Κύπρος
  el_GR: changing to Ελλάδα
  en_AG: changing to Antigua & Barbuda
  en_AU: changing to Australia
  en_BW: changing to Botswana
  en_CA: changing to Canada
  en_DK: changing to Denmark
  en_GB: changing to United Kingdom
  en_HK: changing to Hong Kong SAR China
  en_IE: changing to Ireland
  en_IN: changing to India
  en_NZ: changing to New Zealand
  en_PH: changing to Philippines
  en_SG: changing to Singapore
  en_ZW: changing to Zimbabwe
  es_AR: changing to Argentina
  es_BO: changing to Bolivia
  es_CL: changing to Chile
  es_CO: changing to Colombia
  es_CU: changing to Cuba
  es_DO: changing to República Dominicana
  es_EC: changing to Ecuador
  es_ES: changing to España
  es_GT: changing to Guatemala
  es_HN: changing to Honduras
  es_MX: changing to México
  es_NI: changing to Nicaragua
  es_PA: changing to Panamá
  es_PE: changing to Perú
  es_PR: changing to Puerto Rico
  es_PY: changing to Paraguay
  es_SV: changing to El Salvador
  es_UY: changing to Uruguay
  es_VE: changing to Venezuela
  eu_ES: changing to Espainia
  fil_PH: changing to Pilipinas
  fo_FO: changing to Føroyar
  fr_BE: changing to Belgique
  fr_CA: changing to Canada
  fr_CH: changing to Suisse
  fr_FR: changing to France
  fr_LU: changing to Luxembourg
  fur_IT: changing to Italie
  fy_DE: changing to Dútslân
  fy_NL: changing to Nederlân
  ga_IE: changing to Éire
  gl_ES: changing to España
  gu_IN: changing to ભારત
  gv_GB: changing to Rywvaneth Unys
  he_IL: changing to ישראל
  hi_IN: changing to भारत
  hr_HR: changing to Hrvatska
  hu_HU: changing to Magyarország
  id_ID: changing to Indonesia
  is_IS: changing to Ísland
  it_CH: changing to Svizzera
  it_IT: changing to Italia
  ja_JP: changing to 日本
  ka_GE: changing to საქართველო
  kk_KZ: changing to Қазақстан
  kl_GL: changing to Kalaallit Nunaat
  kn_IN: changing to ಭಾರತ
  kok_IN: changing to भारत
  ko_KR: changing to 대한민국
  ks_IN: changing to ہِنٛدوستان
  ks_IN@devanagari: changing to भारत
  kw_GB: changing to Rywvaneth Unys
  ky_KG: changing to Кыргызстан
  lt_LT: changing to Lietuva
  lv_LV: changing to Latvija
  mg_MG: changing to Madagasikara
  ml_IN: changing to ഇന്ത്യ
  mr_IN: changing to भारत
  ms_MY: changing to Malaysia
  mt_MT: changing to Malta
  nb_NO: changing to Norge
  ne_NP: changing to नेपाल
  nl_AW: changing to Aruba
  nl_BE: changing to België
  nl_NL: changing to Nederland
  nn_NO: changing to Noreg
  or_IN: changing to ଭାରତ
  os_RU: changing to Уӕрӕсе
  pa_IN: changing to ਭਾਰਤ
  pa_PK: changing to ਪਾਕਿਸਤਾਨ
  pl_PL: changing to Polska
  pt_BR: changing to Brasil
  pt_PT: changing to Portugal
  ru_RU: changing to Россия
  ru_UA: changing to Украина
  sd_IN@devanagari: changing to भारत
  se_NO: changing to Norga
  si_LK: changing to ශ්‍රී ලංකාව
  sk_SK: changing to Slovensko
  sl_SI: changing to Slovenija
  sq_AL: changing to Shqipëri
  sv_SE: changing to Sverige
  ta_IN: changing to இந்தியா
  ta_LK: changing to இலங்கை
  ur_IN: changing to بھارت
  ur_PK: changing to پاکستان
2016-04-15 00:52:23 -04:00
Mike Frysinger
0e8120ae53 localedata: fix LC_ADDRESS.country_car entries
These entries have been checked mostly against Wikipedia, but also using
the sources it cites (like the UN and other treaty sources).

Fix incorrect values:
  en_BW:  changing RB to BW
  kl_GL:  changing GRO to KN
  km_KH:  changing LAO to KH
  my_MM:  changing BA to MYA
  oc_FR:  changing F to F
  tr_CY:  changing TR to CY
  wae_CH: changing DH to CH

Add missing entries:
  aa_DJ:  changing to DJI
  ak_GH:  changing to GH
  ar_OM:  changing to OM
  ar_SS:  changing to SUD
  ar_YE:  changing to YAR
  bo_CN:  changing to CHN
  cmn_TW: changing to RC
  dv_MV:  changing to MV
  dz_BT:  changing to BHT
  en_AG:  changing to AG
  es_HN:  changing to HN
  es_PR:  changing to PR
  hak_TW: changing to RC
  lzh_TW: changing to RC
  nan_TW: changing to RC
  nan_TW@latin: changing to RC
  nl_AW:  changing to AUA
  pap_AW: changing to AUA
  so_DJ:  changing to DJI
  the_NP: changing to NEP
  ug_CN:  changing to CHN
  yue_HK: changing to HK
  zh_CN:  changing to CHN
  zh_HK:  changing to HK
  zh_TW:  changing to RC
2016-04-15 00:41:22 -04:00
Mike Frysinger
76c3498180 localedata: CLDRv29: update LC_ADDRESS.country_num values
This updates a few locales based on CLDR v29 data.

Add missing fields:
  as_IN: changing to 356
  dv_MV: changing to 462
  kk_KZ: changing to 398
  my_MM: changing to 104
  rw_RW: changing to 646
  tt_RU: changing to 643

Update ones that are wrong:
  dz_BT: changing BHU to 064
  en_PH: changing 360 to 608
  km_KH: changing 418 to 116
  ky_KG: changing 643 to 417
  tr_CY: changing 792 to 196
  wo_SN: changing 450 to 686

As a result of fixing these, I had to update country_ab[23]:
  dz_BT: changing BHU to BTN
  en_PH: changing ID/IDN to PH/PHL
  km_KH: changing LA/LAO to KH/KHM
  ky_KG: changing KY/KYR to KG/KGZ
  tr_CY: changing TR/TUR to CY/CYP
  wo_SN: changing MG/MDG to SN/SEN

Pad with leading zeros to match the standard and other locales:
  ber_DZ: changing 12 to 012
  ca_AD: changing 20 to 020
  en_AG: changing 28 to 028
  hy_AM: changing 51 to 051
  li_BE: changing 56 to 056
  wa_BE: changing 56 to 056

I hand checked the first two sets against ISO 3166-1 directly.
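
The zero-padding step is simple enough to sketch. This is an illustration of the transformation only, assuming country_num is held as a string of at most three ASCII digits; the function name is hypothetical.

```python
def pad_country_num(value: str) -> str:
    """Left-pad a numeric country code with zeros to three digits."""
    if not (value.isascii() and value.isdigit()) or len(value) > 3:
        raise ValueError(f"not a numeric country code: {value!r}")
    return value.zfill(3)


if __name__ == "__main__":
    for code in ("12", "20", "056"):
        print(code, "->", pad_country_num(code))
```

A non-numeric value such as the old dz_BT entry (BHU) is rejected rather than padded, since it needs a real correction.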
2016-04-15 00:27:11 -04:00
Mike Frysinger
fe1f87ac12 localedata: LC_PAPER: use copy directives everywhere
There are only two page sizes that locales use: US-Letter and A4.
For the former, move to copying the en_US locale, while for the
latter, move to copying the i18n locale.  This lets us clean up
all the stray comments like FIXME.

There should be no functional differences here.
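
For illustration, the sketch below renders the kind of LC_PAPER section this change moves to. The mapping from page size to copy source follows the message above; the helper name and the dictionary keys are hypothetical, and the real locale files may carry extra comments or whitespace.

```python
# Page size to the locale whose LC_PAPER section is copied, per this commit.
COPY_SOURCE = {"US-Letter": "en_US", "A4": "i18n"}


def lc_paper_section(paper: str) -> str:
    """Return an LC_PAPER section that copies the matching source locale."""
    source = COPY_SOURCE[paper]  # raises KeyError for an unknown page size
    return f'LC_PAPER\ncopy "{source}"\nEND LC_PAPER\n'


if __name__ == "__main__":
    print(lc_paper_section("A4"))
```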
2016-04-15 00:15:14 -04:00