Commit Graph

22711 Commits

Author SHA1 Message Date
David Beaumont
dfc8b8b746 ICU-20697 Delete now unused files and documentation for the old ICU LDML tooling. 2020-07-14 20:27:28 +02:00
Michael Block
f917c43cf1 ICU-21178 Adding the trailing space back into two RBBI test cases. 2020-07-07 16:05:05 -07:00
Makoto Kato
c9037ca8d3 ICU-11992 uprv_tzname doesn't return valid time zone on Android 2020-07-06 10:11:20 -07:00
John Wilcock
6fe86f3934 ICU-21173 Add support for more currency variants. ICU4C equivalent of…
See #1184
2020-07-03 04:51:15 +02:00
Hugo van der Merwe
3fca290880 ICU-21174 Add a const version of MaybeStackVector::getAlias().
(Also makes a tiny tweak to appendAll() documentation.)
2020-07-02 01:56:08 +02:00
Markus Scherer
4d428cb8f3 ICU-21176 spoof checker: remove whitelist/blacklist metaphors from API docs 2020-07-01 15:21:05 -07:00
John Wilcock
9219c6ae03 ICU-13733 Added test for mismatching currency format for strict-mode parsing
See #1169
2020-06-30 02:22:57 +02:00
Diego Barrios Romero
de0306daaa ICU-21170 Fix function prototypes 2020-06-25 15:31:11 -07:00
Łukasz Wojniłowicz
ed56301abd ICU-20545 Ensure that path ends with detected file separator
CharString, when asked, appends U_FILE_SEP_CHAR at the end of the string
it holds, if it won't find U_FILE_SEP_CHAR or U_FILE_ALT_SEP_CHAR there.
The problem starts if the dir variable uses
U_FILE_ALT_SEP_CHAR which is not equal to U_FILE_SEP_CHAR. Then the
resulting path could look like this
../data\
instead of this
../data/

This patch uses U_FILE_SEP_CHAR unless it detects that the dir variable
doesn't use it, and uses U_FILE_ALT_SEP_CHAR instead.
2020-06-24 11:38:41 -07:00
Markus Scherer
ef12882fdb ICU-21144 LocaleMatcher setMaxDistance(), isMatch() 2020-06-23 13:56:49 -07:00
Andy Heninger
99dc49a0c0 ICU-20869 Fix compiler warning in FixedDecimal::getFractionalDigits().
Fix a clang compiler warning and a potential undefined behavior arising
from casting an out-of-range double to an int. See the Jira ticket for a
more detailed description of the problem.

This PR is to fix the immediate problem. Longer term, the function
may be replaced entirely - see issue ICU-21147.
2020-06-19 11:31:42 -07:00
Łukasz Wojniłowicz
cd5b025ef8 ICU-20545 Detect file separator char from dir
If udata_create won't find U_FILE_SEP_CHAR at the end of a dir variable,
then it appends it. The problem starts if the dir variable uses
U_FILE_ALT_SEP_CHAR which is not equal to U_FILE_SEP_CHAR. Then the
resulting path could look like this
../data\mappings/cns-11643-1992.ucm
instead of this
../data/mappings/cns-11643-1992.ucm

This patch uses U_FILE_SEP_CHAR unless it detects that the dir variable
doesn't use it, and uses U_FILE_ALT_SEP_CHAR instead.
2020-06-18 10:54:25 -07:00
Hugo van der Merwe
55127d6778 ICU-21165 Add LdmlConverter UNITS output, update SUPPLEMENTAL_DATA.
- Produce new supplementalData.txt and units.txt with:

      ant -f build-icu-data.xml -DoutDir=/tmp/new_dir \
          -DcldrVersion=37 -DoutputTypes=UNITS,SUPPLEMENTAL_DATA
2020-06-18 09:57:34 +02:00
Hugo van der Merwe
6a1df9e16c ICU-21169 Add SingleUnitImpl::getSimpleUnitID().
Also:
- Use BytesTrie not UCharsTrie.
- Add a nullptr check for a uprv_malloc.
2020-06-18 09:27:03 +02:00
Frank Tang
982c4799bf ICU-21161 Mark uloc_getDisplayScriptInContext static
Remove from urename.h
2020-06-17 23:26:33 -07:00
Andy Heninger
1eef362329 ICU-13565 Break Iteration, remove the dictionary bit from the implementation.
For identifying text that needs to be handled by a word dictionary for Break Iteration,
change from using a bit in the character category to sorting all dictionary categories
together, and recording the boundary between the non-dictionary and dictionary ranges.

This is internal to the implementaion. It does not affect behavior.
It does increase the number of character categories that can be handled using a
compact 8 bit Trie, from 127 to 255.
2020-06-17 12:00:14 -07:00
Frank Tang
e7bd5b1cef ICU-21109 minimum grouping digits in DecimalFormat
See #1152
2020-06-11 14:32:52 -07:00
Fredrik Roubert
0735ea8c6f ICU-21143 Applying non-zero offset to null pointer is undefined behaviour.
The result of pointer end + 1 will not be used if end is nullptr so it
doesn't really matter that the result of this operation is undefined,
but it's therefore also unnecessary to perform the operation at all.

Changing this removes this unnecessary operation and by doing so gives
the undefined behaviour sanitizer one thing less to worry about.
2020-06-04 15:13:36 +02:00
Andy Heninger
f0ad454691 ICU-13565 RBBI, make all state table row data be unsigned. 2020-06-01 20:05:17 -07:00
Jeff Genovy
723037953b ICU-21119 Enable verbose output from ICU data build when building DEBUG on Windows 2020-05-29 16:02:56 -07:00
Shane F. Carr
3ff6627ce6 ICU-21134 Copy additional data when toNumberFormatter is used
See #1156
2020-05-28 22:33:58 -05:00
Frank Tang
ec7e29f2b6 ICU-13786 Fix addLikelySubtags/minimizeSubtags
See #1140
2020-05-27 18:36:36 -07:00
Frank Tang
c5ebb80a73 ICU-13565 Reduce size of BreakIterator brk files
See #1100
2020-05-27 14:26:10 -07:00
Steven R. Loomis
4231ca5be0 ICU-21098 fix ticket URLs for logKnownIssue tickets.
- Still allows "1234" or "cldrbug:1234" format ticket IDs
- However, docs recommend "ICU-1234" or "CLDR-1234" format
in the future.
- Other ticket IDs could be used, but won't be linkified.
2020-05-20 15:58:51 -07:00
Markus Scherer
eaee0b175e ICU-21029 LocaleMatcher: add option to turn off default locale 2020-05-20 15:16:28 -07:00
Stephan Szabo
b6eb747550 ICU-10879 Split out OBJECTS from Makefiles into separate files 2020-05-20 11:37:05 -07:00
Shane F. Carr
715d254a02 ICU-21081 Make U_ASSERT C++14 compatible 2020-05-08 19:03:43 -07:00
Jeff Genovy
82a5959b86 ICU-21102 Fix broken builds on Windows when using a pre-built data file (from the tgz). 2020-05-07 14:13:06 -07:00
Peter Edberg
6fdd303532 ICU-21096 adjust logKnownIssues for ICU rbbitst 2020-05-06 17:29:49 -07:00
Peter Edberg
d39899350d ICU-21099 udat_toCalendarDateField should handle all UDateFormatFields and out of range 2020-04-28 09:58:50 -07:00
yumaoka
a951ab59c7 ICU-21094 tzdata2020a to ICU. 2020-04-25 14:17:28 -04:00
Robert Melo
440cef61a7 ICU-21071 Fix lenient parse rules
- Check non-lenient rules before call lenint parsing
- Remove logKnownIssue 9503 from test code
- Adjust TestAllLocales test on ICU4C
- Add lenient checks on ICU4J
2020-04-24 15:46:48 -03:00
Craig Cornelius
5944e1856c ICU-21052 Fix two compilations with UCONFIG_NO_CONVERSION option in t…
See #1127
2020-04-23 22:10:36 -07:00
Shane F. Carr
a5c940dfd8
ICU-21087 Merge maint/maint-67 to master 2020-04-22 20:15:39 -05:00
Peter Edberg
125e29d549 ICU-21061 integrate CLDR release-37-beta3 to master
(cherry-picked from 71fa037844)
2020-04-22 10:49:10 -07:00
Frank Tang
f0ada59042 ICU-20949 Fix compound unit in "ar", "ne" locales
Do not assume the "one" pattern always contains "{0}"
2020-04-22 10:39:01 -07:00
Shane F. Carr
b5973eee64 ICU-21061 ICU-21085 Merge latest changes from double-conversion 2020-04-21 20:31:38 -05:00
Elango Cheran
925376a904 ICU-21055 Remove test inputs causing noknownissues test run to hang 2020-04-21 12:49:30 -07:00
Craig Cornelius
2f4e2e43ce ICU-21053 Fix compile problems with UCONFIG_NO_FORMATTING = 1
See #1115
2020-04-20 16:47:36 -07:00
Frank Tang
3dce0ab5c6 ICU-21082 Fix error: no member named 'abs' in"std" 2020-04-18 00:36:20 -07:00
Hugo van der Merwe
e03fa70541 ICU-21060 Fix behaviour of -per-, -and-, and dimensionless units. 2020-04-18 00:57:02 -05:00
Peter Edberg
71fa037844 ICU-21061 integrate CLDR release-37-beta3 to master 2020-04-16 09:29:47 -07:00
Frank Tang
a91a97c0c3 ICU-21069 Fix ucptrie_swap pointer logic
See #1102
2020-04-15 14:39:08 -07:00
Markus Scherer
b9d1ba87f5 ICU-20936 copy the new direction field 2020-04-14 15:12:43 -07:00
Shane F. Carr
9b2092fa89 ICU-21073 Fix ICU Data Build Tool in "locale" mode. 2020-04-14 16:13:57 -05:00
Elango Cheran
3fb3929f80 ICU-21040 Fix segfaults in no data tests 2020-04-10 13:56:10 -07:00
Frank Tang
79248911dc ICU-21065 Move U_DEFINE_LOCAL_OPEN_POINTER
Move IU_DEFINE_LOCAL_OPEN_POINTER outside U_CDECL_BEGIN / _END
to fix conflicting declaration of C function
2020-04-07 22:01:24 -07:00
Hugo van der Merwe
cb544f47e0 ICU-21060 Fix heap-use-after-free bug. 2020-04-07 12:40:39 -05:00
Hugo van der Merwe
99f9802fec ICU-21060 Fix the foo-per-a-b -> foo-b-per-a bug. 2020-04-06 18:46:51 -05:00
Steven R. Loomis
628a935488 ICU-20979 API Change Report for 67 RC 2020-04-03 10:14:01 -07:00
Shane F. Carr
94c2c578a9 ICU-20979 Update TODOs in formatting code to point to open issues.
Also see: ICU-20920 ICU-21059 ICU-20429 ICU-21058
2020-04-03 01:57:33 -05:00
Shane F. Carr
3b0772fff9 ICU-21015 Fixing gcc compiler warnings 2020-04-03 01:56:07 -05:00
Shane F. Carr
b7facd4874 ICU-20979 Updating double-conversion from upstream master 2020-04-02 14:39:11 -05:00
Peter Edberg
d45bf0c80a ICU-20979 delete no-longer-relevant comment related to U_HIDE macros 2020-04-02 09:38:35 -07:00
Peter Edberg
b8a338fd7d ICU-20979 integrate CLDR release-37-beta2 to master 2020-04-01 18:27:51 -07:00
Peter Edberg
dcfe6c4ada ICU-20979 BRS 67rc, ICU4C API promotion 2020-04-01 16:28:42 -07:00
Daniel Ju
b718ccf17a ICU-20979 Update ICU4C readme 2020-04-01 15:43:18 -07:00
Peter Edberg
c5cabf1953 ICU-21022 Update logKnownIssue to refer to ticket for fixing in a future release 2020-04-01 15:30:37 -07:00
Jeff Genovy
822eb4e622 ICU-20979 Fixing minor MSVC warnings 2020-04-01 14:31:28 -07:00
Shane F. Carr
bda3a3e68c ICU-13724 Removing obsolete numberformat2test.cpp 2020-03-31 15:02:24 -05:00
Peter Edberg
e7995571db ICU-20979 BRS 67rc, update urename.h 2020-03-31 09:14:17 -07:00
Fredrik Roubert
879f6728f0 ICU-20803 Pass ByteSink to _canonicalize().
This eliminates the need for the fixed size scratch buffer inside of
locale_set_default_internal() and also eliminates the need for counting
bytes, something that ByteSink and CharString now will handle correctly,
when needed.

None of this should have any externally visible effect (apart from
removing the arbitrary size limit imposed by the fixed size scratch
buffer), it's all about cleaning up implementation internals.
2020-03-27 17:47:32 +01:00
Younies Mahmoud
20c29becd6 ICU-20964 Fix small typo mistake
See #974
2020-03-27 09:16:55 -07:00
Elango Cheran
72e07c0d52 ICU-21026 fix GCC warnings of signed-int left shift 2020-03-26 17:54:08 -07:00
Daniel Ju
73b50e6463 ICU-20979 BRS67RC Update version numbers, regenerate configure, update ICU4C README 2020-03-26 17:21:05 -07:00
Jeff Genovy
f78136f653 ICU-20435 Fix parallel builds with Cygwin to 3.x 2020-03-26 10:18:23 -07:00
Shane F. Carr
ac4540f8a4 ICU-20418 Number skeletons: implement star wildcard; user guide fixes
See #1060
2020-03-26 00:15:03 -05:00
Shane F. Carr
0d26f83f83 ICU-21005 Fix exhaustive test failure in IntlTestNumberFormat. 2020-03-26 00:09:31 -05:00
Shane F. Carr
b03feb6338 ICU-20920 Changing "Sequence" to "Mixed" in ICU4C MeasureUnit 2020-03-25 16:13:39 -05:00
Peter Edberg
1084c1430a ICU-21022 Use logKnownIssue to avoid TestDateFormatRoundTrip exhaustive fail 2020-03-25 10:50:42 -07:00
Shane F. Carr
dd8484d721 ICU-20919 Merge branch 'maint/maint-66' into 66-merge-branch 2020-03-25 12:42:19 -05:00
Shane F. Carr
b186f2cff6 ICU-20912 Make C/J Currency consistent on lowercase/uppercase currency equality
- Adds additional tests for Currency equality behavior
2020-03-25 12:21:34 -05:00
Peter Edberg
cfef59f0b8 ICU-13790 Add data tests to verify certain chars present in pinyin, stroke 2020-03-25 08:33:49 -07:00
Campion
b525045209 ICU-10858 Fix missing fTimeZoneFormat assignment in SimpleDateFormat::operator= (#963) 2020-03-24 20:04:35 -07:00
Peter Edberg
c163f7dc29 ICU-20979 integrate CLDR release-37-beta to master 2020-03-24 13:28:54 -07:00
Shane F. Carr
6edd38f35f ICU-20806 Removing obsolete number formatting methods.
See #1034
2020-03-24 15:21:32 -05:00
Shane F. Carr
fc6612cc56 ICU-20920 Add support for CLDR 37 unit identifiers in ICU4C
See #964
2020-03-24 14:15:19 -05:00
Steven R. Loomis
2ced2624fc ICU-20976 GCC 8 fixes phase 1
Some initial fixes for GCC 8

- setup a GCC 8 buildbot with -Wextra
- rewrite ucol_sit to use CharString
- workaround for gcc7+ on mac
  see https://github.com/arbor-sim/arbor/issues/562#issuecomment-409970434
- fix ucnv_2022 strcpy site
2020-03-24 11:51:20 -07:00
Shane Carr
1e24bcd721 ICU-20956 Fix monetary symbol getters in DecimalFormat
See #987
2020-03-23 20:12:14 -05:00
Frank Tang
f6622ab2f1 ICU-21016 Special handling of Spanish and Hebrew list format until CLDR get the data
See #1043
2020-03-19 19:36:15 -07:00
Jeff Genovy
fcb9b5579a ICU-21013 Fix OOM not checked in vtzone.cpp
Also change NULL to nullptr
2020-03-18 13:55:23 -07:00
H.J. Lu
9b4d092de2 ICU-21019 x86: Add .note.gnu.property section for Intel CET
Intel Control-flow Enforcement Technology (CET):

https://software.intel.com/en-us/articles/intel-sdm

contains shadow stack (SHSTK) and indirect branch tracking (IBT).  When
CET is enabled, ELF object files must be marked with .note.gnu.property
section.  GCC provides <cet.h> which can be included in assembly codes
to generate CET maker when compiling with -fcf-protection.
2020-03-18 10:55:22 -07:00
Yuta Saito
1a01c5edc1 ICU-20978 Reduce unnecessary building for tools when cross-building 2020-03-17 18:42:30 -07:00
Markus Scherer
1b71013da0 ICU-20916 late computation of roundedThreshold to ensure it is up to date 2020-03-17 18:28:32 -07:00
Steven R. Loomis
cb8e278ee6 ICU-20797 fix UBS compilation error and UBS in test code
Two issues here:

- fix 2 build issue in i18n when compiling with clang++ -fsanitize=undefined
the following two symbols were not exported (and they should be):
  typeinfo for icu::CollationCacheEntry
  typeinfo for icu::numparse::impl::CodePointMatcher

- remove undefined behavior warning in NumberFormatTestTuple.. minor, but very annoying
when repeated many times during every test run. Tends to mask real errors.

> numberformattesttuple.cpp:319:5: runtime error: member access within null pointer of type 'NumberFormatTestTuple'
2020-03-17 09:11:58 -07:00
Steven R. Loomis
f63a8bb4e7 ICU-20600 build icu-data-bin files
make dist will now produce:
- icu4c-67.1-data-bin-b.zip
- icu4c-67.1-data-bin-l.zip
2020-03-17 08:30:33 -07:00
Markus Scherer
524748c6bf ICU-20984 StringPiece & ByteSink overloads for char8_t* 2020-03-16 10:49:21 -07:00
Markus Scherer
d2d144a5bd ICU-21009 LocaleMatcher() order[] always set all elements 2020-03-14 14:07:29 -07:00
Jeff Genovy
5c8b142a92 ICU-21014 Fix OOM not checked in dtitvfmt.cpp and dtitvinf.cpp 2020-03-13 18:05:46 -07:00
Peter Edberg
69b3523593 ICU-20987 integrate CLDR release-37-alpha3 to master, adjust MeasureUnitTest.java tools 2020-03-13 12:01:39 -07:00
Shane F. Carr
2d83fc2278 ICU-20809 Remove FieldPositionIterator from listformatter.h 2020-03-11 21:13:45 -05:00
Frank Tang
9aed97703e ICU-21011 Debug print detail size of brk files.
To print it,
$ CXXFLAGS="-DRBBI_DEBUG" ./runConfigureICU --enable-debug --disable-release  Linux/gcc --disable-layoutex
$ make clean
$ U_RBBIDEBUG="size" make
2020-03-11 11:36:28 -07:00
Peter Edberg
d6eabe4155 ICU-20879 fix typo in tests, calender → calendar 2020-03-10 23:06:21 -07:00
Markus Scherer
d490170520 ICU-20700 uloc_acceptLanguage API doc update 2020-03-10 08:23:40 -07:00
Markus Scherer
72cd937620 ICU-20936 add LocaleMatcher.Builder.setDirection(with-one-way vs. only-two-way) 2020-03-10 08:22:28 -07:00
shaobero
bdba9c93e4 ICU-20945 Making icudt66.dll for x64 as PE32+ 2020-03-09 10:28:50 -07:00
Markus Scherer
d2ea4513dc ICU-20700 reimplement acceptLanguage() using the LocaleMatcher; replace older accept-language-string parsing by LocalePriorityList 2020-03-08 08:01:31 -07:00
Markus Scherer
3edff03393 ICU-20915 LocaleMatcher no match: always getSupportedIndex()=-1; remove defaultLocaleIndex field; constructor check if locales are equivalent to default, not just equal; simplify locale sorting; minor builder & test deflaking 2020-03-08 07:54:46 -07:00
Frank Tang
94c9ff2089 ICU-20991 Trace BreakIterator/BreakEngine creation
See #1014
2020-03-06 14:18:43 -08:00
Shane F. Carr
01523b4da6 ICU-20974 Fix exhaustive test failures 2020-03-06 01:34:51 -08:00
Jeff Genovy
7302079653 ICU-21000 Fix abort called by DateTimePatternGenerator::getDefaultHourCycle
If you call the API getDefaultHourCycle on an empty DateTimePatternGenerator
instance (ie: no locale) then it calls UPRV_UNREACHABLE which calls abort().
We should return an error code instead of aborting.
2020-03-05 18:19:04 -08:00
Jeff Genovy
ce7e060d50 ICU-21001 Fixing problems found by running valgrind.
This makes fixes in order to run the icu4c tests (intltest, cintltst,
iotest, and icuinfo) cleanly under valgrind with --leak-check=full.
2020-03-05 14:34:20 -08:00
Jeff Genovy
bd08ba2c5b ICU-21004 Fix buffer over-read in ucal_open
The issue shows under valgrind or as an Address Sanitizer failure.
2020-03-05 14:09:34 -08:00
Shane Carr
0b7f6b1864 ICU-20974 Correctly handle extreme values of double. 2020-03-05 13:40:59 -08:00
Peter Edberg
ad00acac12 ICU-20980 integrate CLDR release-36-1 (final) to maint-66 2020-03-05 13:25:35 -08:00
Frank Tang
be3ee4cc63 ICU-20967 add millisecond to DateIntervalFormat
See #978
2020-03-05 10:55:19 -08:00
Shane Carr
e572de5516 ICU-20961 Return correct currency plural pattern from DecimalFormat 2020-03-04 19:43:57 -08:00
Daniel Ju
6119662f03 ICU-20980 BRS66GA Update version numbers and README files 2020-03-04 11:12:47 -08:00
Peter Edberg
0778144ff6 ICU-20987 integrate CLDR release-37-alpha2 to master (new tools) 2020-03-03 15:01:28 -08:00
Peter Edberg
63e480dedc ICU-20987 integrate CLDR release-37-alpha1 to master (using new tooling) 2020-02-27 16:27:50 -08:00
Jeff Genovy
e1aa9119ad ICU-20975 BRS66rc Fix warnings found when building ICU4C samples. 2020-02-26 13:04:08 -08:00
Shane Carr
bb1f00efb8 ICU-20919 Merge branch 'maint/maint-66' into maint-66-merge
Conflicts:
	icu4j/main/shared/data/icudata.jar
2020-02-21 18:21:05 -08:00
Laurent Stacul
3b58179396 ICU-20972 Fix invalid conversion from const char8_t* to const char* (C++20) 2020-02-20 13:09:18 -08:00
Markus Scherer
af9ef2650b ICU-20893 Unicode 13 data 2020feb19 2020-02-19 22:02:35 -08:00
Jeff Genovy
77fcded28b ICU-20969 Fix file permissions (-x) on ICU4C source files. 2020-02-19 17:00:06 -08:00
Peter Edberg
d95621c57f ICU-20975 BRS66rc update Unicode license copyright 2020-02-19 15:23:29 -08:00
Vincent Torri
caa25fd878 ICU-20842 Fix library names in pkg-config files on Windows (mingw-w64)
since the move of the DLL to bin/ the library names in .pc files is
wrong. With ICU 65.1, icu-uc.pc contains

Libs: -L${libdir} -licuuc65 -licudt65

the version number should not appear. Indeed, the linker looks for the
libraries in $prefix/lib in the following order (see [1]):

libxxx.dll.a
xxx.dll.a
libxxx.a
cygxxx.dll
libxxx.dll
xxx.dll

As the is only the import library with no versioning (which is normal),
the is a link error when using ICU pc files.

[1] https://access.redhat.com/documentation/en-US/Red_Hat_Enterprise_Linux/4/html/Using_ld_the_GNU_Linker/win32.html
2020-02-19 14:57:59 -08:00
Steven R. Loomis
532954ba14 ICU-20975 update API Change Report for 66
- Compared against ICU4C 65.1
- no substantive change. Just dropped 'preview'
2020-02-19 14:56:41 -08:00
Mojca Miklavec
4a3a457b38 ICU-20852 Fix Makefile to build on Sparc Solaris 2020-02-19 12:42:40 -08:00
Peter Edberg
ea907cdf5f ICU-20975 integrate CLDR release-36-1-beta to maint-66 2020-02-18 23:02:26 -08:00
Andy Heninger
14bcaaf58e ICU-20876 Regex Grapheme Cluster matching with Break Iterators.
Change the implementation of grapheme cluster matching in regex to use an ICU
break iterator instead of a little one-off state machine.

The old implementation had fallen behind the Unicode UAX-29 specification for
graphem clusters, and could not be easily updated.

The implementation follows the same general pattern that is used for finding
word boundaries with an ICU break iterator. In reviewing that code, a few
improvements to the handling of ICU error codes were also made.

Also note that this change adds a new dependency on Break Iteration.  Regex
patterns that previously would work with ICU builds that were configured with
no break iteration will now fail. But only if they include \X for matching
grapheme cluster boundaries.
2020-02-18 18:28:10 -08:00
Daniel Ju
925f255092 ICU-20975 BRS66RC Update version numbers, README files, configure 2020-02-18 14:40:46 -08:00
Jeff Genovy
783ee4c902 ICU-20322 On MinGW, move the DLLs to the "bin" directory.
This change builds on Vincent Torri's changes.

This installs the ICU DLL files in $prefix/bin instead of $prefix/lib.

Note: In order to disable this change in behavior you can edit
the "mh-mingw*" file(s). If you set the variable MINGW_MOVEDLLSTOBINDIR
to NO instead of YES, then it will retain the previous behavior of
installing the DLLs into the bin folder.
2020-02-12 14:19:15 -08:00
Vincent Torri
5fbb4d3a5f ICU-20322 Install dll in $prefix/bin instead of $prefix/lib 2020-02-12 14:19:15 -08:00
Andrew Paprocki
cd41cba82a ICU-20895 ICU_TIMEZONE_FILES_DIR_PREFIX_ENV_VAR
Adds `ICU_TIMEZONE_FILES_DIR_PREFIX_ENV_VAR`, similar to
`ICU_DATA_DIR_PREFIX_ENV_VAR`, that specifies an environment variable
to retrieve and prepend to the ICU time zone data file path.
2020-02-12 12:45:12 -05:00
Frank Tang
6ea0fc7713 ICU-20834 Implement UTS35 Locale ID Canonicalization
See #951
2020-02-11 22:44:39 -08:00
Mihai Nita
dd50e38f45 ICU-20738 Best-match pattern for 'sS' uses <appendItem> data 2020-02-10 07:59:52 -08:00
Shane Carr
9eca171a39 ICU-20954 Fix currency spacing in suffix. 2020-02-06 09:56:32 -08:00
Elango Cheran
1a9fb8ec33 ICU-13836 C++ port of adding exponent for better plurals for compact decimal format 2020-02-05 09:08:48 -08:00
Andy Heninger
d6b88d49e3 ICU-20939 Fix problem w regexp \b boundaries & UTF-8 text
In regular expressions, when testing for word boundaries with \b, the
boundaries were incorrect when in Unicode mode, meaning that an ICU word break
iterator is being used to find the boundaries, and the text being matched is
UTF-8 encoded.

The bug stemmed from a misunderstanding of how string indexes work with UText
and break iterators, leading to the inclusion of code to convert from UTF-8 to
UTF-16 indexing, when what was wanted was the original UTF-8 index everywhere.
Removing the indexing conversion fixes the problem.
2020-02-03 16:51:17 -08:00
Frank Tang
b7d08bc04a ICU-20958 Prevent SEGV_MAPERR in append
See #971
2020-02-03 13:22:30 -08:00
Andy Heninger
54a60fe6f4 ICU-11548 Improve regex static UnicodeSets handling
Compiled regular expression patterns make use of several shared common
UnicodeSets. This change simplifies the creation and use of these
static UnicodeSets.

- Pointer fields to the static sets are removed from the compiled patterns,
  and the static variables are accessed directly. The deleted pointers
  were a hold-over from earlier code that did not use shared statics.

- The UnicodeSet pattern literals are changed from hex constants to
  u"string literals".

- The size of fRuleSets (from regexst.h) is changed from a hard-coded 10
  to the number of UnicodeSets actually required. Doing this required
  a change to regexcst.pl to export the required size. Changing and
  rerunning this perl code resulted in massive but benign changes to
  the generated file regexcst.h, the result of perl having changed its
  order of enumeration of hashes since the file was last regenerated.

- UnicodeSets are frozen when possible. Should result in faster matching.
2020-01-30 15:13:07 -08:00
Egor Pugin
76f190024d ICU-20938 Add --skip-dll-export option to genccode to prevent exporting statically linked ICU data from executables. 2020-01-23 12:00:29 -08:00
Keita Suzuki
a4a5c603ac ICU-20767 Potential negative index access in one of the sample codes 2020-01-22 13:13:27 -08:00
Frank Tang
7a5139ad95 ICU-20934 Fix TZ test error
Somehow these tests are now fail on trunks.
Per https://mm.icann.org/pipermail/tz-announce/2019-July/000056.html
     Brazil has canceled DST and will stay on standard time indefinitely.

Cherry-picked from: 11ad8d69fb
2020-01-20 14:58:55 +01:00
Shane Carr
8c717b514e ICU-20665 Removing number-dependence from ICU4C FormattedStringBuilder fields.
See #727
2020-01-17 11:22:02 +01:00
Frank Yung-Fong Tang
21df05234d ICU-20673 Allow built-in translit ID w/o data.
See #958
2020-01-16 21:28:01 -08:00
Shane Carr
0ad2f9590b ICU-20418 Fix indentation of CHECK_NULL in number_skeletons.cpp 2020-01-14 11:52:27 +01:00
Shane Carr
fe98d870b2 ICU-20418 Adding concise number skeletons in ICU4C 2020-01-14 11:52:27 +01:00
Shane Carr
df8841aa6f ICU-20418 Adding *internal* parse method for core unit identifiers.
Also see ICU-20286
2020-01-14 11:52:27 +01:00
Shane Carr
b24538eb05 ICU-20921 Adding find and compare to StringPiece 2020-01-14 11:52:27 +01:00
Joshua Root
a3078fb8c8 ICU-20875 Include <cstddef> for max_align_t
The definition of max_align_t is not guaranteed to be available unless
the appropriate header is included. Since use of <stddef.h> from C++ is
deprecated, that's <cstddef>, and max_align_t is thus defined under the
std namespace rather than in the global namespace.
2020-01-09 15:42:52 -08:00
Caio Lima
09d409f5f4 ICU-20442 Adding support for hour-cycle on DateTimePatternGenerator
DateTimePatternGenerator needs to consider the hour-cycle preferred by
Locale. This means that we need to to override the hour-cycle when a
locale contains "hc" keyword. This patch is adding such functionality.
In addition, "DateTimePatternGenerator::adjustFieldTypes" should adjust
hour field to properly follow tr35
spec(https://www.unicode.org/reports/tr35/tr35-dates.html#dfst-hour).
2020-01-09 16:45:56 +01:00
Smaarn
996da8faac ICU-20871 Fixed: no rule was defined to create the $(OUTDIR) directory if it didn't exist.
This would cause failures during cross compilation cases such as:

make[6]: Leaving directory '/spksrc/spk/bazarr/work-qoriq-6.1/icu/source/data'
make[5]: *** No rule to make target 'out', needed by 'out/icudt64b.dat'.  Stop.
2020-01-08 15:42:35 +01:00
Hugh McMaster
5aae52d3ef ICU-20924 Use pkg-config to generate the path to pkgdata.inc 2020-01-07 14:19:02 -08:00
Frank Tang
11ad8d69fb ICU-20934 Fix TZ test error
Somehow these tests are now fail on trunks.
Per https://mm.icann.org/pipermail/tz-announce/2019-July/000056.html
     Brazil has canceled DST and will stay on standard time indefinitely.
2020-01-03 20:52:11 -08:00
Frank Tang
4a8483be91 ICU-20900 Fix createCanonical
See #922
2020-01-03 15:00:04 -08:00
Markus Scherer
60b567d6ab ICU-20917 LocaleMatcher: prefer a more-default locale 2020-01-02 18:00:52 -08:00
Frank Tang
79fac50101 ICU-20310 omit "-true" in toLanguageTag
See #952
2019-12-30 15:39:59 -08:00
Markus Scherer
cb1d4f5903 ICU-20916 UBSan & ErrorProne fixes 2019-12-20 14:56:31 -08:00