Commit Graph

804 Commits

Author SHA1 Message Date
Markus Scherer
acf2b4cc82 ICU-13186 stop prepending UTF-8 BOM to some Unicode files
X-SVN-Rev: 40149
2017-06-02 22:52:19 +00:00
Markus Scherer
b2ead3e2e1 ICU-8130 UTS 46 conformance test using Unicode IdnaTest.txt
X-SVN-Rev: 40130
2017-05-23 04:44:58 +00:00
Markus Scherer
cb722b8bcd ICU-12985 Unicode 10 minor data update
X-SVN-Rev: 40104
2017-05-03 23:26:47 +00:00
Markus Scherer
edce2be62c ICU-12985 Unicode 10 data 20170418, new property values, adjust tools & tests
X-SVN-Rev: 40079
2017-04-26 21:17:13 +00:00
Shane Carr
66c49f8887 ICU-13115 Reject decimal format patterns that have a trailing ','
X-SVN-Rev: 40068
2017-04-19 23:42:18 +00:00
Shane Carr
1b0ce88028 ICU-13117 Ignore trailing semicolon in decimal format patterns.
X-SVN-Rev: 40067
2017-04-19 22:29:28 +00:00
Shane Carr
29490300c1 ICU-13149 Adding test cases for ICU 59 gray area bugs to numberformattestspecification.txt.
X-SVN-Rev: 40058
2017-04-15 08:09:15 +00:00
Shane Carr
8016edea2a ICU-13149 Updating numberformattestspecification.txt with ICU4C behaviors and bringing the two copies of the file into sync.
X-SVN-Rev: 40057
2017-04-15 06:30:05 +00:00
Andy Heninger
9ed9930b61 ICU-12160 Add regex test to verify behavior of hitEnd() after find().
X-SVN-Rev: 39978
2017-03-31 23:09:56 +00:00
Andy Heninger
77416a6e7c ICU-13058 Break iteration, propagate emoji rule updates into remaining unfixed rule tailorings.
X-SVN-Rev: 39925
2017-03-24 05:54:13 +00:00
Andy Heninger
af55f69558 ICU-13058 Break iteration tests & rules update for new prop data. Tests are now passing, but changes are not yet propagated into all rule tailorings.
X-SVN-Rev: 39922
2017-03-24 01:31:00 +00:00
Markus Scherer
e9ae3ff964 ICU-12900 copyright scanner has been fixed, remove the extra ICU-style copyright lines from Unicope data files
X-SVN-Rev: 39920
2017-03-23 17:01:58 +00:00
Steven R. Loomis
02d7708876 ICU-13058 copyright check
X-SVN-Rev: 39919
2017-03-23 15:49:17 +00:00
Markus Scherer
466a569c58 ICU-12900 mostly still Unicode 9.0 but Unicode 10 beta (20170322) segmentation & bidi data and draft emoji 5.0 (also 20170322)
X-SVN-Rev: 39915
2017-03-23 02:14:00 +00:00
Andy Heninger
51e21b5242 ICU-13058 Add RBBI test of Unicode emoji-test.txt file, and partial update of break rules recent emoji changes.
X-SVN-Rev: 39909
2017-03-23 00:20:20 +00:00
Andy Heninger
170693e51e ICU-13012 Remove stray ^S character from line_normal_cj.txt.
X-SVN-Rev: 39786
2017-03-13 23:17:12 +00:00
Peter Edberg
04c115425d ICU-13010 Add Extend* to rule GB11′, update tests and add more emoji cluster tests
X-SVN-Rev: 39726
2017-03-02 21:04:09 +00:00
Andy Heninger
8e5c017e03 ICU-12930 Fix assertion failure in regex compile.
X-SVN-Rev: 39663
2017-02-10 23:30:24 +00:00
Peter Edberg
e0f3d87e99 ICU-12948 First-pass preliminary integration of CLDR 31 @13174
X-SVN-Rev: 39601
2017-01-24 08:22:23 +00:00
Peter Edberg
850118a0e0 ICU-12948 Add new elements in structLocale
X-SVN-Rev: 39596
2017-01-22 09:23:49 +00:00
Peter Edberg
183b124712 ICU-12940 Add "ars" as alias to "ar_SA" locale, to "ar" for coll/rbnf
X-SVN-Rev: 39587
2017-01-20 20:07:27 +00:00
Yoshito Umaoka
04b460f204 ICU-12779 Added/fixed Unicode copyright comments, commiting changes on behalf of Norbert.
X-SVN-Rev: 39459
2016-10-17 21:55:45 +00:00
Peter Edberg
5abe054173 ICU-12801 integrate CLDR release-30-0-1 into icu4c trunk
X-SVN-Rev: 39445
2016-10-13 06:08:08 +00:00
Norbert Runge
0ca4234c58 ICU-12761 Adds Unicode copyright notice.
X-SVN-Rev: 39388
2016-09-28 22:12:27 +00:00
Steven R. Loomis
2bf1e2f0c1 ICU-12701 more IBM i test fixes.
* intltest now builds.
* fix for unpax-icu.sh to include the utf-16be file.
* added an emacs coding line to 'encoded.utf16be'

X-SVN-Rev: 39304
2016-09-21 00:07:32 +00:00
Peter Edberg
f82d62a85c ICU-12729 CLDR release-30-d02 integration, merge icu4c branch into trunk
X-SVN-Rev: 39273
2016-09-19 05:09:40 +00:00
Steven R. Loomis
dea458fef7 ICU-12515 C filtered break
* sync rbbitst.txt with J
* fix an issue where isBoundary() didn't check the trie's presence

X-SVN-Rev: 39211
2016-09-13 19:58:55 +00:00
Andy Heninger
ce42f64b31 ICU-12664 Break rules update for revised Emoji ZWJ sequences.
X-SVN-Rev: 39100
2016-08-30 21:40:09 +00:00
Markus Scherer
962c595814 ICU-12669 draft Emoji 4.0 properties data
X-SVN-Rev: 39067
2016-08-18 21:08:23 +00:00
Steven R. Loomis
0c5b2b597d ICU-12455 BRS - BOM fix
X-SVN-Rev: 38915
2016-07-01 16:59:16 +00:00
Peter Edberg
b45dac8cfc ICU-12455 Integrate early CLDR 30 data for ICU4C 58m1
X-SVN-Rev: 38905
2016-06-30 23:41:56 +00:00
Michael Ow
a1d4c18ac4 ICU-12564 Fix bom in testdata file
X-SVN-Rev: 38873
2016-06-22 18:10:35 +00:00
Andy Heninger
482cd252bd ICU-12526 Regenerated Unicode LineBreakTest.txt file from Laurentiu.
X-SVN-Rev: 38867
2016-06-19 01:34:46 +00:00
Michael Ow
8d7b93d89f ICU-12564 Fix some bom issues in the testdata
X-SVN-Rev: 38854
2016-06-16 20:57:46 +00:00
Michael Ow
61607c2773 ICU-12564 Update copyright notice in trunk
X-SVN-Rev: 38848
2016-06-15 18:58:17 +00:00
Markus Scherer
1a6d70fa44 ICU-12526 Unicode 9 data 2016-jun-09
X-SVN-Rev: 38820
2016-06-10 00:01:46 +00:00
Yoshito Umaoka
00ca13e126 ICU-12564 Reverted r38761 and r38762, because we want to prepend the Unicode copyright for existing source files, instead of replacing copyright comments.
X-SVN-Rev: 38776
2016-05-31 21:45:07 +00:00
Michael Ow
c9f199a30f ICU-12564 Update copyright notice in ICU4C
X-SVN-Rev: 38761
2016-05-26 22:32:17 +00:00
Markus Scherer
5e69db5c2f ICU-12526 integrate Unicode 9 beta
X-SVN-Rev: 38753
2016-05-19 22:48:18 +00:00
Andy Heninger
25a04f741a ICU-10698 Test word break of 'What is Unicode' in Japanese, resolve C vs. J differences.
X-SVN-Rev: 38699
2016-05-04 23:55:22 +00:00
Andy Heninger
2e088aff9c ICU-11723 Test dictionary breaking of 'アレルギー性結膜炎'
X-SVN-Rev: 38692
2016-05-03 22:44:32 +00:00
Andy Heninger
66537179d7 ICU-11996 CJKBreakEngine divideUpDictionaryRange, pick up test case from ICU4J.
X-SVN-Rev: 38678
2016-04-29 23:51:24 +00:00
Andy Heninger
0338b5470a ICU-11999 BreakIterator, UnhandledBreakEngine consuming too many characters. Updated test file from ICU4J.
X-SVN-Rev: 38670
2016-04-29 21:32:46 +00:00
Andy Heninger
7265eeae4c ICU-11556 rbbitst.txt test data file, add explicit locale.
X-SVN-Rev: 38644
2016-04-25 18:10:08 +00:00
Andy Heninger
ac9c717990 ICU-11556 Line Break rules update for L2/16-043R, don't break CA$; also LB rules refactored for reduced memory consumption.
X-SVN-Rev: 38643
2016-04-22 23:07:12 +00:00
Stuart Gill
d64e688fec ICU-11621 modify test data file to parallel changes in Java
X-SVN-Rev: 38541
2016-03-18 19:55:45 +00:00
Andy Heninger
5184ca0bee ICU-12373 Fix definition grapheme break rules; update to latest LineBreakTest file.
X-SVN-Rev: 38520
2016-03-15 18:36:49 +00:00
Michael Ow
b9f9d3a9e1 ICU-12187 Fix UTF-8 file properties in ICU4C 57
X-SVN-Rev: 38437
2016-02-29 20:04:41 +00:00
Andy Heninger
9d9256f3b7 ICU-12081 Initial implementation Emoji break rules and a new RBBI monkey test.
X-SVN-Rev: 38387
2016-02-26 21:58:26 +00:00
John Emmons
f7d261ebbc ICU-12091 Merge preliminary CLDR 29 data for 57m1
X-SVN-Rev: 38287
2016-02-05 04:12:07 +00:00
Michael Ow
a197e45227 ICU-12083 Fix bom in icu4c 57m1
X-SVN-Rev: 38246
2016-02-01 17:14:55 +00:00
Andy Heninger
8dba7301b7 ICU-11554 Fix regex bug with look-behind matching & UTF-8 input.
X-SVN-Rev: 38056
2015-10-09 20:01:46 +00:00
Andy Heninger
9b8403900a ICU-11507 Regex, fix problem with backrefs of unpaired surrogates.
X-SVN-Rev: 38038
2015-10-06 23:59:28 +00:00
Michael Ow
04b5b54998 ICU-11869 Update 2022 code in ICU4C
X-SVN-Rev: 38014
2015-09-23 20:49:25 +00:00
Markus Scherer
69543d559e ICU-11016 add support for minimum grouping digits (technology preview, not using CLDR data yet)
X-SVN-Rev: 37946
2015-09-11 19:18:00 +00:00
Steven R. Loomis
98f5987b43 ICU-11248 use '@ss=' and not x-uli
remove an old test hack.

X-SVN-Rev: 37940
2015-09-10 07:00:30 +00:00
Travis Keep
b49ad21ea8 ICU-11740 Final sync of C and J copies of data driven tests.
X-SVN-Rev: 37910
2015-09-09 00:09:32 +00:00
Michael Ow
784e59b2ae ICU-11869 Update jis8 test for edge case
X-SVN-Rev: 37907
2015-09-08 22:37:30 +00:00
Travis Keep
bc33f20e3c ICU-11740 Add more tests to prove that this change resolves other tickets.
X-SVN-Rev: 37843
2015-08-28 18:38:37 +00:00
John Emmons
01e0c332a5 ICU-11846 CLDR 28 data integration
X-SVN-Rev: 37836
2015-08-27 20:23:27 +00:00
Travis Keep
b7cf3ca31c ICU-11740 Disallow copy and assign on VisibleDigitsWithExponent enable data driven tests that were known to fail with the legacy DecimalFormat code but pass with the new DecimalFormat code.
X-SVN-Rev: 37832
2015-08-26 21:15:37 +00:00
Travis Keep
2860cd48c4 ICU-11740 DecimalFormat refactor.
X-SVN-Rev: 37821
2015-08-25 23:31:01 +00:00
Steven R. Loomis
f87d28cfd2 ICU-11248 merge to trunk: FilteredBreakIteratorBuilder work
* passes rbbi extended tests
* uses <locale en@x-uli=true> in rbbitst.txt,
so added a "known issue" for this when en@ss=standard will suffice.

X-SVN-Rev: 37721
2015-08-05 00:03:18 +00:00
Peter Edberg
2ae320dbdf ICU-11673 Add new Japanese name for Georgia to cjdict
X-SVN-Rev: 37608
2015-06-23 02:09:49 +00:00
Peter Edberg
d88c68d067 ICU-11688 Add Thai words for "update" and "event" to dictionary
X-SVN-Rev: 37606
2015-06-23 00:44:09 +00:00
John Emmons
75ed4ce808 ICU-11728 First cut CLDR 28 data integration
X-SVN-Rev: 37524
2015-06-10 18:38:06 +00:00
Travis Keep
9672f10d99 ICU-10458 Add more data driven tests for parsing currency.
X-SVN-Rev: 37507
2015-06-09 18:01:13 +00:00
Travis Keep
df332d3722 ICU-10458 Keep data driven tests in sync.
X-SVN-Rev: 37496
2015-06-03 17:10:45 +00:00
Travis Keep
5b1be7f0d4 ICU-11692 Keep test data files in sync.
X-SVN-Rev: 37480
2015-05-29 23:58:03 +00:00
Markus Scherer
fc290db4c1 ICU-11574 near-final Unicode 8 data with new block API constants, code adjustments & fixes; svn merge --reintegrate branches/markus/uni80
X-SVN-Rev: 37473
2015-05-28 23:16:00 +00:00
Travis Keep
762078d2f1 ICU-11692 Add DecimalFormat data driven tests.
X-SVN-Rev: 37471
2015-05-28 20:09:43 +00:00
Michael Ow
c9a8c21d51 ICU-11595 Merge CLDR update to trunk
X-SVN-Rev: 37293
2015-03-27 17:25:30 +00:00
John Emmons
26a401e17a ICU-10750 Remove obsolete files from source/data/xml
X-SVN-Rev: 37187
2015-03-07 16:06:51 +00:00
John Emmons
368eb4bb16 ICU-11555 Integrate CLDR 27 data
X-SVN-Rev: 37169
2015-03-06 22:58:33 +00:00
Peter Edberg
00038112bb ICU-11019 C: Add Thai words for "browser" and "post" to dictionary
X-SVN-Rev: 37126
2015-03-04 07:11:04 +00:00
Markus Scherer
7115587929 ICU-11535 test genrb -e with UTF-16BE not ISCII
X-SVN-Rev: 37082
2015-02-26 22:48:16 +00:00
Peter Edberg
43f62124cd ICU-9379 C: Update BreakIterator createInstance to handle linebreak variant files; update tests
X-SVN-Rev: 37059
2015-02-24 22:37:10 +00:00
Andy Heninger
ce09d8a4bc ICU-11393 Regex, add pattern chars R v and h
X-SVN-Rev: 37057
2015-02-24 00:24:59 +00:00
Andy Heninger
ec3f77f878 ICU-5312 Regular Expressions Named Capture.
X-SVN-Rev: 37040
2015-02-18 23:56:19 +00:00
Markus Scherer
a9d7c3e4bd ICU-11449 reorder single scripts not groups, scripts/groups can start on top-16-bit boundaries, data formatVersion 5 for new scripts data and optional reorderRanges appended to reorderCodes
X-SVN-Rev: 36924
2015-01-07 03:37:11 +00:00
Yoshito Umaoka
cbe7c4983b ICU-11466 Added a word break test case for Hangul, starting with Latin text. Such case did not work well with ICU4J 52, but works fine with other ICU versions.
X-SVN-Rev: 36915
2015-01-06 18:57:38 +00:00
Markus Scherer
590a85cbf1 ICU-11448 collation common tertiary weights for normal Hiragana, change builder to cope with below-common tertiary weights
X-SVN-Rev: 36906
2014-12-27 00:38:34 +00:00
John Emmons
0d0a369333 ICU-11060 Fix naming convention for @alt values from CLDR
X-SVN-Rev: 36886
2014-12-17 23:25:39 +00:00
John Emmons
ad5308aa12 ICU-11417 CLDR 27 data integration for ICU55M1
X-SVN-Rev: 36850
2014-12-11 16:10:49 +00:00
Andy Heninger
b3bd196383 ICU-11373 Regex, check for integer overflow in maxMatchLength computation.
X-SVN-Rev: 36794
2014-12-02 19:24:31 +00:00
Andy Heninger
c9b7734a0e ICU-11373 disable prematurely checked in test case.
X-SVN-Rev: 36793
2014-12-02 17:10:04 +00:00
Andy Heninger
2e6a8c568c ICU-11374 Regular Expression, improve checking of integer overflow.
X-SVN-Rev: 36792
2014-12-02 17:05:27 +00:00
Markus Scherer
3cdd2bafe2 ICU-11388 regression test from Java
X-SVN-Rev: 36782
2014-11-30 19:56:32 +00:00
Markus Scherer
5d211663e6 ICU-11375 ICU root collation: remove Cyrillic contractions
X-SVN-Rev: 36735
2014-11-17 00:41:41 +00:00
Andy Heninger
f5d14979c6 ICU-11369 Regex, fix incorrect optimization of patterns with a zero length quantifier {0}
X-SVN-Rev: 36727
2014-11-12 23:42:14 +00:00
Andy Heninger
9db433b745 ICU-11370 Regex, add missing range check of length of a look-behind expression.
X-SVN-Rev: 36724
2014-11-12 23:10:21 +00:00
Peter Edberg
d87c86274c ICU-10326 Add dictionary-based word/line break for Burmese/Myanmar
X-SVN-Rev: 36397
2014-09-08 22:16:21 +00:00
Peter Edberg
602bb30ae4 ICU-10872 Fix en_US_POSIX word break for colon (C)
X-SVN-Rev: 36381
2014-09-07 07:05:59 +00:00
Andy Heninger
746aadf44a ICU-10942 Time Zone data loading, merge from branch.
X-SVN-Rev: 36342
2014-09-04 01:07:38 +00:00
Peter Edberg
1b8eb15e1a ICU-11173 CLDR tags/release-26-d01 into ICU4C trunk with related test & lib code updates
X-SVN-Rev: 36313
2014-09-02 23:18:20 +00:00
Markus Scherer
ec55298e96 ICU-11042 Han radical-stroke order data in the root collator instead of CJK tailorings, adjusted code & tests (merged from branches/markus/rootunihan)
X-SVN-Rev: 36162
2014-08-14 18:47:27 +00:00
Andy Heninger
e03585d7cf ICU-11049 fix regex find() memory overrun.
X-SVN-Rev: 36124
2014-08-06 21:49:08 +00:00
Markus Scherer
9bc6f268fb ICU-10998 collation reorder others=Zzzz, recognize only others = reset
X-SVN-Rev: 36017
2014-07-09 22:53:09 +00:00
Markus Scherer
77b420f820 ICU-8962 test & fix collation import from same bundle & from und=root
X-SVN-Rev: 36009
2014-07-08 01:10:13 +00:00
Markus Scherer
b3cb5dcc86 ICU-8260 bug fixes
X-SVN-Rev: 35766
2014-05-28 23:03:52 +00:00