Commit Graph

138 Commits

Author SHA1 Message Date
Andy Heninger
80dde4d8de ICU-4850 Sentence break rule update. Use rule chaining. Conform to TR29.
X-SVN-Rev: 18586
2005-09-26 23:56:32 +00:00
Andy Heninger
d733d65d28 ICU-4269 rbbi sentence break monkey test & rule updates. Work in progress, sentence breaks not in good shape now.
X-SVN-Rev: 18534
2005-09-15 23:23:24 +00:00
Andy Heninger
d51e8e8a35 ICU-4766 rbbi word rules & tests updated for Unicode 4.1 handling of trailing format chars
X-SVN-Rev: 18510
2005-09-10 03:52:54 +00:00
Eric Mader
5dc3d7c9d4 ICU-4561 Update copyright notices for ICU 3.4
X-SVN-Rev: 17822
2005-06-07 23:38:09 +00:00
Andy Heninger
9ce9eda382 ICU-4157 Fix line_th rbbi rules to use new line break char properties.
X-SVN-Rev: 17501
2005-04-23 00:03:52 +00:00
Andy Heninger
504c1776d5 ICU-4157 Add text boundary test files from the Unicode site.
X-SVN-Rev: 17500
2005-04-22 21:49:52 +00:00
Andy Heninger
2b714f2bce ICU-4157 Word Break, fix problem with CR <combining> sequences
X-SVN-Rev: 17427
2005-03-31 01:45:27 +00:00
Andy Heninger
cd85b65d35 ICU-4157 Unicode 4.1 RBBI rule updates + required implementation fixes
X-SVN-Rev: 17376
2005-03-23 02:13:53 +00:00
Eric Mader
41ca4f63ee ICU-4428 update copyright notices for ICU 3.3.
X-SVN-Rev: 17296
2005-03-08 22:49:59 +00:00
Eric Mader
fc2dbe2d00 ICU-4216 updated Thai break dictionary from thai7.ucs.
X-SVN-Rev: 17201
2005-02-15 19:44:09 +00:00
Andy Heninger
448c2e114b ICU-4157 RBBI Rule updates for Unicode 4.1
X-SVN-Rev: 17118
2005-01-13 23:42:12 +00:00
Andy Heninger
5cf9c75c52 ICU-4157 4.1 RBBI changes. Stub out TestLineBreaks, which is looping; real fix to come later.
X-SVN-Rev: 17106
2005-01-11 00:49:22 +00:00
Eric Mader
b1f7808255 ICU-4179 fix copyright notices so that the cpyscan tool recognizes them.
X-SVN-Rev: 16710
2004-11-01 23:54:44 +00:00
Andy Heninger
244f4e3ac2 ICU-4157 Word Break, undo failed attempt at a branch for TR29 proposed updates
X-SVN-Rev: 16625
2004-10-25 23:50:11 +00:00
Andy Heninger
d6a3a3e9f5 ICU-4157 Word Break, TR29 proposed changes for Unicode 4.1
X-SVN-Rev: 16623
2004-10-25 23:41:39 +00:00
Deborah Goldsmith
225c380bde ICU-3561 Locale-based text boundaries
X-SVN-Rev: 16582
2004-10-21 01:03:01 +00:00
George Rhoten
06966de3a9 ICU-4098 Allow some break iterators to be removed from the build.
X-SVN-Rev: 16311
2004-09-13 16:00:27 +00:00
Eric Mader
14fbb48bf9 ICU-3770 Updated copyright notices for ICU 3.0
X-SVN-Rev: 15385
2004-05-18 22:01:41 +00:00
Andy Heninger
33b949fbcd ICU-3728 extend rbbi monkey test to cover following(), previous() funcs, line break fixes
X-SVN-Rev: 15373
2004-05-18 18:23:22 +00:00
Andy Heninger
9602743a35 ICU-3728 extend rbbi monkey test to cover following(), previous() funcs
X-SVN-Rev: 15347
2004-05-17 23:16:00 +00:00
Andy Heninger
222ac0d067 ICU-3278 Word Break Rules, fix failures with long-running monkey tests.
X-SVN-Rev: 15329
2004-05-17 02:49:46 +00:00
Eric Mader
0a217cf782 ICU-3700 updated rules for Unicode 4.0.1.
X-SVN-Rev: 15286
2004-05-12 23:29:24 +00:00
Andy Heninger
f1f3be34f8 ICU-3170 More RBBI tweaks for Unicode 4.0.1 update
X-SVN-Rev: 14912
2004-04-08 23:38:02 +00:00
Andy Heninger
ef9f2f2fbc ICU-3170 Sentence Break Rules use new STerm property.
X-SVN-Rev: 14893
2004-04-07 01:09:34 +00:00
Andy Heninger
d23bf8bf5e ICU-3170 Grapheme Cluster Boundary rule tweak for Unicode 4.0
X-SVN-Rev: 14880
2004-04-06 05:31:17 +00:00
Eric Mader
6aac9dbeb8 ICU-3473 Update copyright notices for ICU 2.8.
X-SVN-Rev: 14180
2003-12-18 23:16:48 +00:00
George Rhoten
12aaf39bc2 ICU-2292 Remove redundant spaces.
X-SVN-Rev: 14034
2003-12-08 17:50:35 +00:00
Andy Heninger
4a005de2c6 ICU-2292 comments, minor update.
X-SVN-Rev: 13895
2003-11-26 03:04:38 +00:00
George Rhoten
0f1f0a4a35 ICU-2292 Remove redundant spaces.
X-SVN-Rev: 13868
2003-11-25 00:23:46 +00:00
Syn Wee Quek
8feb899d7d ICU-2292 line break rules updated, 15 mins testmonkey passes
X-SVN-Rev: 13663
2003-11-11 21:24:09 +00:00
Syn Wee Quek
31a8625180 ICU-2292 word break rules updated, 15 mins testmonkey passes
X-SVN-Rev: 13654
2003-11-11 05:00:08 +00:00
Syn Wee Quek
cea200bf0a ICU-2292 sentence break rules updated
X-SVN-Rev: 13649
2003-11-09 20:32:00 +00:00
Syn Wee Quek
41ac2f557b ICU-2292 added safe forward and backwards rules
X-SVN-Rev: 13648
2003-11-09 06:52:44 +00:00
Syn Wee Quek
7bf4d520f6 ICU-2292 added safe rules for forward and backwards
X-SVN-Rev: 13643
2003-11-08 06:21:45 +00:00
Syn Wee Quek
558442a420 ICU-2292 line breaks passing on default option
X-SVN-Rev: 13636
2003-11-07 22:49:38 +00:00
Syn Wee Quek
ab056703bd ICU-2292 added support for old data rules
X-SVN-Rev: 13614
2003-11-07 02:02:06 +00:00
Syn Wee Quek
1ef0ff982e ICU-2292 rbbiapts all test passing
X-SVN-Rev: 13613
2003-11-07 00:04:13 +00:00
Syn Wee Quek
3a0f8e87ea ICU-2292 word breaks comments clean up
X-SVN-Rev: 13606
2003-11-06 20:04:39 +00:00
Syn Wee Quek
3250a0a8ee ICU-2292 word breaks fixed and passing (i think)
X-SVN-Rev: 13604
2003-11-06 19:45:57 +00:00
Syn Wee Quek
469c2d5b76 ICU-2292 first cut of performance improvements, test failures commented out.
X-SVN-Rev: 13596
2003-11-05 23:50:39 +00:00
Andy Heninger
a9cdcba39e ICU-2924 RBBI rule builder, changes for safe point rules. Work in progress.
X-SVN-Rev: 13578
2003-11-05 02:03:44 +00:00
George Rhoten
c74773f5ec ICU-2924 Fix missing $ signs in rules
X-SVN-Rev: 13488
2003-10-24 22:47:46 +00:00
Andy Heninger
3c9eea1d58 ICU-2924 RBBI, fix rule round-tripping error with !! rule options.
X-SVN-Rev: 13470
2003-10-22 00:43:37 +00:00
Andy Heninger
d52bbb8da6 ICU-1117 RBBI Sentence Breaks, distinguish sentences with terminators (.!? etc.) from those without.
X-SVN-Rev: 13468
2003-10-21 21:46:05 +00:00
Andy Heninger
515ffb9930 ICU-2924 RBBI, add !!LBCMNoChain rule option to suppress line break combining mark chaining
X-SVN-Rev: 13461
2003-10-17 23:30:02 +00:00
Andy Heninger
94a9e101e7 ICU-2924 RBBI, line break rules, monkey test, a few more fixes
X-SVN-Rev: 13402
2003-10-13 22:01:53 +00:00
Andy Heninger
ccba9cce88 ICU-2924 Line break update - fix more monkey failures, getting closer.
X-SVN-Rev: 13397
2003-10-13 06:01:21 +00:00
Andy Heninger
a3f8e5695e ICU-2924 RBBI, line break rules, monkey test, better conformance to spec
X-SVN-Rev: 13394
2003-10-11 00:44:36 +00:00
Andy Heninger
72109e9494 ICU-2924 Line break update - fix some test failures.
X-SVN-Rev: 13370
2003-10-09 05:39:58 +00:00
Andy Heninger
d4524826ed ICU-2924 RBBI, new style rule format, new line break rules. (14 known test failures, will fix real soon.)
X-SVN-Rev: 13364
2003-10-09 01:13:08 +00:00
Andy Heninger
d2e0f3a9ac ICU-2128 fix RBBI Word rules and monkey test. All failures GONE!
X-SVN-Rev: 13288
2003-10-02 22:34:25 +00:00
Andy Heninger
5f352ade23 ICU-2924 RBBI Line Break Rule Updates, work in progress.
X-SVN-Rev: 12706
2003-07-29 06:35:54 +00:00
Andy Heninger
6bbbeb7637 ICU-2924 RBBI Line Break Rule Updates, work in progress.
X-SVN-Rev: 12701
2003-07-28 06:40:25 +00:00
Andy Heninger
e371874f36 ICU-2924 RBBI line break rules and monkey test, work in progress
X-SVN-Rev: 12685
2003-07-25 01:15:04 +00:00
Andy Heninger
a7562f974b ICU-2924 RBBI Line Break Rule Updates, work in progress.
X-SVN-Rev: 12643
2003-07-21 05:37:08 +00:00
Andy Heninger
9f3ad9e3c7 ICU-3042 RBBI, distinguish hard & soft line breaks
X-SVN-Rev: 12632
2003-07-16 01:02:16 +00:00
Alan Liu
de95737116 ICU-2959 update copyright dates to include 2003
X-SVN-Rev: 12253
2003-06-03 20:58:22 +00:00
Andy Heninger
e0e7b8f937 ICU-2093 clean up comments in break rule files.
X-SVN-Rev: 12197
2003-05-30 16:07:39 +00:00
Andy Heninger
894c39af36 ICU-2093 Word Breaks, monkey test and rule fixes.
X-SVN-Rev: 12171
2003-05-29 21:15:14 +00:00
Andy Heninger
4a211d4dd1 ICU-2093 line break rule updated; monkey test added (not complete, Grapheme Cluster only so far.)
X-SVN-Rev: 12115
2003-05-27 16:29:25 +00:00
Andy Heninger
3ab36cfb04 ICU-2093 RBBI Title rules, fix bug introduced in previous checkin
X-SVN-Rev: 12038
2003-05-21 20:07:41 +00:00
Andy Heninger
a46dbcf2ea ICU-2093 RBBI Title rules, replace hex ranges with property expression.
X-SVN-Rev: 12026
2003-05-20 18:45:43 +00:00
Andy Heninger
1b2b7444d8 ICU-2093 RBBI Tests updated; title break rules tweaked
X-SVN-Rev: 12025
2003-05-20 18:38:41 +00:00
Andy Heninger
73a3d184bb ICU-2093 rbbi rules and tests updated
X-SVN-Rev: 11974
2003-05-16 22:05:35 +00:00
Andy Heninger
71070da39f ICU-2093 RBBI rule make dependencies for UnicodeSet properties adjusted.
Check for empty UnicodeSets added to builder.

X-SVN-Rev: 11476
2003-04-09 00:09:14 +00:00
Andy Heninger
806b6d974f ICU-2093 Update word break rules to latest Unicode TR, work in progress
X-SVN-Rev: 11472
2003-04-08 05:35:13 +00:00
Andy Heninger
9e3648ad6c ICU-2093 Update grapheme cluster rules to latest Unicode TR
X-SVN-Rev: 11461
2003-04-04 23:41:03 +00:00
Eric Mader
4b2c117bb4 ICU-2603 Added $ALPlus to rules (modified to not include any Thai characters)
X-SVN-Rev: 10786
2002-12-24 20:53:22 +00:00
Andy Heninger
cb90bfba0f ICU-2555 RBBI line break reverse rule modified for better performance
X-SVN-Rev: 10562
2002-12-09 22:36:10 +00:00
Andy Heninger
8745b2cda9 ICU-2555 RBBI line break reverse rule modified for better performance
X-SVN-Rev: 10561
2002-12-09 21:37:21 +00:00
Vladimir Weinstein
012f463115 ICU-2107 added copyright notices
X-SVN-Rev: 10522
2002-12-06 01:40:42 +00:00
Andy Heninger
10ace04b12 ICU-2342 LineBreak rules, fix problem with Greek, Cyrillic
X-SVN-Rev: 9952
2002-10-03 17:53:15 +00:00
Andy Heninger
3144b2665e ICU-2231 RBBI Sentence Break Rules and test updated to match draft of TR 29
X-SVN-Rev: 9823
2002-08-30 21:37:59 +00:00
Andy Heninger
4aefb52904 ICU-2066 Update RBBI rules and tests for char and word breaks
X-SVN-Rev: 9643
2002-08-09 03:14:43 +00:00
Andy Heninger
0bc2ccb78a ICU-45 add word tag value for Ideographics
X-SVN-Rev: 9315
2002-07-24 19:10:18 +00:00
Andy Heninger
e32993b2d8 ICU-45 RBBI, getRuleStatus() works after previous().
More Tests.
Private includes removed from public header
Break rule tag status added to word break rules.

X-SVN-Rev: 9284
2002-07-22 22:02:08 +00:00
Andy Heninger
70621f8923 ICU-45 new builder for RBBI rules, remove obsolete RBBI files
X-SVN-Rev: 8941
2002-06-25 18:53:10 +00:00
Andy Heninger
32c09250b7 ICU-45 new builder for RBBI rules, initial checkin
X-SVN-Rev: 8939
2002-06-25 17:23:07 +00:00
Andy Heninger
2bbb6a2944 ICU-1126 Updated Break Iterator rules. Title for UTR21 compliance, Word and Line for CJK Extension A
Fix cintltst failure introduced by addition of title break iterators.

X-SVN-Rev: 7822
2002-03-01 00:46:21 +00:00
Andy Heninger
5ed2ac8c14 ICU-1126 Add title break iterator
X-SVN-Rev: 7799
2002-02-28 01:09:32 +00:00
Andy Heninger
fc369ffa92 ICU-1101 Break iterator rules - surrogate support copied from plain rules to Thai rules.
X-SVN-Rev: 6960
2001-11-16 23:20:54 +00:00
Andy Heninger
f51931f6c6 ICU-1101 RBBI Rules - Updated .brk files with Eric Mader's changes to the Java rules.
X-SVN-Rev: 6409
2001-10-24 00:26:23 +00:00
Andy Heninger
162a818bfb ICU-1273 Break Iterator, treat PUA chars as ideographs.
Fix incorrect sentence break for extended Letters.

X-SVN-Rev: 6295
2001-10-17 23:20:54 +00:00
Andy Heninger
8a42d02aeb ICU-1101 RBBI Rules for Surrogates,
Devanagari rules update from Java.

X-SVN-Rev: 5580
2001-08-24 17:48:50 +00:00
Eric Mader
c94f70ff6e ICU-603 Updated the Thai word and line break files based
on the latest ICU4J fixes.

X-SVN-Rev: 2454
2000-09-19 21:03:25 +00:00
Madhu K
478438a905 ICU-45 Fails on big-endian platforms. They are supposed to be
big-endian.  The ninth byte should be 01 for big-endian.

X-SVN-Rev: 555
2000-01-13 01:10:40 +00:00
Richard Gillam
5385352ad6 ICU-45 Initial check-in of RuleBasedBreakIterator and DictionaryBasedBreakIterator.
X-SVN-Rev: 504
2000-01-08 02:18:42 +00:00
Richard Gillam
bbccafffa4 ICU-45 Initial check-in of RuleBasedBreakIterator and DictionaryBasedBreakIterator.
X-SVN-Rev: 501
2000-01-08 01:57:41 +00:00