Commit Graph

385 Commits

Author SHA1 Message Date
Alan Liu
26fda5bab3 Convert non-ASCII characters to Unicode escapes
X-SVN-Rev: 2602
2000-10-06 23:16:40 +00:00
Alan Liu
0842019826 InterIndic transliterator support
X-SVN-Rev: 2601
2000-10-06 23:08:08 +00:00
Madhu K
344330dfa2 Added tests for transliteration
X-SVN-Rev: 2600
2000-10-06 21:42:38 +00:00
Madhu K
7f0bab0a86 .equals() correction
X-SVN-Rev: 2571
2000-10-04 23:12:33 +00:00
Madhu K
7ffcdd1311 Added tests for RBBI and UnicodeFilterLogic
X-SVN-Rev: 2568
2000-10-04 21:58:59 +00:00
Madhu K
f4df91ee89 Tests for UnicodeFilterLogic initial checkin
X-SVN-Rev: 2567
2000-10-04 21:58:12 +00:00
Madhu K
3eeda4d03a Tests for Rule based break iterator initial checkin
X-SVN-Rev: 2566
2000-10-04 21:57:45 +00:00
Eric Mader
6bdb5d6430 Fixed a small bug in the rule for a syllable with at least one consonant.
X-SVN-Rev: 2518
2000-09-26 22:46:50 +00:00
Eric Mader
7bea47df66 Update character break rules to match Devanagari syllables.
X-SVN-Rev: 2517
2000-09-26 21:47:31 +00:00
Eric Mader
4b03f9d53d Initial checkin of demo of Devanagari break rules.
X-SVN-Rev: 2516
2000-09-26 21:46:54 +00:00
Eric Mader
67f67bbe1f Moved to src/com/ibm/tools/rbbi
X-SVN-Rev: 2512
2000-09-25 21:55:02 +00:00
Eric Mader
68e2f02cae Moved to src/com/ibm/data/misc
X-SVN-Rev: 2511
2000-09-25 21:54:21 +00:00
Eric Mader
1d2d2b83a0 Filter out null characters in printWordList.
X-SVN-Rev: 2510
2000-09-25 21:33:43 +00:00
Eric Mader
33c8d5c668 Removed null characters so it can be checked in as text.
X-SVN-Rev: 2509
2000-09-25 21:31:46 +00:00
Eric Mader
ea758baa19 New version minus null characters so it can be checked in as a text file.
X-SVN-Rev: 2508
2000-09-25 21:06:47 +00:00
Alan Liu
229efe01a9 Move hex and other static methods from UInfo to Utility
X-SVN-Rev: 2492
2000-09-22 20:06:12 +00:00
Eric Mader
b84bfcbdb7 Initial checkin of the break dictionary builder in this package.
X-SVN-Rev: 2487
2000-09-22 18:58:26 +00:00
Eric Mader
f7f539f40f Initial checkin of the tool to write .brk data files for ICU
X-SVN-Rev: 2486
2000-09-22 18:57:44 +00:00
Eric Mader
c723eaeb35 Get english.dict from com/ibm/data/misc
X-SVN-Rev: 2485
2000-09-22 18:52:47 +00:00
Eric Mader
00d33b6476 printWordList: stop flag is 0xFFFF not -1
X-SVN-Rev: 2484
2000-09-22 18:51:16 +00:00
Eric Mader
65a8417978 Checkin in new central location. (Don't know why everything thinks this file
is binary...)

X-SVN-Rev: 2482
2000-09-22 18:48:05 +00:00
Eric Mader
f2a1c6bdb7 Checkin in new central location.
X-SVN-Rev: 2481
2000-09-22 18:47:34 +00:00
Eric Mader
4d40f33f9d Initial checkin of the word list for the Thai word break dictionary
X-SVN-Rev: 2480
2000-09-22 18:45:51 +00:00
Alan Liu
5175e1933f Move hex and other static methods from UInfo to Utility
X-SVN-Rev: 2475
2000-09-21 22:37:55 +00:00
Alan Liu
20ed04f412 Convert non-ASCII characters to Unicode
X-SVN-Rev: 2463
2000-09-20 22:51:23 +00:00
Alan Liu
1b306ad4fe Convert non-ASCII characters to Unicode
X-SVN-Rev: 2461
2000-09-20 22:39:12 +00:00
Eric Mader
6dead1ec6d Initial checkin: this application writes the .brk files for ICU.
X-SVN-Rev: 2455
2000-09-19 21:05:46 +00:00
Alan Liu
8789277d67 Add missing newline to end of file (to make diff et al happy)
X-SVN-Rev: 2453
2000-09-19 19:26:47 +00:00
Alan Liu
d39ed7b0eb Convert non-ASCII characters to Unicode
X-SVN-Rev: 2452
2000-09-19 19:24:47 +00:00
Alan Liu
663e3b071d Add missing newline to end of file (to make diff et al happy)
X-SVN-Rev: 2451
2000-09-19 18:37:36 +00:00
Alan Liu
fcd4c3780e ICU-564 reduce C++ code memory leaks
X-SVN-Rev: 2447
2000-09-19 17:04:47 +00:00
Eric Mader
308e4ef84b In writeTablesToFile we need to look at the littleEndian flag
when writing the endianness flag

X-SVN-Rev: 2435
2000-09-15 16:07:52 +00:00
Ram Viswanadha
68b726a228 no message
X-SVN-Rev: 2429
2000-09-11 15:27:48 +00:00
Alan Liu
58c0f1bf5b Implement anchors.
X-SVN-Rev: 2408
2000-08-31 17:11:42 +00:00
Alan Liu
5cc3a06755 Implement anchors.
X-SVN-Rev: 2400
2000-08-30 20:40:30 +00:00
Alan Liu
2ccf765e0a Add test for anchors
X-SVN-Rev: 2305
2000-08-17 15:31:40 +00:00
Alan Liu
2209a45f2d ICU-89 change int8_t to uint8_t per compiler warnings
X-SVN-Rev: 2193
2000-08-11 17:31:41 +00:00
Alan Liu
f8afd6398e Add tests of CharacterIterator API
X-SVN-Rev: 2044
2000-07-26 16:21:30 +00:00
Alan Liu
1c254cf440 Conformance test suite fixes
X-SVN-Rev: 2043
2000-07-26 16:20:40 +00:00
Alan Liu
019802e4df Minor cleanup
X-SVN-Rev: 2042
2000-07-26 16:20:16 +00:00
Eric Mader
b2df4167b0 Updated some comments in ParseRule()
X-SVN-Rev: 2035
2000-07-25 21:11:59 +00:00
Alan Liu
10691bf5c2 For normalizer conformance testing
X-SVN-Rev: 2013
2000-07-21 21:27:56 +00:00
Alan Liu
4407b8c40e Change typeMask to typeBit#, go from 32 different classes to 64, keep all combining chars, handle canonical singletons
X-SVN-Rev: 2012
2000-07-21 21:26:34 +00:00
Alan Liu
f59bd17017 Regenerate
X-SVN-Rev: 2011
2000-07-21 21:26:12 +00:00
Alan Liu
b52b7de68d Add normalizer conformance test
X-SVN-Rev: 2010
2000-07-21 21:21:03 +00:00
Eric Mader
f0e480d865 Converted and cleaned up rules.
X-SVN-Rev: 1979
2000-07-20 17:15:06 +00:00
Eric Mader
c737bdcf44 When processing "*" and "+" need to add any new states created
by looping back to decision point list. Also did some minor cleanup.

X-SVN-Rev: 1978
2000-07-20 17:03:33 +00:00
Eric Mader
88c7f47c12 Fixed check for what categories are in $dictionary.
X-SVN-Rev: 1977
2000-07-20 17:02:08 +00:00
Eric Mader
01717271e9 Need to close output file when dumping word list.
X-SVN-Rev: 1976
2000-07-20 17:01:36 +00:00
Eric Mader
fbfe74d4c3 New syntax, copied most recent default rules.
X-SVN-Rev: 1952
2000-07-19 21:17:18 +00:00