ICU-10299 Fix CjkBreakEngine fSet to include 30FC,FF70; fix broken test data (ICU4C)

X-SVN-Rev: 34118
Peter Edberg 2013-08-29 05:13:36 +00:00
parent 6ddf597269
commit 2f02059dda
2 changed files with 5 additions and 4 deletions

@@ -1,6 +1,6 @@
 /**
 *******************************************************************************
-* Copyright (C) 2006-2012, International Business Machines Corporation
+* Copyright (C) 2006-2013, International Business Machines Corporation
 * and others. All Rights Reserved.
 *******************************************************************************
 */
@@ -667,7 +667,8 @@ CjkBreakEngine::CjkBreakEngine(DictionaryMatcher *adoptDictionary, LanguageType
 cjSet.addAll(fHanWordSet);
 cjSet.addAll(fKatakanaWordSet);
 cjSet.addAll(fHiraganaWordSet);
-cjSet.add(UNICODE_STRING_SIMPLE("\\uff70\\u30fc"));
+cjSet.add(0xFF70); // HALFWIDTH KATAKANA-HIRAGANA PROLONGED SOUND MARK
+cjSet.add(0x30FC); // KATAKANA-HIRAGANA PROLONGED SOUND MARK
 setCharacters(cjSet);
 }
 }
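
Note on the hunk above: ICU's UnicodeSet::add(const UnicodeString&) adds its argument as a single multi-character string element, whereas UnicodeSet::add(UChar32) adds one code point. That is presumably why the old call left U+FF70 and U+30FC out of the engine's character set (the doubled backslashes passed to UNICODE_STRING_SIMPLE may also have kept the escapes from being interpreted at all). The sketch below is a toy model of that distinction using only the C++ standard library. ToySet, buildWithStringAdd, and buildWithCodePointAdds are hypothetical names for illustration and are not ICU API.

```cpp
#include <cassert>
#include <set>
#include <string>

// Hypothetical stand-in for a set that, like ICU's UnicodeSet, can hold both
// single code points and multi-character strings. Not the ICU implementation.
struct ToySet {
    std::set<char32_t> codePoints;     // individual code points
    std::set<std::u32string> strings;  // multi-character string elements

    void add(char32_t c) { codePoints.insert(c); }
    void add(const std::u32string &s) { strings.insert(s); }

    // Membership test for a single code point; string elements do not count.
    bool contains(char32_t c) const { return codePoints.count(c) != 0; }
};

constexpr char32_t kHalfwidthProlongedSoundMark = 0xFF70;
constexpr char32_t kProlongedSoundMark = 0x30FC;

// Models the old code: one add() call with a two-character string.
ToySet buildWithStringAdd() {
    ToySet s;
    s.add(std::u32string{kHalfwidthProlongedSoundMark, kProlongedSoundMark});
    return s;
}

// Models the new code: one add() call per code point.
ToySet buildWithCodePointAdds() {
    ToySet s;
    s.add(kHalfwidthProlongedSoundMark);
    s.add(kProlongedSoundMark);
    return s;
}

// Small self-check that can be called from any test driver.
void checkToySets() {
    assert(!buildWithStringAdd().contains(kProlongedSoundMark));
    assert(buildWithCodePointAdds().contains(kProlongedSoundMark));
}
```

In this model the string-based build holds one string element and no code points, so a per-character membership check fails for both prolonged sound marks; the per-code-point build passes it. This matches the updated test data, where コンピュータ and ワード are no longer split around the prolonged sound mark.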

@@ -757,11 +757,11 @@ Bangkok)•</data>
 <locale ja>
 <word>
-<data>•私<400>達<400>に<400>一<400><400><400>の<400>コンピュ<400><400>タ<400>が<400>ある<400>。<0>奈々<400>は<400>ワ<400><400>ド<400>で<400>ある<400>。•</data>
+<data>•私<400>達<400>に<400>一<400><400><400>の<400>コンピュータ<400>が<400>ある<400>。<0>奈々<400>は<400>ワード<400>で<400>ある<400>。•</data>
 <locale root>
 <word>
-<data>•私<400>達<400>に<400>一<400><400><400>の<400>コンピュ<400><400>タ<400>が<400>ある<400>。<0>奈々<400>は<400>ワ<400><400>ド<400>で<400>ある<400>。•</data>
+<data>•私<400>達<400>に<400>一<400><400><400>の<400>コンピュータ<400>が<400>ある<400>。<0>奈々<400>は<400>ワード<400>で<400>ある<400>。•</data>
 # UBreakIteratorType UBRK_SENTENCE, Locale "el"
 # Add break after Greek question mark (cldrbug #2069).