Go to file
Andy Heninger d6b88d49e3 ICU-20939 Fix problem w regexp \b boundaries & UTF-8 text
In regular expressions, when testing for word boundaries with \b, the
boundaries were incorrect when in Unicode mode, meaning that an ICU word break
iterator is being used to find the boundaries, and the text being matched is
UTF-8 encoded.

The bug stemmed from a misunderstanding of how string indexes work with UText
and break iterators, leading to the inclusion of code to convert from UTF-8 to
UTF-16 indexing, when what was wanted was the original UTF-8 index everywhere.
Removing the indexing conversion fixes the problem.
2020-02-03 16:51:17 -08:00
.ci-builds ICU-20673 Allow built-in translit ID w/o data. 2020-01-16 21:28:01 -08:00
.github ICU-20198 Add issue browse link to PR template. (#200) 2018-10-09 14:05:33 -05:00
docs ICU-20744 Add VS Code configuration files for c++ 2020-01-29 14:21:01 +01:00
icu4c ICU-20939 Fix problem w regexp \b boundaries & UTF-8 text 2020-02-03 16:51:17 -08:00
icu4j ICU-20099 Implementing Java ListFormatter proposals for ICU 67. 2020-01-22 16:45:31 -08:00
tools ICU-20693 Tidyups and small fixes for ICU conversion code 2020-01-21 10:20:51 +01:00
vendor/double-conversion ICU-20796 Updating double-conversion from upstream master. 2019-09-27 13:03:30 -07:00
.appveyor.yml ICU-20434 ICU4C AppVeyor CI builds failing on Cygwin 3.0.0. Use Cygwin to 2.11 for now. 2019-02-18 21:54:44 -08:00
.cpyskip.txt ICU-20873 Add KEYS file to cpyskip.txt for Travis Linter. 2019-12-04 16:56:43 -08:00
.gitattributes ICU-20892 Highlight .cpyskip.txt on GitHub.com 2019-12-18 10:39:43 -08:00
.gitignore ICU-20793 ignore binary file from git 2019-09-05 18:04:59 +02:00
.travis.yml ICU-20814 new tool for checking API tags vs. ifdef guards; add FORCE guards, and move some guards, to make this work 2019-09-23 17:04:53 -07:00
KEYS ICU-20873 Add PGP key for jefgen to KEYS file 2019-12-04 16:56:43 -08:00
README.md ICU-20855 Add OSS-Fuzz fuzzing badge 2019-10-09 11:10:50 -07:00

International Components for Unicode

This is the repository for the International Components for Unicode. The ICU project is under the stewardship of The Unicode Consortium.

ICU Logo

Build Status (master branch)

Build Status
TravisCI Build Status
Azure Pipelines Build Status
Azure Pipelines (Exhaustive Tests) Build Status
AppVeyor Build status
Fuzzing Fuzzing Status

Subdirectories and Information

License

Please see ./icu4c/LICENSE (C and J are under an identical license file.)

Copyright © 2016 and later Unicode, Inc. and others. All Rights Reserved. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. Terms of Use and License