Carlos O'Donell 466f2be6c0 Add generic C.UTF-8 locale (Bug 17318)
We add a new C.UTF-8 locale. This locale is not built into glibc, but
is provided as a distinct locale. The locale provides full support for
UTF-8 and this includes full code point sorting via STRCMP-based
collation (strcmp or wcscmp).

The collation uses a new keyword 'codepoint_collation', which drops all
collation rules and generates an empty, zero-rule collation to enable
STRCMP usage in collation. This ensures that we get full code point
sorting for C.UTF-8 with a minimal 1406 bytes of overhead (LC_COLLATE
structure information and ASCII collating tables).

The new locale is added to SUPPORTED. Minimal test data for specific
code points (minus those not supported by collate-test) is provided in
C.UTF-8.in, and this verifies code point sorting is working reasonably
across the range. The locale was tested manually with the full set of
code points without failure.

The locale is harmonized with locales already shipping in various
downstream distributions. A new tst-iconv9 test is added which verifies
the C.UTF-8 locale is generally usable.

Testing for fnmatch, regexec, and regcomp is provided by extending
bug-regex1, bug-regex19, bug-regex4, bug-regex6, transbug, tst-fnmatch,
tst-regcomp-truncated, and tst-regex to use C.UTF-8.

Tested on x86_64 or i686 without regression.

Reviewed-by: Florian Weimer <fweimer@redhat.com>
2021-09-06 11:30:28 -04:00
argp Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
assert Update copyright dates with scripts/update-copyrights 2021-01-02 12:17:34 -08:00
benchtests Remove sysdeps/*/tls-macros.h 2021-08-18 09:15:20 -07:00
bits Update floating-point feature test macro handling for C2X 2021-06-01 14:22:06 +00:00
catgets Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
ChangeLog.old Update ChangeLog.old/ChangeLog.23. 2021-08-01 21:33:43 -04:00
conform Allow #pragma GCC in headers in conformtest 2021-08-27 17:47:46 +00:00
crypt Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
csu Use __executable_start as the lowest address for profiling [BZ #28153] 2021-08-24 06:44:18 -07:00
ctype Update copyright dates with scripts/update-copyrights 2021-01-02 12:17:34 -08:00
debug Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
dirent Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
dlfcn Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
elf Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
gmon Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
gnulib Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
grp Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
gshadow Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
hesiod Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
htl Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
hurd Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
iconv Add generic C.UTF-8 locale (Bug 17318) 2021-09-06 11:30:28 -04:00
iconvdata Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
include Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
inet Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
intl Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
io Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
libio Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
locale Add 'codepoint_collation' support for LC_COLLATE. 2021-09-06 11:06:45 -04:00
localedata Add generic C.UTF-8 locale (Bug 17318) 2021-09-06 11:30:28 -04:00
login Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
mach Update copyright dates with scripts/update-copyrights 2021-01-02 12:17:34 -08:00
malloc Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
manual Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
math Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
mathvec Update copyright dates with scripts/update-copyrights 2021-01-02 12:17:34 -08:00
misc Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
nis Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
nptl Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
nptl_db Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
nscd Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
nss Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
po po/nl.po: Update Dutch translation. 2021-08-01 20:52:28 -04:00
posix Add generic C.UTF-8 locale (Bug 17318) 2021-09-06 11:30:28 -04:00
pwd Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
resolv Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
resource y2038: Add support for 64-bit time on legacy ABIs 2021-06-15 10:42:11 -03:00
rt Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
scripts Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
setjmp nptl: Move __pthread_unwind_next into libc 2021-04-21 19:49:50 +02:00
shadow Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
signal Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
socket socket: Add time64 alias for setsockopt 2021-07-22 19:16:26 +02:00
soft-fp Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
stdio-common Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
stdlib Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
string Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
sunrpc Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
support support: Add support_wait_for_thread_exit 2021-08-30 13:43:56 +02:00
sysdeps AArch64: Update A64FX memset not to degrade at 16KB 2021-09-06 10:23:24 +01:00
sysvipc Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
termios Update copyright dates with scripts/update-copyrights 2021-01-02 12:17:34 -08:00
time Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
timezone Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
wcsmbs Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
wctype Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
.gitattributes Assume __NR_openat is always defined 2016-03-23 23:35:08 +01:00
.gitignore Add *.pyc to .gitignore 2015-05-18 15:26:26 +05:30
abi-tags Remove the bulk of the NaCl port. 2017-05-20 08:09:10 -04:00
aclocal.m4 configure: Replaced obsolete AC_TRY_COMPILE 2021-06-04 10:16:00 -03:00
config.h.in x86-64: Remove assembler AVX512DQ check 2021-08-24 07:05:35 -07:00
config.make.in Add pthread-in-libc, libpthread-routines-var, librt-routines-var 2021-05-03 08:13:32 +02:00
configure configure: Allow LD to be LLD 13.0.0 or above [BZ #26558] 2021-08-31 20:23:34 -07:00
configure.ac configure: Allow LD to be LLD 13.0.0 or above [BZ #26558] 2021-08-31 20:23:34 -07:00
CONTRIBUTED-BY Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
COPYING Update to latest versions of GPL-2.0 and LGPL-2.1 2013-09-09 12:52:48 +10:00
COPYING.LIB Update to latest versions of GPL-2.0 and LGPL-2.1 2013-09-09 12:52:48 +10:00
extra-lib.mk Rename cppflags-iterator.mk to libof-iterator.mk, remove extra-modules.mk. 2017-05-09 07:06:29 -04:00
gen-locales.mk Improve gen-locales.mk and gen-locale.sh to make test files with @ options work 2018-02-27 17:01:57 +01:00
INSTALL Update install.texi, and regenerate INSTALL. 2021-08-01 16:48:43 -04:00
libc-abis riscv: support GNU indirect function 2021-01-10 21:25:13 -05:00
libof-iterator.mk Rename cppflags-iterator.mk to libof-iterator.mk, remove extra-modules.mk. 2017-05-09 07:06:29 -04:00
LICENSES Prefer https to http for gnu.org and fsf.org URLs 2019-09-07 02:43:31 -07:00
MAINTAINERS Add MAINTAINERS 2017-05-11 13:38:30 -04:00
Makeconfig Force building with -fno-common 2021-07-09 20:09:14 +02:00
Makefile Install shared objects under their ABI names 2021-06-28 08:33:57 +02:00
Makefile.help Update copyright dates with scripts/update-copyrights 2021-01-02 12:17:34 -08:00
Makefile.in New make target to only build benchmark binaries 2016-04-20 10:23:28 +05:30
Makerules Install shared objects under their ABI names 2021-06-28 08:33:57 +02:00
NEWS Add generic C.UTF-8 locale (Bug 17318) 2021-09-06 11:30:28 -04:00
o-iterator.mk Fri Mar 17 12:58:37 1995 Roland McGrath <roland@churchy.gnu.ai.mit.edu> 1995-03-17 18:42:51 +00:00
README Documentation for the RISC-V 32-bit port 2020-08-27 08:17:44 -07:00
Rules Move malloc hooks into a compat DSO 2021-07-22 18:37:59 +05:30
SHARED-FILES Port shared code information from the wiki 2021-09-03 22:00:37 +05:30
shlib-versions Move malloc hooks into a compat DSO 2021-07-22 18:37:59 +05:30
test-skeleton.c Remove "Contributed by" lines 2021-09-03 22:06:44 +05:30
version.h Open master branch for glibc 2.35 development 2021-08-01 21:54:40 -04:00

This directory contains the sources of the GNU C Library.
See the file "version.h" for what release version you have.

The GNU C Library is the standard system C library for all GNU systems,
and is an important part of what makes up a GNU system.  It provides the
system API for all programs written in C and C-compatible languages such
as C++ and Objective C; the runtime facilities of other programming
languages use the C library to access the underlying operating system.

In GNU/Linux systems, the C library works with the Linux kernel to
implement the operating system behavior seen by user applications.
In GNU/Hurd systems, it works with a microkernel and Hurd servers.

The GNU C Library implements much of the POSIX.1 functionality in the
GNU/Hurd system, using configurations i[4567]86-*-gnu.

When working with Linux kernels, this version of the GNU C Library
requires Linux kernel version 3.2 or later.

Also note that the shared version of the libgcc_s library must be
installed for the pthread library to work correctly.

The GNU C Library supports these configurations for using Linux kernels:

	aarch64*-*-linux-gnu
	alpha*-*-linux-gnu
	arc*-*-linux-gnu
	arm-*-linux-gnueabi
	csky-*-linux-gnuabiv2
	hppa-*-linux-gnu
	i[4567]86-*-linux-gnu
	x86_64-*-linux-gnu	Can build either x86_64 or x32
	ia64-*-linux-gnu
	m68k-*-linux-gnu
	microblaze*-*-linux-gnu
	mips-*-linux-gnu
	mips64-*-linux-gnu
	powerpc-*-linux-gnu	Hardware or software floating point, BE only.
	powerpc64*-*-linux-gnu	Big-endian and little-endian.
	s390-*-linux-gnu
	s390x-*-linux-gnu
	riscv32-*-linux-gnu
	riscv64-*-linux-gnu
	sh[34]-*-linux-gnu
	sparc*-*-linux-gnu
	sparc64*-*-linux-gnu

If you are interested in doing a port, please contact the glibc
maintainers; see https://www.gnu.org/software/libc/ for more
information.

See the file INSTALL to find out how to configure, build, and install
the GNU C Library.  You might also consider reading the WWW pages for
the C library at https://www.gnu.org/software/libc/.

The GNU C Library is (almost) completely documented by the Texinfo manual
found in the `manual/' subdirectory.  The manual is still being updated
and contains some known errors and omissions; we regret that we do not
have the resources to work on the manual as much as we would like.  For
corrections to the manual, please file a bug in the `manual' component,
following the bug-reporting instructions below.  Please be sure to check
the manual in the current development sources to see if your problem has
already been corrected.

Please see https://www.gnu.org/software/libc/bugs.html for bug reporting
information.  We are now using the Bugzilla system to track all bug reports.
This web page gives detailed information on how to report bugs properly.

The GNU C Library is free software.  See the file COPYING.LIB for copying
conditions, and LICENSES for notices about a few contributions that require
these additional notices to be distributed.  License copyright years may be
listed using range notation, e.g., 1996-2015, indicating that every year in
the range, inclusive, is a copyrightable year that would otherwise be listed
individually.