This is the mail archive of the
libc-alpha@sources.redhat.com
mailing list for the glibc project.
Re: [PATCH] More regex microoptimization
- From: "Paolo Bonzini" <bonzini at gnu dot org>
- To: "Ulrich Drepper" <drepper at redhat dot com>
- Cc: <libc-alpha at sources dot redhat dot com>
- Date: Thu, 11 Mar 2004 18:43:00 +0100
- Subject: Re: [PATCH] More regex microoptimization
- References: <20040310113331.GA5994@fencepost> <404F67FC.5030707@redhat.com>
> In the "real world" UTF-8 is now the
> predominent encoding for a locale
Well, I believe that ISO-8859 (plus people that leave LC_ALL=C or unset) is
still predominant, and in addition many UTF-8 regexes are optimized to
mb_cur_max == 1 (see optimize_utf8 in regcomp.c). But if you think that the
range of optimizable regexes is overall a minority, that's another objection.
I will prepare another patch, you will choose the one to apply.
Paolo