This is the mail archive of the libc-ports@sources.redhat.com mailing list for the libc-ports project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [ARM] Optimised strchr and strlen

From: Richard Henderson <rth at twiddle dot net>
To: David Gilbert <david dot gilbert at linaro dot org>
Cc: "Joseph S. Myers" <joseph at codesourcery dot com>, libc-ports at sourceware dot org, patches at linaro dot org
Date: Wed, 21 Dec 2011 13:20:40 -0800
Subject: Re: [ARM] Optimised strchr and strlen
References: <20111219172122.GA10120@davesworkthinkpad> <Pine.LNX.4.64.1112202301310.2743@digraph.polyomino.org.uk> <CA+1XiSf+AcS6EXUmbo7AOhe5fR8WJ43LA90bPukwNrXnaK=O+g@mail.gmail.com>

On 12/21/2011 02:55 AM, David Gilbert wrote:
> That 'simple' one is showing the benefit at the short lengths,
> the 'smarter' one I have is doing 8 bytes/loop and is nice on the long
> strings - but as you can see worse at the short ones.

Having not seen your "smarter" strchr, it's hard to suggest anything
concrete.  I'd have thought that there's enough slack in load delay
that one or two arithmetic operations could be done without penalty...

Something like performing a simple compare loop looking for "alignment plus":

...
	bic	r3, r0, #7
	and	r1, r1, #255
	adds	r3, r3, #32
1:
	ldrb	r2, [r0],#1
	cmp	r2, r1
	cbz	r2, .Lfound_zero
	it	ne
	cmpne	r0, r3
	bne	1b
	cmp	r2, r1
	beq	.Lfound
	@ Here, r0 is aligned.  Do something word-based.
...

or even just

	and	r3, r0, #7
	and	r1, r1, #255
	rsb	r3, r3, #32
1:
	ldrb	r2, [r0],#1
	cmp	r2, r1
	beq	.Lfound
	subs	r3, r3, #1
	cbz	r2, .Lfound_zero
	bne	1b
	@ Here, r0 is aligned.  Do something word-based.


r~

Follow-Ups:
- Re: [ARM] Optimised strchr and strlen
  - From: David Gilbert

References:
- [ARM] Optimised strchr and strlen
  - From: Dr. David Alan Gilbert
- Re: [ARM] Optimised strchr and strlen
  - From: Joseph S. Myers
- Re: [ARM] Optimised strchr and strlen
  - From: David Gilbert

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]