This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: PATCH: optimized libm single precision routines: erfcf, erff,expf for x86_64.

From: Ulrich Drepper <drepper at gmail dot com>
To: Dmitrieva Liubov <liubov dot dmitrieva at gmail dot com>
Cc: "H.J. Lu" <hjl dot tools at gmail dot com>, libc-alpha at sourceware dot org, marius dot cornea at intel dot com
Date: Wed, 22 Feb 2012 10:14:31 -0500
Subject: Re: PATCH: optimized libm single precision routines: erfcf, erff,expf for x86_64.
Authentication-results: mr.google.com; spf=pass (google.com: domain of drepper@gmail.com designates 10.60.26.166 as permitted sender) smtp.mail=drepper@gmail.com; dkim=pass header.i=drepper@gmail.com
References: <CAHjhQ92+2qNPksQhJbXKTBh5T_SF0MR9MF5NU6SrGgHLaQ=Q4g@mail.gmail.com><CAOPLpQfz7pWOctuXXWLxP7OHf5BrHWOgooxCpzN6AH3CQAa-CQ@mail.gmail.com><CAHjhQ93ioevQXp5qSAzyVp9_nsH1=zGhz9dcCRXDfX96mdkwcw@mail.gmail.com><CAHjhQ922M2B1kodKH4DVis3oTFABgwJ5FLGx_=hqsn-mOuK8Lg@mail.gmail.com><CAMe9rOpcgvcru1bDEA6DPU-YTC8sKAc_1_ZfFNtryJB9kDtG_g@mail.gmail.com><CAHjhQ93fdwEKHqMYJmipaSsn9LJSJURDT0VVsrDn_83boiKDAg@mail.gmail.com><4F3D7C8D.8010202@twiddle.net> <CAMe9rOpkFN9Kx01W8YwacshNYaF8h_Y9u0KZiz4oEqxLCKRjPg@mail.gmail.com><CAHjhQ93Q3GhCioWHXOpHo1yWdeEZKjvG70ZwzoTwRkC_N27-mg@mail.gmail.com>

On Tue, Feb 21, 2012 at 15:32, Dmitrieva Liubov
<liubov.dmitrieva@gmail.com> wrote:
> The updated attached version is significantly hand-tuned assembler code.
> We are looking forward to accepting and releasing this change.

At least the changes made to the attached version should be made.
They are small but save another cycle or two.

I don't see why the computation of j has to be that complicated.  6
bits are taken from the mantissa.  These are then interpreted as a
signed value.  Why?  I know it works but is there really a reason?

If the bits are considered are unsigned that computation for the
access of the array DP_T as well as the computation of  n*k are
simpler (only AND, no shifting and subtracting).  The algo has to be
adapted but this should be possible.  I think nobody really paid
attention to that since the DP_T table was too large (it had 65
entries!).

Follow-Ups:
- Re: PATCH: optimized libm single precision routines: erfcf, erff,expf for x86_64.
  - From: Ulrich Drepper
- Re: PATCH: optimized libm single precision routines: erfcf, erff,expf for x86_64.
  - From: Quentin Neill

References:
- PATCH: optimized libm single precision routines: erfcf, erff, expffor x86_64.
  - From: Dmitrieva Liubov
- Re: PATCH: optimized libm single precision routines: erfcf, erff,expf for x86_64.
  - From: Ulrich Drepper
- Re: PATCH: optimized libm single precision routines: erfcf, erff,expf for x86_64.
  - From: Dmitrieva Liubov
- Re: PATCH: optimized libm single precision routines: erfcf, erff,expf for x86_64.
  - From: Dmitrieva Liubov
- Re: PATCH: optimized libm single precision routines: erfcf, erff,expf for x86_64.
  - From: H.J. Lu
- Re: PATCH: optimized libm single precision routines: erfcf, erff,expf for x86_64.
  - From: Dmitrieva Liubov
- Re: PATCH: optimized libm single precision routines: erfcf, erff,expf for x86_64.
  - From: Richard Henderson
- Re: PATCH: optimized libm single precision routines: erfcf, erff,expf for x86_64.
  - From: H.J. Lu
- Re: PATCH: optimized libm single precision routines: erfcf, erff,expf for x86_64.
  - From: Dmitrieva Liubov

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]