[Libclc-dev] [PATCH 1/1] Implement log10

Jan Vesely jan.vesely at rutgers.edu
Wed Oct 22 09:32:49 PDT 2014


On Tue, 2014-10-21 at 17:05 -0700, Matt Arsenault wrote:
> On 10/21/2014 05:00 PM, Jan Vesely wrote:
> > Signed-off-by: Jan Vesely <jan.vesely at rutgers.edu>
> > ---
> >
> > Fixes myocyte rodinia benchmark.
> > piglit test posted
> >
> >   generic/include/clc/clc.h        |  1 +
> >   generic/include/clc/math/log10.h |  9 +++++++++
> >   generic/lib/SOURCES              |  1 +
> >   generic/lib/math/log10.cl        |  8 ++++++++
> >   generic/lib/math/log10.inc       | 10 ++++++++++
> >   5 files changed, 29 insertions(+)
> >   create mode 100644 generic/include/clc/math/log10.h
> >   create mode 100644 generic/lib/math/log10.cl
> >   create mode 100644 generic/lib/math/log10.inc
> >
> > diff --git a/generic/include/clc/clc.h b/generic/include/clc/clc.h
> > index bd92fdb..29c1855 100644
> > --- a/generic/include/clc/clc.h
> > +++ b/generic/include/clc/clc.h
> > @@ -50,6 +50,7 @@
> >   #include <clc/math/fmod.h>
> >   #include <clc/math/hypot.h>
> >   #include <clc/math/log.h>
> > +#include <clc/math/log10.h>
> >   #include <clc/math/log1p.h>
> >   #include <clc/math/log2.h>
> >   #include <clc/math/mad.h>
> > diff --git a/generic/include/clc/math/log10.h b/generic/include/clc/math/log10.h
> > new file mode 100644
> > index 0000000..ec4e4ae
> > --- /dev/null
> > +++ b/generic/include/clc/math/log10.h
> > @@ -0,0 +1,9 @@
> > +#undef log10
> > +
> > +#define __CLC_BODY <clc/math/unary_decl.inc>
> > +#define __CLC_FUNCTION log10
> > +
> > +#include <clc/math/gentype.inc>
> > +
> > +#undef __CLC_BODY
> > +#undef __CLC_FUNCTION
> > diff --git a/generic/lib/SOURCES b/generic/lib/SOURCES
> > index b76fec9..1032606 100644
> > --- a/generic/lib/SOURCES
> > +++ b/generic/lib/SOURCES
> > @@ -63,6 +63,7 @@ math/fmax.cl
> >   math/fmin.cl
> >   math/fmod.cl
> >   math/hypot.cl
> > +math/log10.cl
> >   math/log1p.cl
> >   math/mad.cl
> >   math/mix.cl
> > diff --git a/generic/lib/math/log10.cl b/generic/lib/math/log10.cl
> > new file mode 100644
> > index 0000000..d65764a
> > --- /dev/null
> > +++ b/generic/lib/math/log10.cl
> > @@ -0,0 +1,8 @@
> > +#include <clc/clc.h>
> > +
> > +#ifdef cl_khr_fp64
> > +#pragma OPENCL EXTENSION cl_khr_fp64 : enable
> > +#endif
> > +
> > +#define __CLC_BODY <log10.inc>
> > +#include <clc/math/gentype.inc>
> > diff --git a/generic/lib/math/log10.inc b/generic/lib/math/log10.inc
> > new file mode 100644
> > index 0000000..82e77bc
> > --- /dev/null
> > +++ b/generic/lib/math/log10.inc
> > @@ -0,0 +1,10 @@
> > +_CLC_OVERLOAD _CLC_DEF __CLC_GENTYPE log10(__CLC_GENTYPE val) {
> > +  // log10(x) = log2(x) / log2(10)
> > +#if __CLC_FPSIZE == 32
> > +  return log2(val) / log2(10.0f);
> > +#elif __CLC_FPSIZE == 64
> > +  return exp2(val) / log2(10.0);
             ^^^^ should have been log2
> > +#else
> > +#error unknown _CLC_FPSIZE
> > +#endif
> > +}
> 
> Last time I tried it, these functions failed the OpenCL conformance 
> test. Have you considered porting from the amd-builtins branch instead?

is precision the problem?

I could not find any information about relative errors of hw
instructions. I thought that _IEEE instructions produced correctly
rounded results.
I'm still not clear about how the relative error propagates across
operations:
if we have log1p implemented ( with error <= 2ulp ) would
log(x) = log1p(x-1.0) result in error <= 2 ulp since subtraction is
correctly rounded?
if so would log(x) / LOG_2 (correctly rounded constant) give correct
log2 (error <= 3 ulp)?

jan

> 
> 

-- 
Jan Vesely <jan.vesely at rutgers.edu>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 819 bytes
Desc: This is a digitally signed message part
URL: <http://lists.llvm.org/pipermail/libclc-dev/attachments/20141022/22f8f9ce/attachment.sig>


More information about the Libclc-dev mailing list