[PATCH] D28508: [NVPTX] Lower to sqrt.approx and rsqrt.approx under more circumstances.

Thu Jan 12 19:41:56 PST 2017

mehdi_amini added a comment.

In https://reviews.llvm.org/D28508#644720, @jlebar wrote:

> > Technically I don't think it is correct for your patch to lower llvm.sqrt (with the FMF) to PTX sqrt.approx, because "The maximum absolute error for sqrt.f32 is TBD."
>
> The patch only does this transformation with fastmath enabled (or if you pass a special flag to llvm that specifically asks for this transformation):

Ah, good :)

I didn't read the patch, just the discussion, and misunderstood.

================
Comment at: llvm/lib/Target/NVPTX/NVPTXInstrInfo.td:932
+// F32 rsqrt.  When do_SQRTF32_APPROX and do_DIVF32_APPROX are enabled, we can
+// transform 1.0f/sqrt(x) into rsqrt.approx.f32.
+//
----------------
I have to say that I find amazing that someone if finally documenting the backend table gen, this has always driven me crazy :)

https://reviews.llvm.org/D28508