[PATCH] D34844: [AMDGPU] Always use rcp + mul with fast math

Stanislav Mekhanoshin via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Jun 29 16:30:23 PDT 2017


rampitec added inline comments.


================
Comment at: lib/Target/AMDGPU/SIISelLowering.cpp:3780
     SDNodeFlags NewFlags;
     NewFlags.setUnsafeAlgebra(true);
     SDValue Recip = DAG.getNode(AMDGPUISD::RCP, SL, VT, RHS);
----------------
arsenm wrote:
> This is a possible source of errors now that less-strict flags can now trigger this
Triggering this is the intent. That is how the HSAIL compiler works, and we have had no complaints so far. Anyway, I am running conformance now.

A philosophical question, though: which shall prevail, unsafe FP or denorm support? Once again, HSAIL favors relaxation, so I did the same.

In fact, this implementation is stricter than the one we have in HSAIL. There we applied the options to the library as well, while here they are applied only to user code.
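
To make the unsafe FP versus denorm trade-off concrete, here is a minimal host-side sketch in plain C++. It is not the SelectionDAG lowering from the patch and it does not model the hardware rcp instruction; the helper names div_exact and div_rcp_mul are hypothetical. It only shows that computing a / b as a * (1 / b) can diverge from a single IEEE-754 division when the divisor is a denormal, because the reciprocal overflows before the multiply.

```cpp
#include <cassert>
#include <cmath>

// Reference: a single IEEE-754 division with one rounding step.
float div_exact(float a, float b) {
    assert(b != 0.0f && "sketch assumes a nonzero divisor");
    return a / b;
}

// Fast-math style expansion (hypothetical helper): reciprocal, then multiply.
// This rounds twice, and the reciprocal can overflow for tiny divisors.
float div_rcp_mul(float a, float b) {
    assert(b != 0.0f && "sketch assumes a nonzero divisor");
    // volatile keeps the compiler from folding this back into a / b.
    volatile float recip = 1.0f / b;
    return a * recip;
}
```

For example, on an IEEE-754 host with denormals enabled, take d = 1e-39f, which is a denormal float. div_exact(d, d) gives 1.0f, while div_rcp_mul(d, d) gives infinity because 1.0f / d exceeds the largest finite float. For ordinary operands such as 6.0f and 2.0f, both helpers give 3.0f.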


https://reviews.llvm.org/D34844




