[PATCH] D48586: [AMDGPU] Early expansion of 32 bit udiv/urem

Matt Arsenault via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Jun 26 00:06:09 PDT 2018


arsenm added a comment.

Should we enable BypassSlowDivision or possibly merge this expansion with it?
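
For context, the general idea behind bypassing a slow division is a runtime check on the operands that falls through to a cheaper, narrower division when both fit. The sketch below is only an illustration of that idea in plain C++ for a 64-bit to 32-bit case; the function name udiv64_with_bypass is hypothetical, and this is not the code of the LLVM pass or of this patch.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper illustrating the bypass idea; not taken from LLVM.
// Precondition: d must be non-zero.
std::uint64_t udiv64_with_bypass(std::uint64_t n, std::uint64_t d) {
    assert(d != 0 && "division by zero");
    // If no bit above bit 31 is set in either operand, both fit in 32 bits.
    if (((n | d) >> 32) == 0) {
        // Fast path: do the division in the narrower type.
        return static_cast<std::uint32_t>(n) / static_cast<std::uint32_t>(d);
    }
    // Slow path: full-width division.
    return n / d;
}
```

Whether such a check pays off depends on how much cheaper the narrow division is on the target and how often the operands actually fit.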

The DAG expansion probably also needs to remain for all targets, since DAGCombiner could still introduce new div nodes that would need to be handled.



================
Comment at: lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp:573-574
+  // fr = fabs(fr);
+  Function *FAbs = Intrinsic::getDeclaration(Mod, Intrinsic::fabs, { F32Ty });
+  FR = Builder.CreateCall(FAbs, { FR });
+
----------------
Builder.CreateIntrinsic should work for all of these, instead of getDeclaration followed by CreateCall.


https://reviews.llvm.org/D48586




