[PATCH] D48586: [AMDGPU] Early expansion of 32 bit udiv/urem
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Jun 26 00:11:08 PDT 2018
rampitec added a comment.
In https://reviews.llvm.org/D48586#1143243, @arsenm wrote:
> Should we enable BypassSlowDivision or possibly merge this expansion with it?
Bypass is a separate question as it does runtime resolution. In fact it is questionable optimization for a SIMT, that is enough to have just one thread doing slow division to get the overhead penalty. In anyway this is really a separate optimization.
https://reviews.llvm.org/D48586
More information about the llvm-commits
mailing list