[PATCH] D112395: [AMDGPU] Enable 48-bit mul in AMDGPUCodeGenPrepare.

Abinav Puthan Purayil via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Sun Oct 24 21:28:38 PDT 2021


abinavpp added inline comments.


================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp:536
+    // mul48 (i.e. mul24, mul24hi pair) would truncate the result.
+    if (Size > 48 && LHSBits + RHSBits > 48)
       return false;
----------------
abinavpp wrote:
> I think we were incorrect in doing the `LHSBits + RHSBits > ...` check in D111523. We did not consider the case when an operand has > 24 known bits, but the sum of known bits being in limits. mul24 instruction works only on the low-order 24 bits of its operands.
Ignore this comment. I missed the operand width check in the if above.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D112395/new/

https://reviews.llvm.org/D112395



More information about the llvm-commits mailing list