[PATCH] D104049: [AMDGPU] [CodeGen] Fold negate llvm.amdgcn.class into test mask
Anshil Gandhi via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Jun 11 09:09:03 PDT 2021
gandhi21299 marked 6 inline comments as done.
gandhi21299 added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp:1452-1463
+ if (isa<TruncInst>(ExtCall))
+ IntrinsicCall =
+ dyn_cast<CallInst>(cast<TruncInst>(ExtCall)->getOperand(0));
+ else if (isa<SExtInst>(ExtCall))
+ IntrinsicCall =
+ dyn_cast<CallInst>(cast<SExtInst>(ExtCall)->getOperand(0));
+ else if (isa<ZExtInst>(ExtCall))
----------------
arsenm wrote:
> I don't know why you are looking at all of these extensions. The xor should directly consume the call
I am considering the case where the result from the class intrinsic is extended/truncated for some reason. Is it possible to get the IntrinsicInst directly from the xor operand?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D104049/new/
https://reviews.llvm.org/D104049
More information about the llvm-commits
mailing list