[PATCH] D157388: [AMDGPU] Support FMin/FMax in AMDGPUAtomicOptimizer.
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Aug 29 15:09:32 PDT 2023
arsenm accepted this revision.
arsenm added inline comments.
This revision is now accepted and ready to land.
================
Comment at: llvm/test/CodeGen/AMDGPU/global_atomic_optimizer_fp_rtn.ll:631-633
+; IR-DPP-NEXT: [[TMP4:%.*]] = bitcast i64 [[TMP3]] to <2 x i32>
+; IR-DPP-NEXT: [[TMP5:%.*]] = extractelement <2 x i32> [[TMP4]], i32 0
+; IR-DPP-NEXT: [[TMP6:%.*]] = extractelement <2 x i32> [[TMP4]], i32 1
----------------
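The canonical way to extract the two 32-bit halves in the IR is trunc x for the low half and trunc (lshr x, 32) for the high half, rather than a bitcast to <2 x i32> followed by extractelement.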
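
As a rough illustration of the form suggested above, here is a minimal standalone sketch. The function name @split_i64 and the value names are hypothetical and not taken from the patch; the struct return is only there to make the snippet self-contained.

```llvm
; Hypothetical helper, not part of D157388: split an i64 into two i32 halves
; using trunc and lshr instead of bitcast + extractelement.
define { i32, i32 } @split_i64(i64 %x) {
  ; Low 32 bits: plain truncation.
  %lo = trunc i64 %x to i32
  ; High 32 bits: shift right by 32, then truncate.
  %shifted = lshr i64 %x, 32
  %hi = trunc i64 %shifted to i32
  ; Pack both halves into the return value.
  %r0 = insertvalue { i32, i32 } poison, i32 %lo, 0
  %r1 = insertvalue { i32, i32 } %r0, i32 %hi, 1
  ret { i32, i32 } %r1
}
```

On a little-endian target such as AMDGPU, %lo and %hi should correspond to elements 0 and 1 of the <2 x i32> bitcast in the checked lines.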
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D157388/new/
https://reviews.llvm.org/D157388