[PATCH] D90051: AMDGPU/GlobalISel: Add floating point med3 combine for IEEE=false
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Oct 23 09:09:32 PDT 2020
arsenm added a comment.
Missing tests for f16 and v2f16 cases
================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUPostLegalizerCombiner.cpp:174-175
+
+ // TODO: Add G_FMINNUM_IEEE (requires some additional checks for possible
+ // SNaN input).
+ if (OpcodeTriple.Min == AMDGPU::G_FMINNUM) {
----------------
Might as well handle this now
================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUPostLegalizerCombiner.cpp:181-182
+ const SIInstrInfo *TII = MF->getSubtarget<GCNSubtarget>().getInstrInfo();
+ APInt KObits = KO_FPImm.bitcastToAPInt();
+ APInt K1bits = K1_FPImm.bitcastToAPInt();
+ if ((!MRI.hasOneNonDBGUse(K0Def) || TII->isInlineConstant(KObits)) &&
----------------
You can directly pass the APFloat to is the overload of isInlineConstant
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D90051/new/
https://reviews.llvm.org/D90051
More information about the llvm-commits
mailing list