[PATCH] D90051: AMDGPU/GlobalISel: Add floating point med3 combine for IEEE=false
    Matt Arsenault via Phabricator via llvm-commits 
    llvm-commits at lists.llvm.org
       
    Fri Oct 23 09:09:32 PDT 2020
    
    
  
arsenm added a comment.
Missing tests for f16 and v2f16 cases
================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUPostLegalizerCombiner.cpp:174-175
+
+    // TODO: Add G_FMINNUM_IEEE (requires some additional checks for possible
+    // SNaN input).
+    if (OpcodeTriple.Min == AMDGPU::G_FMINNUM) {
----------------
Might as well handle this now
================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUPostLegalizerCombiner.cpp:181-182
+      const SIInstrInfo *TII = MF->getSubtarget<GCNSubtarget>().getInstrInfo();
+      APInt KObits = KO_FPImm.bitcastToAPInt();
+      APInt K1bits = K1_FPImm.bitcastToAPInt();
+      if ((!MRI.hasOneNonDBGUse(K0Def) || TII->isInlineConstant(KObits)) &&
----------------
You can directly pass the APFloat to is the overload of isInlineConstant
CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D90051/new/
https://reviews.llvm.org/D90051
    
    
More information about the llvm-commits
mailing list