[PATCH] D45203: [X86] VRNDSCALE* folding from masked and single-value ffloor and fceil patterns

Craig Topper via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Jun 4 10:17:02 PDT 2018


craig.topper added inline comments.


================
Comment at: llvm/lib/Target/X86/X86InstrAVX512.td:8605
+                                v4f32x_info, fp32imm0, 0x01,
+                                (COPY_TO_REGCLASS $mask, VK1WM), HasVLX>;
+defm : avx512_masked_scalar_imm<ffloor, "RNDSCALESS", X86Movss,
----------------
Why HasVLX? Shouldn't scalar instructions be valid under HasAVX512?


================
Comment at: llvm/lib/Target/X86/X86InstrAVX512.td:9777
+def : Pat<(v16f32 (vselect VK16WM:$mask, (ffloor VR512:$src), VR512:$dst)),
+          (VRNDSCALEPSZrrik VR512:$dst, VK16WM:$mask, VR512:$src, (i32 0x9))>;
 def : Pat<(v16f32 (fnearbyint VR512:$src)),
----------------
What about zero masking?


https://reviews.llvm.org/D45203





More information about the llvm-commits mailing list