[PATCH] D45202: [X86] Replacing X86-specific floor and ceil vector intrinsics with generic LLVM intrinsics

Craig Topper via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Tue Apr 3 13:07:06 PDT 2018


craig.topper added inline comments.


================
Comment at: include/clang/Basic/BuiltinsX86.def:951
 TARGET_BUILTIN(__builtin_ia32_rndscalepd_mask, "V8dV8dIiV8dUcIi", "", "avx512f")
+TARGET_BUILTIN(__builtin_ia32_floorps_mask, "V16fV16fV16fUs", "", "avx512f")
+TARGET_BUILTIN(__builtin_ia32_floorpd_mask, "V8dV8dV8dUc", "", "avx512f")
----------------
I'd prefer CGBuiltin to detect the specific immediates on the rndscale value. Primarily because we should be able to optimize _mm512_roundscale_pd when the ceil/floor immediate is used.


Repository:
  rC Clang

https://reviews.llvm.org/D45202





More information about the cfe-commits mailing list