[PATCH] D92940: [X86] Convert fadd/fmul _mm_reduce_* intrinsics to emit llvm.reduction intrinsics (PR47506)
Pengfei Wang via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Fri Dec 11 07:00:15 PST 2020
pengfei accepted this revision.
pengfei added a comment.
This revision is now accepted and ready to land.
LGTM. Thanks for bringing this refactor.
I also verified that ICC and GCC both do reduce math in an binary tree way, though sometimes ICC has a different LSB from GCC and Clang.
================
Comment at: clang/lib/Headers/avx512fintrin.h:9559
static __inline__ double __DEFAULT_FN_ATTRS512
_mm512_reduce_max_pd(__m512d __V) {
_mm512_mask_reduce_operator(max_pd);
----------------
Better to change min and max as well.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D92940/new/
https://reviews.llvm.org/D92940
More information about the cfe-commits
mailing list