[PATCH] D159250: [X86][RFC] Add new option `-m[no-]evex512` to disable ZMM and 64-bit mask instructions for AVX512 features
Simon Pilgrim via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Thu Aug 31 09:28:16 PDT 2023
RKSimon added a comment.
Would it be possible to add function multiversioning tests to ensure the evex512 attribute would work with it?
================
Comment at: clang/lib/CodeGen/Targets/X86.cpp:1517
+ bool Caller256 = CallerMap.lookup("avx512f") && !CallerMap.lookup("evex512");
+ bool Callee256 = CallerMap.lookup("avx512f") && !CallerMap.lookup("evex512");
+
----------------
typo in Callee256?
================
Comment at: clang/test/CodeGen/X86/avx512-error.c:9
+ return __builtin_ia32_sqrtpd512(a, _MM_FROUND_CUR_DIRECTION); // expected-error {{'__builtin_ia32_sqrtpd512' needs target feature evex512}}
+}
----------------
add __mmask64 test ? _knot_mask64 or _cvtmask64_u64 maybe?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D159250/new/
https://reviews.llvm.org/D159250
More information about the cfe-commits
mailing list