[PATCH] D159250: [X86][RFC] Add new option `-m[no-]evex512` to disable ZMM and 64-bit mask instructions for AVX512 features

Simon Pilgrim via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Thu Aug 31 09:28:16 PDT 2023


RKSimon added a comment.

Would it be possible to add function multiversioning tests to ensure the evex512 attribute works correctly with multiversioned functions?



================
Comment at: clang/lib/CodeGen/Targets/X86.cpp:1517
+  bool Caller256 = CallerMap.lookup("avx512f") && !CallerMap.lookup("evex512");
+  bool Callee256 = CallerMap.lookup("avx512f") && !CallerMap.lookup("evex512");
+
----------------
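Typo in Callee256? It looks up CallerMap, exactly as Caller256 does, so the two values are always identical; presumably it should look up the callee's feature map instead.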
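
To illustrate the concern, here is a minimal standalone sketch of the check with each flag reading its own map. It is not the patch's code: `FeatureMap`, `lookup`, `is256BitOnly`, and `example` are hypothetical names, `std::unordered_map` merely stands in for whatever map type the patch uses, and the name `CalleeMap` is assumed by analogy with `CallerMap`.

```cpp
#include <cassert>
#include <string>
#include <unordered_map>

// Hypothetical stand-in for the feature maps in the patch.
using FeatureMap = std::unordered_map<std::string, bool>;

// A missing key reads as false, which is the behaviour the quoted
// lines appear to rely on when they call lookup().
static bool lookup(const FeatureMap &Map, const std::string &Key) {
  auto It = Map.find(Key);
  return It != Map.end() && It->second;
}

// True when AVX512F is enabled but 512-bit EVEX (evex512) is not.
static bool is256BitOnly(const FeatureMap &Map) {
  return lookup(Map, "avx512f") && !lookup(Map, "evex512");
}

// Caller permits 512-bit vectors, callee is limited to 256-bit.
static void example() {
  FeatureMap CallerMap = {{"avx512f", true}, {"evex512", true}};
  FeatureMap CalleeMap = {{"avx512f", true}}; // evex512 absent (assumed name)

  bool Caller256 = is256BitOnly(CallerMap);
  bool Callee256 = is256BitOnly(CalleeMap); // reads the callee's map

  // With the quoted code both flags would come from CallerMap and
  // could never differ; here the mismatch is visible.
  assert(!Caller256);
  assert(Callee256);
}
```

Factoring the predicate into one helper that takes the map as a parameter is one way to make this kind of copy-paste slip harder to reintroduce.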


================
Comment at: clang/test/CodeGen/X86/avx512-error.c:9
+  return __builtin_ia32_sqrtpd512(a, _MM_FROUND_CUR_DIRECTION); // expected-error {{'__builtin_ia32_sqrtpd512' needs target feature evex512}}
+}
----------------
Add a __mmask64 test as well? Maybe _knot_mask64 or _cvtmask64_u64?


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D159250/new/

https://reviews.llvm.org/D159250


