[clang] [X86][Clang] VectorExprEvaluator::VisitCallExpr / InterpretBuiltin - Allow AVX/AVX512 IFMA madd52 intrinsics to be used in constexpr (PR #161056)
Simon Pilgrim via cfe-commits
cfe-commits at lists.llvm.org
Tue Sep 30 00:49:37 PDT 2025
================
@@ -3523,6 +3523,26 @@ bool InterpretBuiltin(InterpState &S, CodePtr OpPC, const CallExpr *Call,
       return F;
     });
+  case X86::BI__builtin_ia32_vpmadd52luq128:
+  case X86::BI__builtin_ia32_vpmadd52luq256:
+  case X86::BI__builtin_ia32_vpmadd52luq512:
+    return interp__builtin_elementwise_triop(
+        S, OpPC, Call, [](const APSInt &A, const APSInt &B, const APSInt &C) {
+          APSInt Result(A + (B.trunc(52) * C.trunc(52)).trunc(52).zext(64),
+                        false);
+          return APSInt(Result.trunc(52).zext(64), false);
----------------
RKSimon wrote:
This is incorrect: only the multiply is performed as i52; the accumulate is a full i64 add, so the final result must not be truncated to 52 bits:
```
return APSInt(A + (B.trunc(52) * C.trunc(52)).trunc(52).zext(64), false);
```
Same for the others.
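
To illustrate the difference, here is a minimal scalar sketch of one 64-bit lane, assuming plain `uint64_t` arithmetic rather than `APSInt`. The names `Mask52`, `madd52lo` and `madd52loTruncated` are hypothetical and exist only for this example; they are not part of the patch or of Clang.

```cpp
#include <cstdint>

// Hypothetical scalar model of a single 64-bit lane; names are illustrative.
constexpr std::uint64_t Mask52 = (std::uint64_t{1} << 52) - 1;

// Semantics described in the review: the multiply uses the low 52 bits of
// B and C and keeps the low 52 bits of the product; the add into A is a
// plain 64-bit add.
constexpr std::uint64_t madd52lo(std::uint64_t A, std::uint64_t B,
                                 std::uint64_t C) {
  // A wrapping 64-bit multiply preserves the low 52 bits of the full product.
  return A + (((B & Mask52) * (C & Mask52)) & Mask52);
}

// Behaviour of the code under review: the sum is truncated to 52 bits too.
constexpr std::uint64_t madd52loTruncated(std::uint64_t A, std::uint64_t B,
                                          std::uint64_t C) {
  return madd52lo(A, B, C) & Mask52;
}

// An accumulator with a bit set above bit 51 exposes the difference.
constexpr std::uint64_t Acc = std::uint64_t{1} << 60;
static_assert(madd52lo(Acc, 3, 5) == Acc + 15,
              "accumulator bits above bit 51 are kept");
static_assert(madd52loTruncated(Acc, 3, 5) == 15,
              "the extra truncation drops bit 60");
```

Any test for the constexpr path would presumably want at least one accumulator value with bits set above bit 51, since smaller inputs cannot distinguish the two versions.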
https://github.com/llvm/llvm-project/pull/161056