[clang] [Clang] VectorExprEvaluator::VisitCallExpr / InterpretBuiltin - add MMX/SSE/AVX/AVX512 PMULHRSW intrinsics to be used in constexpr (PR #160636)
Simon Pilgrim via cfe-commits
cfe-commits at lists.llvm.org
Thu Sep 25 01:11:24 PDT 2025
================
@@ -3423,6 +3423,20 @@ bool InterpretBuiltin(InterpState &S, CodePtr OpPC, const CallExpr *Call,
return LHS.isSigned() ? LHS.ssub_sat(RHS) : LHS.usub_sat(RHS);
});
+
+ case clang::X86::BI__builtin_ia32_pmulhrsw128:
+ case clang::X86::BI__builtin_ia32_pmulhrsw256:
+ case clang::X86::BI__builtin_ia32_pmulhrsw512:
+ return interp__builtin_elementwise_int_binop(
+ S, OpPC, Call, [](const APSInt &LHS, const APSInt &RHS) {
+ unsigned Width = LHS.getBitWidth();
+
+ APInt Mul = llvm::APIntOps::mulhs(LHS, RHS);
+ Mul = Mul.relativeLShr(14);
+ Mul = Mul.sadd_sat(APInt(Width, 1, true));
----------------
RKSimon wrote:
Not sure sadd_sat is correct - compare against intel intrinsics guide and simplifyX86pmulh in X86InstCombineIntrinsic.cpp
https://github.com/llvm/llvm-project/pull/160636
More information about the cfe-commits
mailing list