[clang] [llvm] [mlir] [AArch64][SME] Improve codegen for aarch64.sme.cnts* when not in streaming mode (PR #154761)
Paul Walker via llvm-commits
llvm-commits at lists.llvm.org
Mon Sep 8 03:28:42 PDT 2025
================
@@ -4304,9 +4304,10 @@ Value *CodeGenFunction::EmitSMELd1St1(const SVETypeFlags &TypeFlags,
// size in bytes.
if (Ops.size() == 5) {
Function *StreamingVectorLength =
- CGM.getIntrinsic(Intrinsic::aarch64_sme_cntsb);
+ CGM.getIntrinsic(Intrinsic::aarch64_sme_cntsd);
llvm::Value *StreamingVectorLengthCall =
- Builder.CreateCall(StreamingVectorLength);
+ Builder.CreateMul(Builder.CreateCall(StreamingVectorLength),
+ llvm::ConstantInt::get(Int64Ty, 8), "svl");
----------------
paulwalker-arm wrote:
Is there anything that prevents the addition of no-wrap flags here?
https://github.com/llvm/llvm-project/pull/154761
More information about the llvm-commits
mailing list