[llvm] [AArch64] Improve codegen for some fixed-width partial reductions (PR #126529)
David Sherwood via llvm-commits
llvm-commits at lists.llvm.org
Mon Feb 10 07:30:05 PST 2025
================
@@ -175,13 +297,75 @@ define <4 x i32> @sudot(<4 x i32> %acc, <16 x i8> %u, <16 x i8> %s) #0{
; CHECK-I8MM: // %bb.0:
; CHECK-I8MM-NEXT: usdot v0.4s, v2.16b, v1.16b
; CHECK-I8MM-NEXT: ret
- %u.wide = sext <16 x i8> %u to <16 x i32>
- %s.wide = zext <16 x i8> %s to <16 x i32>
- %mult = mul nuw nsw <16 x i32> %s.wide, %u.wide
+ %s.wide = sext <16 x i8> %u to <16 x i32>
----------------
david-arm wrote:
I noticed the variable names were incorrect so I've corrected them while I'm here!
https://github.com/llvm/llvm-project/pull/126529
More information about the llvm-commits
mailing list