[llvm] [AArch64] Improve codegen for some fixed-width partial reductions (PR #126529)

David Sherwood via llvm-commits llvm-commits at lists.llvm.org
Mon Feb 10 07:30:05 PST 2025


================
@@ -175,13 +297,75 @@ define <4 x i32> @sudot(<4 x i32> %acc, <16 x i8> %u, <16 x i8> %s) #0{
 ; CHECK-I8MM:       // %bb.0:
 ; CHECK-I8MM-NEXT:    usdot v0.4s, v2.16b, v1.16b
 ; CHECK-I8MM-NEXT:    ret
-  %u.wide = sext <16 x i8> %u to <16 x i32>
-  %s.wide = zext <16 x i8> %s to <16 x i32>
-  %mult = mul nuw nsw <16 x i32> %s.wide, %u.wide
+  %s.wide = sext <16 x i8> %u to <16 x i32>
----------------
david-arm wrote:

I noticed the variable names were incorrect so I've corrected them while I'm here!

https://github.com/llvm/llvm-project/pull/126529


More information about the llvm-commits mailing list