[PATCH] D96110: [X86] Pass to transform tdpbf16ps intrinsics to scalar operation.

Pengfei Wang via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Tue Feb 9 01:45:22 PST 2021


pengfei added inline comments.


================
Comment at: llvm/test/CodeGen/X86/AMX/amx-low-intrinsics.ll:174-175
+; CHECK-NEXT:    [[TMP13:%.*]] = bitcast i32 [[TMP12]] to <2 x i16>
+; CHECK-NEXT:    [[TMP14:%.*]] = zext <2 x i16> [[TMP11]] to <2 x i32>
+; CHECK-NEXT:    [[TMP15:%.*]] = shl <2 x i32> [[TMP14]], <i32 16, i32 16>
+; CHECK-NEXT:    [[TMP16:%.*]] = bitcast <2 x i32> [[TMP15]] to <2 x float>
----------------
Can we use a shuffle instruction?


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D96110/new/

https://reviews.llvm.org/D96110



More information about the cfe-commits mailing list