[PATCH] D96110: [X86] Pass to transform tdpbf16ps intrinsics to scalar operation.
Pengfei Wang via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Tue Feb 9 01:45:22 PST 2021
pengfei added inline comments.
================
Comment at: llvm/test/CodeGen/X86/AMX/amx-low-intrinsics.ll:174-175
+; CHECK-NEXT: [[TMP13:%.*]] = bitcast i32 [[TMP12]] to <2 x i16>
+; CHECK-NEXT: [[TMP14:%.*]] = zext <2 x i16> [[TMP11]] to <2 x i32>
+; CHECK-NEXT: [[TMP15:%.*]] = shl <2 x i32> [[TMP14]], <i32 16, i32 16>
+; CHECK-NEXT: [[TMP16:%.*]] = bitcast <2 x i32> [[TMP15]] to <2 x float>
----------------
Can we use a shuffle instruction?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D96110/new/
https://reviews.llvm.org/D96110
More information about the cfe-commits
mailing list