[clang] [llvm] [AMDGPU][WIP] Extend permlane16, permlanex16 and permlane64 intrinsic lowering for generic types (PR #92725)
Matt Arsenault via cfe-commits
cfe-commits at lists.llvm.org
Mon May 20 09:20:00 PDT 2024
================
@@ -5433,7 +5450,16 @@ bool AMDGPULegalizerInfo::legalizeLaneOp(LegalizerHelper &Helper,
? Src0
: B.buildBitcast(LLT::scalar(Size), Src0).getReg(0);
Src0 = B.buildAnyExt(S32, Src0Cast).getReg(0);
- if (Src2.isValid()) {
+
+ if (IsPermLane16) {
+ Register Src1Cast =
+ MRI.getType(Src1).isScalar()
+ ? Src1
+ : B.buildBitcast(LLT::scalar(Size), Src2).getReg(0);
----------------
arsenm wrote:
Like the other patch, shouldn't need any bitcasts
https://github.com/llvm/llvm-project/pull/92725
More information about the cfe-commits
mailing list