[PATCH] D147732: [AMDGPU] Add f32 permlane{16, x16} builtin variants

Fri Apr 7 15:43:45 PDT 2023

arsenm requested changes to this revision.
arsenm added a comment.
This revision now requires changes to proceed.

There is a benefit to not having bitcast noise in the IR

================
Comment at: llvm/include/llvm/IR/IntrinsicsAMDGPU.td:1962-1963
+// llvm.amdgcn.permlanex16.f32 <old> <src0> <src1> <src2> <fi> <bound_control>
+def int_amdgcn_permlanex16_f32 : ClangBuiltin<"__builtin_amdgcn_permlanex16_f32">,
+  Intrinsic<[llvm_float_ty],
+            [llvm_float_ty, llvm_float_ty, llvm_i32_ty, llvm_i32_ty, llvm_i1_ty, llvm_i1_ty],
----------------
Should use type mangling for the existing intrinsics rather than introducing new typed copies 

Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D147732/new/

https://reviews.llvm.org/D147732