[PATCH] D147732: [AMDGPU] Add f32 permlane{16, x16} builtin variants
Matt Arsenault via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Fri Apr 7 15:43:45 PDT 2023
arsenm requested changes to this revision.
arsenm added a comment.
This revision now requires changes to proceed.
There is a benefit to not having bitcast noise in the IR
================
Comment at: llvm/include/llvm/IR/IntrinsicsAMDGPU.td:1962-1963
+// llvm.amdgcn.permlanex16.f32 <old> <src0> <src1> <src2> <fi> <bound_control>
+def int_amdgcn_permlanex16_f32 : ClangBuiltin<"__builtin_amdgcn_permlanex16_f32">,
+ Intrinsic<[llvm_float_ty],
+ [llvm_float_ty, llvm_float_ty, llvm_i32_ty, llvm_i32_ty, llvm_i1_ty, llvm_i1_ty],
----------------
Should use type mangling for the existing intrinsics rather than introducing new typed copies
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D147732/new/
https://reviews.llvm.org/D147732
More information about the cfe-commits
mailing list