[PATCH] D156647: [AMDGPU] Extend f32 support for llvm.amdgcn.update.dpp intrinsic
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Aug 17 06:50:24 PDT 2023
arsenm accepted this revision.
arsenm added inline comments.
================
Comment at: llvm/test/CodeGen/AMDGPU/llvm.amdgcn.update.dpp.ll:219
+define amdgpu_kernel void @dpp_test_f32_imm_comb8(ptr addrspace(1) %out, float %in1, float %in2) {
+ %tmp0 = call float @llvm.amdgcn.update.dpp.f32(float %in1, float %in2, i32 31, i32 63, i32 128, i1 1) #0
+ store float %tmp0, ptr addrspace(1) %out
----------------
arsenm wrote:
> Nit: drop the call site attributes and use true/false instead of i1 0/1
test nit not done
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D156647/new/
https://reviews.llvm.org/D156647
More information about the llvm-commits
mailing list