[PATCH] D63160: [AMDGPU] Custom lower INSERT_SUBVECTOR v3, v4, v5, v8
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Jun 11 13:16:16 PDT 2019
arsenm added inline comments.
================
Comment at: lib/Target/AMDGPU/SIISelLowering.cpp:348
+ setOperationAction(ISD::INSERT_SUBVECTOR, MVT::v8f32, Custom);
// BUFFER/FLAT_ATOMIC_CMP_SWAP on GCN GPUs needs input marshalling,
----------------
Should this handle MVT::Other and get all the types? What happens with v6i32 or v4i16?
================
Comment at: test/CodeGen/AMDGPU/insert-subvector-unused-scratch.ll:6
+
+; GCN: ScratchSize: 0
+define amdgpu_kernel void @store_v3i32(<3 x i32> addrspace(3)* %out, <3 x i32> %a) nounwind {
----------------
Should check more, even if generated
================
Comment at: test/CodeGen/AMDGPU/insert-subvector-unused-scratch.ll:12
+ ret void
+}
----------------
No case for 5 x?
Repository:
rL LLVM
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D63160/new/
https://reviews.llvm.org/D63160
More information about the llvm-commits
mailing list