[PATCH] D79776: [AMDGPU] Allow use of StackPtrOffsetReg when building spills

Matt Arsenault via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed May 13 15:18:52 PDT 2020


arsenm added inline comments.


================
Comment at: llvm/test/CodeGen/AMDGPU/spill-scavenge-offset.ll:39
+; CHECK-LABEL: test_limited_sgpr
+define amdgpu_kernel void @test_limited_sgpr(<1280 x i32> addrspace(1)* %out, <1280 x i32> addrspace(1)* %in) #0 {
+entry:
----------------
Needs to check something


================
Comment at: llvm/test/CodeGen/AMDGPU/spill-scavenge-offset.ll:45
+  %aptr = getelementptr <1280 x i32>, <1280 x i32> addrspace(1)* %in, i32 %tid
+  %a = load <1280 x i32>, <1280 x i32> addrspace(1)* %aptr
+
----------------
A more directed tests would be better. We have a few others relying on splitting up giant vector loads, and they're some  of the slowest tests in the entire lit suite


================
Comment at: llvm/test/CodeGen/AMDGPU/spill-scavenge-offset.ll:65
 
+attributes #0 = { "amdgpu-num-sgpr"="30" }
 attributes #1 = { nounwind readnone }
----------------
Can you use amdgpu-max-waves-per-eu? I'm trying to move away from the direct register limit attributes


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D79776/new/

https://reviews.llvm.org/D79776





More information about the llvm-commits mailing list