[PATCH] D83862: [AMDGPU] Add missing test prefixes
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Jul 15 11:43:44 PDT 2020
rampitec added inline comments.
================
Comment at: llvm/test/CodeGen/AMDGPU/perfhint.ll:33
; GCN-LABEL: {{^}}test_large_stride:
-; MemoryBound: 0
-; WaveLimiterHint : 1
+; GCN: MemoryBound: 0
+; GCN: WaveLimiterHint : 1
----------------
foad wrote:
> This check fails.
This one is memory bound, there are practically only memory operations here. I think it needs some ALU in between to catch large stride only as intended.
================
Comment at: llvm/test/CodeGen/AMDGPU/perfhint.ll:87
+; GCN: MemoryBound: 0
+; GCN: WaveLimiterHint : 0
define amdgpu_kernel void @test_indirect_through_phi(float addrspace(1)* %arg) {
----------------
foad wrote:
> This check fails. Perhaps D47740 never worked?
Looks like it did not :(
Anyway, this case is not memory bound even though it is indirect. This is because we have a single load followed by multiple stores, that was the point of the check.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D83862/new/
https://reviews.llvm.org/D83862
More information about the llvm-commits
mailing list