[PATCH] D129759: [AMDGPU] Set amdgpu-memory-bound if a basic block has dense global memory access
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Jul 14 11:30:34 PDT 2022
rampitec added a comment.
This looks generally reasonable to me. Although I am thinking about a potential huge kernel with only a single block like this...
I guess it comes down to the question if that block is hot or cold. There is no perfect heuristic out there, but chances are higher a block is hot if it is in a loop.
Is the actual block you are considering in the regressed app inside a loop? Is that a "small single bb loop", i.e. when the bb branches right to itself? This is more likely to be a "dense memory access code" (and you will not need LoopInfo).
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D129759/new/
https://reviews.llvm.org/D129759
More information about the llvm-commits
mailing list