[PATCH] D89618: [AMDGPU] Optimize waitcnt insertion for flat memory operations
Nicolai Hähnle via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Oct 19 10:31:36 PDT 2020
nhaehnle added a comment.
LGTM modulo that inline question.
================
Comment at: llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp:1248
+
+ if (TII->usesVM_CNT(Inst) && mayAccessVMEMThroughFlat(Inst)) {
+ ++FlatASCount;
----------------
arsenm wrote:
> When is usesVM_CNT false and isFLAT true?
I'd like to know as well. It may be better to make this an `assert(TII->usesVM_CNT(Inst))` instead.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D89618/new/
https://reviews.llvm.org/D89618
More information about the llvm-commits
mailing list