[llvm] [AMDGPU] Insert before and after instructions that always use GDS (PR #131338)
Stephen Thomas via llvm-commits
llvm-commits at lists.llvm.org
Wed Mar 19 06:53:21 PDT 2025
================
@@ -1982,6 +2003,65 @@ static bool isCacheInvOrWBInst(MachineInstr &Inst) {
Opc == AMDGPU::GLOBAL_WBINV;
}
+// Return true if the next instruction is S_ENDPGM, following fallthrough
+// blocks if necessary.
+bool SIInsertWaitcnts::isNextENDPGM(MachineBasicBlock::instr_iterator It,
+ MachineBasicBlock *Block) const {
+ auto E = Block->instr_end();
+
+ while (true) {
+ if (It == E) {
+ if (auto FallThrough = Block->getFallThrough(false)) {
----------------
stepthomas wrote:
` auto EndBlock = Block->getParent()->end();
while (true) {
if (It.isEnd()) {
auto NextBlock = std::next(Block->getIterator());
if (NextBlock != EndBlock) {
It = NextBlock->instr_begin();
Block = &*NextBlock;
continue;
}
return false;
}
if (!It->isMetaInstruction())
break;
It++;
}
assert(!It.isEnd());
`
This is what come up with if I want to use `std::next()`. I think it could get a bit more readable if I had an iterator over blocks starting from `Block`, advancing it as the instruction iterator encountered block ends, but is `++` on iterators the same as using `std::next()`?
https://github.com/llvm/llvm-project/pull/131338
More information about the llvm-commits
mailing list