[llvm] [AMDGPU] Merge consecutive wait_alu instruction (PR #128916)
Jay Foad via llvm-commits
llvm-commits at lists.llvm.org
Fri Feb 28 03:44:22 PST 2025
================
@@ -362,6 +397,26 @@ class AMDGPUWaitSGPRHazards {
Mask = AMDGPU::DepCtr::encodeFieldVaSdst(Mask, 0);
}
if (Emit) {
+ MachineInstr *PrevWaitAlu = nullptr;
+ if (MI != MI->getParent()->begin()) {
+ PrevWaitAlu = getPreviousWaitAlu(MI);
+ } else {
+ auto Preds = MBB.predecessors();
+ if (MBB.pred_size() == 1) {
----------------
jayfoad wrote:
I guess here you are relying on the fact that if the predecessor block ends with s_wait_alu then it must fall through into MBB, i.e. it only has a single successor. But does the single-successor --> single predecessor case really occur often enough to be worth handling here?
https://github.com/llvm/llvm-project/pull/128916
More information about the llvm-commits
mailing list