[llvm] [AMDGPU] Add support for point sample accel out of order returns (PR #127991)

Stephen Thomas via llvm-commits llvm-commits at lists.llvm.org
Fri Feb 21 01:46:52 PST 2025


================
@@ -924,8 +951,13 @@ void WaitcntBrackets::updateByEvent(const SIInstrInfo *TII,
           // defs. That's required for a sane index into `VgprMemTypes` below
           assert(TRI->isVectorRegister(*MRI, Op.getReg()));
           VmemType V = getVmemType(Inst);
+          unsigned char TypesMask = 1 << V;
+          // If instruction can have Point Sample Accel applied, we have to flag
+          // this with another potential dependency
+          if (hasPointSampleAccel(Inst))
+            TypesMask |= 1 << VMEM_NOSAMPLER;
----------------
stepthomas wrote:

Agreed that the patch is fine as is, and that the suggested additional refactoring would be beneficial

https://github.com/llvm/llvm-project/pull/127991


More information about the llvm-commits mailing list