[llvm] [AMDGPU] Add support for point sample accel out of order returns (PR #127991)
Stephen Thomas via llvm-commits
llvm-commits at lists.llvm.org
Fri Feb 21 01:46:52 PST 2025
================
@@ -924,8 +951,13 @@ void WaitcntBrackets::updateByEvent(const SIInstrInfo *TII,
// defs. That's required for a sane index into `VgprMemTypes` below
assert(TRI->isVectorRegister(*MRI, Op.getReg()));
VmemType V = getVmemType(Inst);
+ unsigned char TypesMask = 1 << V;
+ // If instruction can have Point Sample Accel applied, we have to flag
+ // this with another potential dependency
+ if (hasPointSampleAccel(Inst))
+ TypesMask |= 1 << VMEM_NOSAMPLER;
----------------
stepthomas wrote:
Agreed that the patch is fine as is, and that the suggested additional refactoring would be beneficial.
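
For readers following the hunk above, the mask logic can be sketched in isolation. This is a minimal, standalone illustration rather than the code in the patch: the enum values, the `makeTypesMask` and `hasOtherPendingTypes` helpers, and the `PointSampleAccel` flag (standing in for `hasPointSampleAccel(Inst)`) are all assumptions made for the example.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for the VmemType enum. The names mirror the hunk,
// but the numeric values are assumed for illustration only.
enum VmemType : unsigned { VMEM_NOSAMPLER = 0, VMEM_SAMPLER = 1, VMEM_BVH = 2 };

// Build the pending-type mask recorded for one VGPR def.
// PointSampleAccel is a hypothetical flag standing in for
// hasPointSampleAccel(Inst) in the hunk.
uint8_t makeTypesMask(VmemType V, bool PointSampleAccel) {
  uint8_t Mask = static_cast<uint8_t>(1u << V);
  // An instruction that may have Point Sample Accel applied can also
  // return like a no-sampler access, so record that type as pending too.
  if (PointSampleAccel)
    Mask |= static_cast<uint8_t>(1u << VMEM_NOSAMPLER);
  return Mask;
}

// Hypothetical query: true if any type other than V is pending in Mask.
bool hasOtherPendingTypes(uint8_t Mask, VmemType V) {
  return (Mask & ~(1u << V)) != 0;
}
```

With these assumed values, a sampler instruction that may use Point Sample Accel records both the sampler and no-sampler bits, so a later query for either type alone still sees another pending type and a wait is not skipped.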
https://github.com/llvm/llvm-project/pull/127991