[llvm] [AMDGPU] Add support for point sample accel out of order returns (PR #127991)
Jay Foad via llvm-commits
llvm-commits at lists.llvm.org
Thu Feb 20 09:23:29 PST 2025
================
@@ -924,8 +951,13 @@ void WaitcntBrackets::updateByEvent(const SIInstrInfo *TII,
// defs. That's required for a sane index into `VgprMemTypes` below
assert(TRI->isVectorRegister(*MRI, Op.getReg()));
VmemType V = getVmemType(Inst);
+ unsigned char TypesMask = 1 << V;
+ // If instruction can have Point Sample Accel applied, we have to flag
+ // this with another potential dependency
+ if (hasPointSampleAccel(Inst))
+ TypesMask |= 1 << VMEM_NOSAMPLER;
----------------
jayfoad wrote:
I think this patch is probably fine as-is. My only hesitation is whether it would be better to do some more refactoring, and fold this logic into `getVmemType` itself, calling it `getVmemTypes` and returning a bit mask. That would force every user of `getVmemType` to consider what to do if more than one bit is set in the mask.
https://github.com/llvm/llvm-project/pull/127991
More information about the llvm-commits
mailing list