[llvm] [AMDGPU] Support block load/store for CSR (PR #130013)
Matt Arsenault via llvm-commits
llvm-commits at lists.llvm.org
Mon Mar 17 21:06:32 PDT 2025
================
@@ -239,6 +240,34 @@ const MCExpr *AMDGPUAsmPrinter::lowerConstant(const Constant *CV) {
return AsmPrinter::lowerConstant(CV);
}
+static void emitVGPRBlockComment(const MachineInstr *MI, const SIInstrInfo *TII,
+ const TargetRegisterInfo *TRI,
+ const SIMachineFunctionInfo *MFI,
+ MCStreamer &OS) {
+ // The instruction will only transfer a subset of the registers in the block,
+ // based on the mask that is stored in m0. We could search for the instruction
+ // that sets m0, but most of the time we'll already have the mask stored in
+ // the machine function info. Try to use that. This assumes that we only use
+ // block loads/stores for CSR spills.
+ Register RegBlock =
+ TII->getNamedOperand(*MI, MI->mayLoad() ? AMDGPU::OpName::vdst
+ : AMDGPU::OpName::vdata)
+ ->getReg();
+ Register FirstRegInBlock = TRI->getSubReg(RegBlock, AMDGPU::sub0);
+ uint32_t Mask = MFI->getMaskForVGPRBlockOps(RegBlock);
+
+ SmallString<512> TransferredRegs;
+ for (unsigned I = 0; I < sizeof(Mask) * 8; ++I) {
+ if (Mask & (1 << I)) {
+ (llvm::Twine(" ") + TRI->getRegAsmName(FirstRegInBlock + I))
+ .toVector(TransferredRegs);
+ }
+ }
+
+ if (!TransferredRegs.empty())
----------------
arsenm wrote:
Replace this with a Mask == 0 early exit at the start?
https://github.com/llvm/llvm-project/pull/130013
More information about the llvm-commits
mailing list