[Mlir-commits] [mlir] [MLIR][NVVM] Update TMA Load Op (PR #156347)
Durgadoss R
llvmlistbot at llvm.org
Wed Sep 17 06:02:12 PDT 2025
================
@@ -1535,6 +1560,123 @@ mlir::NVVM::IDArgPair CpAsyncBulkSharedCTAToGlobalOp::getIntrinsicIDAndArgs(
   return {id, std::move(args)};
 }
+bool CpAsyncBulkTensorGlobalToSharedClusterOp::getAsmValues(
+    RewriterBase &rewriter,
+    llvm::SmallVectorImpl<std::pair<mlir::Value, mlir::NVVM::PTXRegisterMod>>
+        &asmValues) {
+  // Add all the operands but not the attrs to the asmValues list.
+  // The attrs here are used to generate the right variants for
+  // intrinsics-lowering. So, we ignore them while generating inline-PTX.
+  for (auto val : getOperands())
+    asmValues.push_back({val, mlir::NVVM::PTXRegisterMod::Read});
+
+  return false;
+}
+
+mlir::NVVM::IDArgPair
+CpAsyncBulkTensorGlobalToSharedClusterOp::getIntrinsicIDAndArgs(
+    Operation &op, LLVM::ModuleTranslation &mt, llvm::IRBuilderBase &builder) {
+  auto thisOp = cast<NVVM::CpAsyncBulkTensorGlobalToSharedClusterOp>(op);
+  const bool isCTAOnly = thisOp.getIsCTAOnly();
+  llvm::SmallVector<llvm::Value *> args;
+
+  // Fill the Intrinsic Args
+  args.push_back(mt.lookupValue(thisOp.getDstMem()));
+  args.push_back(mt.lookupValue(thisOp.getMbar()));
+  args.push_back(mt.lookupValue(thisOp.getTmaDescriptor()));
+
+  // Coordinates and im2col-offsets
+  for (auto v : thisOp.getCoordinates())
----------------
durga4github wrote:
Sure, fixed.
Resolving this.
https://github.com/llvm/llvm-project/pull/156347