[Mlir-commits] [mlir] [mlir][nvgpu] Mark TMA descriptor as MemWriteAt in `tma.async.store` (PR #79427)
Mehdi Amini
llvmlistbot at llvm.org
Thu Jan 25 19:49:08 PST 2024
joker-eph wrote:
I have some questions about the `NVGPU_TensorMapDescriptor` type in MLIR.
Does it map to a `CUtensorMap` object? Or to a pointer to a `CUtensorMap` object?
In Cuda, the `CUtensorMap` object is setup on the host, and then **copied** to the GPU memory. Inside the kernel the PTX instruction are using a `CUtensorMap *`.
I am afraid that MLIR is sometimes using `NVGPU_TensorMapDescriptor` as a `CUtensorMap` and other times as a `CUtensorMap *`
https://github.com/llvm/llvm-project/pull/79427
More information about the Mlir-commits
mailing list