[all-commits] [llvm/llvm-project] 4319e1: [mlir][nvgpu] Introduce Multicast Capability to `n...
Guray Ozen via All-commits
all-commits at lists.llvm.org
Fri Jan 5 01:49:10 PST 2024
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: 4319e1916dd13f5f7f56536acf01899320b82c6a
https://github.com/llvm/llvm-project/commit/4319e1916dd13f5f7f56536acf01899320b82c6a
Author: Guray Ozen <guray.ozen at gmail.com>
Date: 2024-01-05 (Fri, 05 Jan 2024)
Changed paths:
M mlir/include/mlir/Dialect/NVGPU/IR/NVGPU.td
M mlir/lib/Conversion/NVGPUToNVVM/NVGPUToNVVM.cpp
M mlir/lib/Dialect/NVGPU/TransformOps/NVGPUTransformOps.cpp
M mlir/test/Conversion/NVGPUToNVVM/nvgpu-to-nvvm.mlir
Log Message:
-----------
[mlir][nvgpu] Introduce Multicast Capability to `nvgpu.tma.async.load` (#76935)
This PR adds multicast support to the `nvgpu.tma.async.load` Op. The
lower-level `nvvm.cp.async.bulk.tensor.shared.cluster.global` NVVM Op
already had this capability; this PR lowers the multicast mask
information from the NVGPU Op to that NVVM operation.
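
For readers unfamiliar with the mask operand, here is a minimal sketch of
how such a mask might be built. It assumes the mask follows the PTX
convention for `.multicast::cluster`, where each bit of a 16-bit value
selects the destination CTA with that rank in the cluster. The commit
message does not spell out the encoding, and the helper name
`multicast_mask` is hypothetical, not part of MLIR.

```python
def multicast_mask(cta_ranks):
    """Return a 16-bit mask with one bit set per destination CTA rank.

    Assumption: bit i selects the CTA whose rank within the cluster is i,
    as in the PTX `.multicast::cluster` ctaMask operand.
    """
    mask = 0
    for rank in cta_ranks:
        if not 0 <= rank < 16:
            raise ValueError(f"CTA rank {rank} does not fit in a 16-bit mask")
        mask |= 1 << rank
    return mask


# Deliver the loaded tile to CTAs 0, 1 and 3 of the cluster.
mask = multicast_mask([0, 1, 3])
print(f"{mask:#06x}")  # prints 0x000b
```

A value computed this way would be what the Op's mask operand carries down
to the NVVM operation, under the assumption stated above.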