[all-commits] [llvm/llvm-project] 108380: [mlir][nvvm] Add `cp.async.bulk.tensor.shared.clus...

Guray Ozen via All-commits all-commits at lists.llvm.org
Thu Nov 16 05:35:09 PST 2023


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 108380da357e2db513f016d33adede0d58636bea
      https://github.com/llvm/llvm-project/commit/108380da357e2db513f016d33adede0d58636bea
  Author: Guray Ozen <guray.ozen at gmail.com>
  Date:   2023-11-16 (Thu, 16 Nov 2023)

  Changed paths:
    M mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td
    M mlir/lib/Conversion/NVGPUToNVVM/NVGPUToNVVM.cpp
    M mlir/test/Conversion/NVVMToLLVM/nvvm-to-llvm.mlir

  Log Message:
  -----------
  [mlir][nvvm] Add `cp.async.bulk.tensor.shared.cluster.global.multicast` (#72429)

This PR introduces the `cp.async.bulk.tensor.shared.cluster.global.multicast`
Op in the NVVM dialect. It uses TMA to load data from global memory into the
shared memory of multiple CTAs in the cluster.

It resolves #72368



