[all-commits] [llvm/llvm-project] a3388f: [mlir] Introduce a pattern to lower `gpu.subgroup_...
Guray Ozen via All-commits
all-commits at lists.llvm.org
Fri Jan 20 04:56:37 PST 2023
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: a3388f3e2a30eaca4123f14a6444fd5aa03b24c5
https://github.com/llvm/llvm-project/commit/a3388f3e2a30eaca4123f14a6444fd5aa03b24c5
Author: Guray Ozen <guray.ozen at gmail.com>
Date: 2023-01-20 (Fri, 20 Jan 2023)
Changed paths:
M mlir/include/mlir/Conversion/GPUToNVVM/GPUToNVVMPass.h
M mlir/include/mlir/Conversion/Passes.td
M mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp
M mlir/test/Conversion/GPUToNVVM/gpu-to-nvvm.mlir
Log Message:
-----------
[mlir] Introduce a pattern to lower `gpu.subgroup_reduce` to `nvvm.redux_op`
This revision introduces a pattern to lower the `gpu.subgroup_reduce` op to the `nvvm.redux.sync` op. The op must be executed by the entire subgroup; otherwise the behaviour is undefined.
Because the op is only available on some GPUs (sm80+), the revision also adds a flag and a populate function, so that the lowering can be enabled only when desired.
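To illustrate the semantics being lowered, here is a minimal host-side model of a subgroup reduction. It is not code from this commit and not the lowering itself. The 32-lane subgroup width, the choice of an add reduction, and the names `kSubgroupSize` and `subgroupReduceAdd` are assumptions made for illustration only.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <numeric>
#include <vector>

// Assumed subgroup width: 32 lanes, as in an NVIDIA warp.
constexpr std::size_t kSubgroupSize = 32;

// Hypothetical host-side model of an add reduction across a subgroup:
// each lane contributes one value and every lane receives the same total.
std::vector<int32_t> subgroupReduceAdd(const std::vector<int32_t> &laneValues) {
  // Models the requirement that the entire subgroup executes the op.
  assert(laneValues.size() == kSubgroupSize && "all lanes must participate");
  const int32_t total =
      std::accumulate(laneValues.begin(), laneValues.end(), int32_t{0});
  // Broadcast the reduced value back to every lane.
  return std::vector<int32_t>(kSubgroupSize, total);
}
```

In this model, passing the lane values 0 through 31 returns a vector in which all 32 entries hold the same sum. The assertion stands in for the "entire subgroup" requirement; on real hardware a violation is undefined behaviour rather than a checked error.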
Depends on D142088
Reviewed By: nicolasvasilache
Differential Revision: https://reviews.llvm.org/D142103