[all-commits] [llvm/llvm-project] d20615: [mlir][vector] Modify constraint and interface for...
Stanley Winata via All-commits
all-commits at lists.llvm.org
Wed Nov 9 11:54:26 PST 2022
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: d2061530dc093daca93fbb268611e1a146e722de
https://github.com/llvm/llvm-project/commit/d2061530dc093daca93fbb268611e1a146e722de
Author: stanley-nod <stanley at nod-labs.com>
Date: 2022-11-09 (Wed, 09 Nov 2022)
Changed paths:
M mlir/lib/Dialect/Vector/Transforms/VectorDistribute.cpp
M mlir/test/lib/Dialect/Vector/TestVectorTransforms.cpp
Log Message:
-----------
[mlir][vector] Modify constraint and interface for warp reduce on f16 and i8
Quantization is crucial and ubiquitous in accelerating machine
learning workloads, and most quantization methods use f16 and i8 types.
This patch relaxes the type constraints on warp reduce distribution to
allow these types. It also changes the interface by moving the initial
reduction of data to a single thread into the distributedReductionFn.
This gives developers the flexibility to control how the initial lane
value is obtained, which may differ based on the input type (e.g., to
shuffle with a 32-bit-wide type, f16 data needs to be reduced to 2xf16
rather than to a single element).
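
The 2xf16 point can be illustrated with a small CPU-side simulation. This is only a sketch under stated assumptions, not the code in VectorDistribute.cpp: the names pack2, unpackLo, unpackHi, addPacked and warpReduceAdd are hypothetical, int16_t stands in for f16 because standard C++ has no half-precision type, a warp is assumed to have 32 lanes, and the XOR butterfly is a common shuffle pattern chosen for illustration. Each lane first reduces its own data to two 16-bit partial sums that fill one 32-bit register, the lanes then exchange that register, and the two halves are combined only at the end.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

constexpr int kWarpSize = 32;  // assumed number of lanes per warp

// Pack two 16-bit values into one 32-bit word (low half, high half).
uint32_t pack2(int16_t lo, int16_t hi) {
  return static_cast<uint32_t>(static_cast<uint16_t>(lo)) |
         (static_cast<uint32_t>(static_cast<uint16_t>(hi)) << 16);
}

int16_t unpackLo(uint32_t w) {
  return static_cast<int16_t>(static_cast<uint16_t>(w & 0xFFFFu));
}

int16_t unpackHi(uint32_t w) {
  return static_cast<int16_t>(static_cast<uint16_t>(w >> 16));
}

// Element-wise add of two packed pairs.
uint32_t addPacked(uint32_t a, uint32_t b) {
  return pack2(static_cast<int16_t>(unpackLo(a) + unpackLo(b)),
               static_cast<int16_t>(unpackHi(a) + unpackHi(b)));
}

// Simulated warp-wide sum. laneData[lane] is the data owned by that lane.
int16_t warpReduceAdd(
    const std::array<std::vector<int16_t>, kWarpSize> &laneData) {
  std::array<uint32_t, kWarpSize> regs{};

  // Step 1: each lane reduces its own data to a pair of partial sums
  // (even-indexed and odd-indexed elements), which fills one 32-bit word.
  for (int lane = 0; lane < kWarpSize; ++lane) {
    const std::vector<int16_t> &v = laneData[lane];
    assert(v.size() % 2 == 0 && "sketch assumes an even element count");
    int16_t even = 0, odd = 0;
    for (std::size_t i = 0; i < v.size(); i += 2) {
      even = static_cast<int16_t>(even + v[i]);
      odd = static_cast<int16_t>(odd + v[i + 1]);
    }
    regs[lane] = pack2(even, odd);
  }

  // Step 2: XOR butterfly. The copy models all lanes reading their
  // partner's 32-bit register at the same time, as a shuffle would.
  for (int offset = kWarpSize / 2; offset > 0; offset /= 2) {
    std::array<uint32_t, kWarpSize> shuffled = regs;
    for (int lane = 0; lane < kWarpSize; ++lane)
      regs[lane] = addPacked(shuffled[lane], shuffled[lane ^ offset]);
  }

  // Step 3: combine the two halves into the final scalar.
  return static_cast<int16_t>(unpackLo(regs[0]) + unpackHi(regs[0]));
}
```

Reducing to a single 16-bit element before the exchange would leave half of every 32-bit shuffle unused; keeping a pair per lane uses the full width and defers the last add to after the exchange. The sketch does not guard against 16-bit overflow.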
Reviewed By: ThomasRaoux
Differential Revision: https://reviews.llvm.org/D137691