[all-commits] [llvm/llvm-project] 6d1149: [mlir][Linalg] Refine how broadcast dims are treat...

Andrzej Warzyński via All-commits all-commits at lists.llvm.org
Thu Sep 26 08:17:37 PDT 2024

  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 6d114944142ae5a1d0387fe40ffa9351b6f642aa
  Author: Andrzej Warzyński <andrzej.warzynski at arm.com>
  Date:   2024-09-26 (Thu, 26 Sep 2024)

  Changed paths:
    M mlir/include/mlir/IR/AffineMap.h
    M mlir/lib/Dialect/Linalg/Transforms/Vectorization.cpp
    M mlir/lib/IR/AffineMap.cpp
    M mlir/test/Dialect/Linalg/vectorization-with-patterns.mlir
    M mlir/test/Dialect/Linalg/vectorization.mlir

  Log Message:
  [mlir][Linalg] Refine how broadcast dims are treated (#99015)

This PR fixes how broadcast dims (identified as "zero" results in
permutation maps) corresponding to a reduction iterator are vectorised
in the case of generic Ops. Here's an example:

  #map = affine_map<(d0, d1, d2, d3) -> (d0, d1, d2, d3)>
  #map1 = affine_map<(d0, d1, d2, d3) -> (d0, d1, d2, 0)>

  func.func @generic_with_reduction_and_broadcast(%arg0: tensor<1x12x197x197xf32>) -> (tensor<1x12x197x1xf32>) {
    %0 = tensor.empty() : tensor<1x12x197x1xf32>

    %1 = linalg.generic {indexing_maps = [#map, #map1],
                        iterator_types = ["parallel", "parallel", "parallel", "reduction"]}
      ins(%arg0 : tensor<1x12x197x197xf32>)
      outs(%0 : tensor<1x12x197x1xf32>) {

    ^bb0(%in: f32, %out: f32):
      %818 = arith.addf %in, %out : f32
      linalg.yield %818 : f32
    } -> tensor<1x12x197x1xf32>
    return %1 : tensor<1x12x197x1xf32>
  }

This is a perfectly valid Generic Op, but currently triggers two issues
in the vectoriser. The root cause is this map:

  #map1 = affine_map<(d0, d1, d2, d3) -> (d0, d1, d2, 0)>

This map triggers an assert in `reindexIndexingMap`. This hook
incorrectly assumes that every result in the input map is a `dim`
expression and that there are no constants, which is not the case in
this example. `reindexIndexingMap` is extended to allow maps like the
one above. For now, only constant "zero" results are allowed. This can
be extended in the future once a good motivating example is available.
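
As a rough illustration of the relaxed check described above, here is a toy
Python model. It is not MLIR code and not the actual implementation: the
tuple encoding of map results and the function name
`is_projected_permutation_model` are assumptions made for this sketch only.

```python
# Toy model only, not MLIR. A map result is encoded (by assumption) as
# ("dim", i) for the expression d_i, or ("const", c) for the constant c.

def is_projected_permutation_model(results, num_dims, allow_zero_in_results=False):
    """Return True if every result is a distinct dim, optionally allowing
    constant-zero results (the broadcast dims discussed in this commit)."""
    if len(results) > num_dims:
        return False
    seen = set()
    for kind, value in results:
        if kind == "dim":
            # Each dim may appear at most once and must be in range.
            if value < 0 or value >= num_dims or value in seen:
                return False
            seen.add(value)
        elif kind == "const":
            # Only the constant 0 is accepted, and only when opted in.
            if not (allow_zero_in_results and value == 0):
                return False
        else:
            return False
    return True


# (d0, d1, d2, d3) -> (d0, d1, d2, 0)
map1 = [("dim", 0), ("dim", 1), ("dim", 2), ("const", 0)]

strict = is_projected_permutation_model(map1, num_dims=4)
relaxed = is_projected_permutation_model(map1, num_dims=4, allow_zero_in_results=True)
```

Under the strict check the constant result is rejected, which mirrors the
assert described above; with zero results allowed, the same map is accepted.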

Separately, the permutation map highlighted above "breaks" mask
calculation (at the moment masks are always computed, even in the
presence of static shapes). When applying the following permutation:
  (d0, d1, d2, d3) -> (d0, d1, d2, 0)

to these canonical shapes (corresponding to the example above):
  (1, 12, 197, 197)
we end up with the following error:
  error: vector types must have positive constant sizes but got 1, 12, 197, 0

The error makes sense and indicates that we should update the
permutation map above to:
  (d0, d1, d2, d3) -> (d0, d1, d2)

This would correctly give a vector type of shape 1x12x197, with no
zero-sized trailing dimension.
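
The shape arithmetic behind the error and the fix can be sketched in a few
lines of Python. This is only a model of how a map's results are evaluated
against the canonical shapes; the tuple encoding and the helper names
`apply_map` and `drop_zero_results` are hypothetical and do not correspond to
MLIR APIs.

```python
# Toy model only, not MLIR. A map result is encoded (by assumption) as
# ("dim", i) for the expression d_i, or ("const", c) for the constant c.

def apply_map(results, shape):
    """Evaluate each map result against `shape`: a dim picks the matching
    size, a constant evaluates to itself."""
    return [shape[value] if kind == "dim" else value for kind, value in results]


def drop_zero_results(results):
    """Remove constant-zero (broadcast) results from the map."""
    return [(kind, value) for kind, value in results
            if not (kind == "const" and value == 0)]


canonical_shape = (1, 12, 197, 197)

# (d0, d1, d2, d3) -> (d0, d1, d2, 0)
broadcast_map = [("dim", 0), ("dim", 1), ("dim", 2), ("const", 0)]

# The zero result leaks into the shape, which is not a valid vector shape.
bad_shape = apply_map(broadcast_map, canonical_shape)    # [1, 12, 197, 0]

# (d0, d1, d2, d3) -> (d0, d1, d2)
fixed_shape = apply_map(drop_zero_results(broadcast_map), canonical_shape)  # [1, 12, 197]
```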

Fixes #97247
