[Mlir-commits] [clang] [llvm] [mlir] [openmp] [LoopTiling][Clang][MLIR] Canonical Intra-tile Loops (PR #191114)
Michael Kruse
llvmlistbot at llvm.org
Fri Apr 24 05:49:52 PDT 2026
Meinersbur wrote:
> But one concern, aren't we over-complicating this by saving execution of remainder iterations in the last tile?
LoopVectorize will require if-conversion/masked execution with such loop bodies which significantly hampers all executions (not just the remainder iterations). Since we do tiling for performance, we cannot make the generated code run slower than without tiling.
If LLVM is not able to optimize it away, another option is to emit the predicate, but optimize it to an smin expression at Clang's CodeGen.
https://github.com/llvm/llvm-project/pull/191114
More information about the Mlir-commits
mailing list