[all-commits] [llvm/llvm-project] a28004: [flang] Lower TRANSPOSE without using runtime.

Slava Zakharin via All-commits all-commits at lists.llvm.org
Tue Jul 12 08:44:00 PDT 2022


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: a280043b523182ab6bb3ce5caf75e931a26eaf3f
      https://github.com/llvm/llvm-project/commit/a280043b523182ab6bb3ce5caf75e931a26eaf3f
  Author: Slava Zakharin <szakharin at nvidia.com>
  Date:   2022-07-12 (Tue, 12 Jul 2022)

  Changed paths:
    M flang/lib/Lower/ConvertExpr.cpp
    M flang/test/Lower/Intrinsics/transpose.f90
    A flang/test/Lower/Intrinsics/transpose_opt.f90

  Log Message:
  -----------
  [flang] Lower TRANSPOSE without using runtime.

Calling runtime TRANSPOSE requires a temporary array for the result,
and, sometimes, a temporary array for the argument. Lowering it inline
should provide faster code.

I added -opt-transpose control just for debugging purposes temporary.
I am going to make driver changes that will disable inline lowering
for -O0. For the time being I would like to enable it by default
to expose the code to more tests.

Differential Revision: https://reviews.llvm.org/D129497




More information about the All-commits mailing list