[all-commits] [llvm/llvm-project] a28004: [flang] Lower TRANSPOSE without using runtime.
Slava Zakharin via All-commits
all-commits at lists.llvm.org
Tue Jul 12 08:44:00 PDT 2022
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: a280043b523182ab6bb3ce5caf75e931a26eaf3f
https://github.com/llvm/llvm-project/commit/a280043b523182ab6bb3ce5caf75e931a26eaf3f
Author: Slava Zakharin <szakharin at nvidia.com>
Date: 2022-07-12 (Tue, 12 Jul 2022)
Changed paths:
M flang/lib/Lower/ConvertExpr.cpp
M flang/test/Lower/Intrinsics/transpose.f90
A flang/test/Lower/Intrinsics/transpose_opt.f90
Log Message:
-----------
[flang] Lower TRANSPOSE without using runtime.
Calling runtime TRANSPOSE requires a temporary array for the result,
and, sometimes, a temporary array for the argument. Lowering it inline
should provide faster code.
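To illustrate the cost described above, here is a minimal C++ sketch of evaluating b = TRANSPOSE(a) + c in two ways: first by materializing the transposed array in a temporary, as a runtime call would, and then by swapping the indices inside the consuming loop. The Matrix type and all function names are hypothetical and exist only for this illustration; this is not the code flang generates or the code in ConvertExpr.cpp.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical column-major matrix, for illustration only.
struct Matrix {
  std::size_t rows, cols;
  std::vector<double> data; // element (i, j) is stored at data[i + j * rows]
  Matrix(std::size_t r, std::size_t c) : rows(r), cols(c), data(r * c, 0.0) {}
  double &at(std::size_t i, std::size_t j) { return data[i + j * rows]; }
  double at(std::size_t i, std::size_t j) const { return data[i + j * rows]; }
};

// Runtime-call style: the whole transposed result is written to a temporary.
Matrix transposeViaTemporary(const Matrix &a) {
  Matrix tmp(a.cols, a.rows);
  for (std::size_t j = 0; j < a.cols; ++j)
    for (std::size_t i = 0; i < a.rows; ++i)
      tmp.at(j, i) = a.at(i, j);
  return tmp;
}

// b = transpose(a) + c, going through the temporary.
Matrix addTransposedViaTemporary(const Matrix &a, const Matrix &c) {
  assert(c.rows == a.cols && c.cols == a.rows);
  Matrix t = transposeViaTemporary(a); // extra allocation and extra pass
  Matrix b(c.rows, c.cols);
  for (std::size_t j = 0; j < c.cols; ++j)
    for (std::size_t i = 0; i < c.rows; ++i)
      b.at(i, j) = t.at(i, j) + c.at(i, j);
  return b;
}

// Inline style: no temporary; the consumer reads a(j, i) directly.
Matrix addTransposedInline(const Matrix &a, const Matrix &c) {
  assert(c.rows == a.cols && c.cols == a.rows);
  Matrix b(c.rows, c.cols);
  for (std::size_t j = 0; j < c.cols; ++j)
    for (std::size_t i = 0; i < c.rows; ++i)
      b.at(i, j) = a.at(j, i) + c.at(i, j);
  return b;
}
```

Both functions compute the same result; the inline form avoids one allocation and one full pass over the data, which is the kind of saving the commit message has in mind.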
I added a temporary -opt-transpose control, for debugging purposes only.
I am going to make driver changes that will disable inline lowering
for -O0. For the time being I would like to enable it by default
to expose the code to more tests.
Differential Revision: https://reviews.llvm.org/D129497