[all-commits] [llvm/llvm-project] dd2208: [flang][runtime] Split MATMUL[_TRANSPOSE] into sep...
Slava Zakharin via All-commits
all-commits at lists.llvm.org
Tue Jul 2 21:30:58 PDT 2024
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: dd220853081400db6b4f85027030645115229ba0
https://github.com/llvm/llvm-project/commit/dd220853081400db6b4f85027030645115229ba0
Author: Slava Zakharin <szakharin at nvidia.com>
Date: 2024-07-02 (Tue, 02 Jul 2024)
Changed paths:
A flang/include/flang/Runtime/matmul-instances.inc
M flang/include/flang/Runtime/matmul-transpose.h
M flang/include/flang/Runtime/matmul.h
M flang/runtime/matmul-transpose.cpp
M flang/runtime/matmul.cpp
M flang/unittests/Runtime/Matmul.cpp
M flang/unittests/Runtime/MatmulTranspose.cpp
Log Message:
-----------
[flang][runtime] Split MATMUL[_TRANSPOSE] into separate entries. (#97406)
Device compilation is much faster for separate MATMUL[_TRANPOSE]
entries than for a single one that covers all data types.
The lowering changes and the removal of the generic entries will follow.
To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications
More information about the All-commits
mailing list