[all-commits] [llvm/llvm-project] dd2208: [flang][runtime] Split MATMUL[_TRANSPOSE] into sep...

Slava Zakharin via All-commits all-commits at lists.llvm.org
Tue Jul 2 21:30:58 PDT 2024


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: dd220853081400db6b4f85027030645115229ba0
      https://github.com/llvm/llvm-project/commit/dd220853081400db6b4f85027030645115229ba0
  Author: Slava Zakharin <szakharin at nvidia.com>
  Date:   2024-07-02 (Tue, 02 Jul 2024)

  Changed paths:
    A flang/include/flang/Runtime/matmul-instances.inc
    M flang/include/flang/Runtime/matmul-transpose.h
    M flang/include/flang/Runtime/matmul.h
    M flang/runtime/matmul-transpose.cpp
    M flang/runtime/matmul.cpp
    M flang/unittests/Runtime/Matmul.cpp
    M flang/unittests/Runtime/MatmulTranspose.cpp

  Log Message:
  -----------
  [flang][runtime] Split MATMUL[_TRANSPOSE] into separate entries. (#97406)

Device compilation is much faster for separate MATMUL[_TRANSPOSE]
entries than for a single one that covers all data types.
The lowering changes and the removal of the generic entries will follow.
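
The idea of splitting one generic entry into per-type entries can be sketched as follows. This is only an illustration under stated assumptions, not the code from this commit: the macro names, the `SketchMatmul...` entry names, the type combinations, and the row-major raw-pointer interface are all hypothetical. The inline instance list stands in for a shared file such as matmul-instances.inc, and the actual runtime interface is not reproduced here.

```cpp
#include <cstddef>

// Hypothetical instance list, standing in for a shared .inc file.
// Each line names one (result, left, right) element-type combination.
#define SKETCH_MATMUL_INSTANCES(X)          \
  X(Integer4Integer4, int, int, int)        \
  X(Real4Real4, float, float, float)        \
  X(Real8Real4, double, double, float)      \
  X(Real8Real8, double, double, double)

// Shared template body: (rows x n) times (n x cols), row-major storage.
// Row-major is an assumption made for brevity; Fortran arrays are column-major.
template <typename R, typename A, typename B>
void MatmulImpl(R *result, const A *a, const B *b, std::size_t rows,
                std::size_t n, std::size_t cols) {
  for (std::size_t i = 0; i < rows; ++i) {
    for (std::size_t j = 0; j < cols; ++j) {
      R sum{};
      for (std::size_t k = 0; k < n; ++k) {
        sum += static_cast<R>(a[i * n + k]) * static_cast<R>(b[k * cols + j]);
      }
      result[i * cols + j] = sum;
    }
  }
}

// One separately named entry per listed combination. A caller that needs a
// single combination references only that entry, so only that instantiation
// has to be compiled for it, instead of a single entry that dispatches over
// every combination at run time.
#define SKETCH_DEFINE_ENTRY(NAME, R, A, B)                                  \
  void SketchMatmul##NAME(R *result, const A *a, const B *b,                \
                          std::size_t rows, std::size_t n,                  \
                          std::size_t cols) {                               \
    MatmulImpl<R, A, B>(result, a, b, rows, n, cols);                       \
  }
SKETCH_MATMUL_INSTANCES(SKETCH_DEFINE_ENTRY)
#undef SKETCH_DEFINE_ENTRY
```

In this sketch, the same instance list could also be expanded with a different macro to generate declarations in a header, which keeps the declarations and definitions in sync.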


