[all-commits] [llvm/llvm-project] 2d3c26: [AArch64] break non-temporal loads over 256 into 2...

Florian Hahn via All-commits all-commits at lists.llvm.org
Wed Sep 28 07:21:21 PDT 2022


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 2d3c260362a29404909dadd65e904e872b52e09c
      https://github.com/llvm/llvm-project/commit/2d3c260362a29404909dadd65e904e872b52e09c
  Author: Florian Hahn <flo at fhahn.com>
  Date:   2022-09-28 (Wed, 28 Sep 2022)

  Changed paths:
    M llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
    M llvm/test/CodeGen/AArch64/nontemporal-load.ll

  Log Message:
  -----------
  [AArch64] break non-temporal loads over 256 into 256-loads and a smaller load

Currently over 256 non-temporal loads are broken inefficently. For example, `v17i32` gets broken into 2 128-bit loads. It is better if we can use
256-bit loads instead.

Reviewed By: fhahn

Differential Revision: https://reviews.llvm.org/D133421




More information about the All-commits mailing list