[PATCH] D104782: [AArch64] Custom lower <4 x i8> loads

Eli Friedman via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Jun 23 11:02:19 PDT 2021


efriedma added a comment.

Missing testcases for load+ext to `<4 x i16>`.



================
Comment at: llvm/test/CodeGen/AArch64/neon-extload.ll:38
+; BE-NEXT:    ldr s0, [x0]
+; BE-NEXT:    rev32 v0.8b, v0.8b
+; BE-NEXT:    ushll v0.8h, v0.8b, #0
----------------
SjoerdMeijer wrote:
> I am trying to remember how big-endian works in LLVM, but since I noticed these reverse here, this looked okay'ish to me, but I haven't tested BE. Any opinions on this welcome (while I look a bit more at this)....
Looks fine to me.  The rev32 comes out of lowering the ISD::BITCAST.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D104782/new/

https://reviews.llvm.org/D104782



More information about the llvm-commits mailing list