[PATCH] D83395: [SVE] Code generation for fixed length vector truncates.

Cameron McInally via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Jul 8 13:30:50 PDT 2020


cameron.mcinally marked an inline comment as done.
cameron.mcinally added inline comments.


================
Comment at: llvm/test/CodeGen/AArch64/sve-fixed-length-trunc.ll:213
+; VBITS_GE_512: uzp1 [[A_HALFS:z[0-9]+]].h, [[A_WORDS]].h, [[A_WORDS]].h
+; VBITS_GE_512: uzp1 z0.b, [[A_HALFS]].b, [[A_HALFS]].b
+; CHECK: ret
----------------
Just passing by and had to comment that this is an expensive truncate. It's around 5x slower than the equivalent on x86, with 1/2 the throughput.

Might be a good candidate for a dedicated hardware instruction on future SVE revisions... 


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D83395/new/

https://reviews.llvm.org/D83395





More information about the llvm-commits mailing list