[PATCH] D137547: [AArch64][SVE] Use PTRUE instruction for get_active_lane_mask intrinsic if the range is appropriate for predicator constant

Dinar Temirbulatov via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Nov 17 16:25:24 PST 2022


dtemirbulatov marked 7 inline comments as done.
dtemirbulatov added inline comments.


================
Comment at: llvm/test/CodeGen/AArch64/active_lane_mask.ll:495
+entry:
+  %active.lane.mask = call <vscale x 4 x i1> @llvm.get.active.lane.mask.nxv4i1.i64(i64 0, i64 4)
+  ret <vscale x 4 x i1> %active.lane.mask
----------------
c-rhodes wrote:
> active.lane.mask is an unsigned less-than comparison so this generates a predicate with 4 active lanes, which for a minimum vector width of 128 and esize of 32 can be represented with `ptrue p0.s, vl4`?
correct.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D137547/new/

https://reviews.llvm.org/D137547



More information about the llvm-commits mailing list