[PATCH] D65884: [ARM] MVE Tail Predication
Dave Green via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Aug 12 01:12:31 PDT 2019
dmgreen added a comment.
You had me convinced until the last line. I think it's probably simpler for the vctp to produce a v4i1, as we don't need the convert at all in this patch (unless I'm missing something).
Essentially, the vctp intrinsic can either look more like the ACLE intrinsic (produce an i16, which makes the ACLE->IR lowering simpler, but the IR->instruction selection then needs to match on a convert(vctp)), or more like the VCTP instruction (produce a v4i1, which makes the IR->instruction selection simpler). I would go with the second option, but Simon is the expert on all things intrinsics. Go with whatever he thinks is OK!
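To make the two shapes concrete, here is a rough C++ model of what each option would return for the 32-bit case. This is only a sketch, not LLVM code: the function names are made up for illustration, and it assumes the usual MVE layout where the 16-bit predicate holds one bit per byte, so each 32-bit lane owns four bits.

```cpp
#include <array>
#include <cstdint>

// Option 2 shape: one boolean per 32-bit lane (the "v4i1" view).
using V4I1 = std::array<bool, 4>;

// Hypothetical model of a vctp32 that yields a v4i1:
// lane i is active while i < remaining elements.
V4I1 vctp32_v4i1(uint32_t remaining) {
  V4I1 lanes{};
  for (uint32_t i = 0; i < 4; ++i)
    lanes[i] = i < remaining;
  return lanes;
}

// Hypothetical model of the convert: widen each lane bit to the four
// predicate bits that cover that lane's bytes (assumed layout).
uint16_t pred16_from_v4i1(const V4I1 &lanes) {
  uint16_t pred = 0;
  for (unsigned i = 0; i < 4; ++i)
    if (lanes[i])
      pred |= static_cast<uint16_t>(0xFu << (4 * i));
  return pred;
}

// Option 1 shape: the ACLE-like result is an i16, i.e. convert(vctp).
uint16_t vctp32_i16(uint32_t remaining) {
  return pred16_from_v4i1(vctp32_v4i1(remaining));
}
```

With the i16 form, anything that wants per-lane predication has to see through that extra convert step; with the v4i1 form the result maps onto the instruction directly, which is why I lean towards it here.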
================
Comment at: lib/Target/ARM/CMakeLists.txt:61
Thumb2SizeReduction.cpp
+ Thumb2TailPredication.cpp
)
----------------
I would go with MVETailPredicationPass.cpp
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D65884/new/
https://reviews.llvm.org/D65884