[llvm] [PPC] generate stxvw4x/lxvw4x on P7 (PR #87049)

Mon Apr 1 04:10:49 PDT 2024

================
@@ -17250,8 +17250,7 @@ bool PPCTargetLowering::getTgtMemIntrinsic(IntrinsicInfo &Info,
 EVT PPCTargetLowering::getOptimalMemOpType(
     const MemOp &Op, const AttributeList &FuncAttributes) const {
   if (getTargetMachine().getOptLevel() != CodeGenOptLevel::None) {
-    // We should use Altivec/VSX loads and stores when available. For unaligned
-    // addresses, unaligned VSX loads are only fast starting with the P8.
----------------
bzEq wrote:

Have you compared the performance w/o unaligned vector-scalar load/store on powr7? The comment looks indicating why it is not performed prior pwr8.

https://github.com/llvm/llvm-project/pull/87049