[llvm] [PPC] generate stxvw4x/lxvw4x on P7 (PR #87049)
Kai Luo via llvm-commits
llvm-commits at lists.llvm.org
Mon Apr 1 04:10:49 PDT 2024
================
@@ -17250,8 +17250,7 @@ bool PPCTargetLowering::getTgtMemIntrinsic(IntrinsicInfo &Info,
EVT PPCTargetLowering::getOptimalMemOpType(
const MemOp &Op, const AttributeList &FuncAttributes) const {
if (getTargetMachine().getOptLevel() != CodeGenOptLevel::None) {
- // We should use Altivec/VSX loads and stores when available. For unaligned
- // addresses, unaligned VSX loads are only fast starting with the P8.
----------------
bzEq wrote:
Have you compared the performance w/o unaligned vector-scalar load/store on powr7? The comment looks indicating why it is not performed prior pwr8.
https://github.com/llvm/llvm-project/pull/87049
More information about the llvm-commits
mailing list