[PATCH] D124284: [SLP]Try partial store vectorization if supported by target.
Simon Pilgrim via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Sun Jun 12 10:32:23 PDT 2022
RKSimon added inline comments.
================
Comment at: llvm/test/Transforms/SLPVectorizer/X86/arith-mul-load.ll:14
define void @add4(ptr noalias nocapture noundef %r, ptr noalias nocapture noundef readonly %a) {
; SSE-LABEL: @add4(
----------------
xbolva00 wrote:
> xbolva00 wrote:
> > RKSimon wrote:
> > > xbolva00 wrote:
> > > > RKSimon wrote:
> > > > > ABataev wrote:
> > > > > > RKSimon wrote:
> > > > > > > I'll investigate adding 32-bit vector load/store handling as well (it has the same costs as the codegen for 64-bit anyhow).
> > > > > > TTI does not report that it supports 32 bit stores.
> > > > > We never bothered to add it - we mainly use the 64-bit vector load/store to handle f64-i64 handling on 32-bit targets
> > > > Do you plan to add them?
> > > yes - got a few other blockers to deal with first though - that yak has to be shaved.......
> > ok, thanks!
> any updates?
>
>
> ```
> void pr(char* __restrict a, char* __restrict r){
> for (int i = 0; i < 4; i++){
> r[i] += a[i];
> }
> }
> ```
>
> gcc emits nicely paddb.
https://reviews.llvm.org/D127604 - but I need someone to perf test the patch properly.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D124284/new/
https://reviews.llvm.org/D124284
More information about the llvm-commits
mailing list