<div dir="ltr">Because unaligned loads and stores are illegal on my target.<div>But ExpandUnalignedStore expands into too many loads and stores. <br></div><div><br></div><div>It seems that ExpandUnalignedStore is called after the vectorization cost analysis is done, so its cost is not taken into account.</div>
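<div><br></div><div>To illustrate the gap, here is a minimal standalone sketch of a cost estimate that charges for the expansion up front. It is not LLVM code: the function name, its parameters, and the cost formula (one vector access to an aligned stack slot, plus one load and one store per alignment-sized chunk) are assumptions made for illustration only, loosely modelled on the stack round trip described in the quoted message.</div>

```cpp
// Hypothetical cost sketch; none of these names are LLVM APIs.
// Assumed model: an access that meets the required alignment costs 1.
// An under-aligned access goes through an aligned stack slot (1 vector
// access) and is then copied in chunks of the known alignment, with
// one load and one store per chunk.
unsigned estimateVectorMemOpCost(unsigned SizeInBytes, unsigned Alignment,
                                 unsigned RequiredAlign) {
  if (Alignment == 0)
    Alignment = 1; // treat unknown alignment as byte alignment
  if (Alignment >= RequiredAlign)
    return 1;
  unsigned Chunks = SizeInBytes / Alignment;
  return 1 + 2 * Chunks;
}
```

<div>Under this assumed model, a 16-byte vector whose known alignment is only 4 costs 9 instead of 1, which is the kind of figure the vectorizer would need to see before deciding.</div>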
<div><div class="gmail_extra"><br></div><div class="gmail_extra"><br><br><div class="gmail_quote">On Fri, Jul 19, 2013 at 4:32 PM, Eli Friedman <span dir="ltr"><<a href="mailto:eli.friedman@gmail.com" target="_blank">eli.friedman@gmail.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div class="im">On Fri, Jul 19, 2013 at 1:14 PM, Francois Pichet <<a href="mailto:pichet2000@gmail.com">pichet2000@gmail.com</a>> wrote:<br>
><br>
> What is the proper solution to disable auto-vectorization for unaligned<br>
> data?<br>
<br>
</div>Why are you trying to do this? If auto-vectorization is making a<br>
given loop slower on your target, that means the cost metrics are off,<br>
and we should fix them. If code size is an issue, you should tell the<br>
optimizer that you want to optimize for size.<br>
<br>
-Eli<br>
<div><div class="h5"><br>
> I have an out of tree target and I added this:<br>
><br>
> bool OpusTargetLowering::allowsUnalignedMemoryAccesses(EVT VT, bool *Fast)<br>
> const {<br>
> if (VT.isVector())<br>
> return false;<br>
> ....<br>
> }<br>
><br>
> After that, I could see that vectorization is still done on unaligned data<br>
> except that llvm will copy the data back and forth from the source to the<br>
> top of the stack and work from there. This is very costly; I would rather get<br>
> scalar operations.<br>
><br>
> Then I tried to add:<br>
> unsigned getMemoryOpCost(unsigned Opcode, Type *Src,<br>
> unsigned Alignment,<br>
> unsigned AddressSpace) const {<br>
> if (Src->isVectorTy() && Alignment != 16)<br>
> return 10000; // <== high number to try to avoid unaligned load/store.<br>
> return TargetTransformInfo::getMemoryOpCost(Opcode, Src, Alignment,<br>
> AddressSpace);<br>
> }<br>
><br>
> Except that this doesn't work because Alignment will always be 4 even for<br>
> data like:<br>
> int data[16][16] __attribute__ ((aligned (16))),<br>
><br>
> Because individual elements are still 4-byte aligned.<br>
><br>
> I am not sure what the right way to do it is.<br>
> Thanks.<br>
><br>
><br>
</div></div>
</blockquote></div><br></div></div></div>