[PATCH][instcombine]: Slice a big load in two loads when the element are next to each other in memory.
Owen Anderson
resistor at mac.com
Mon Oct 7 22:39:47 PDT 2013
On Oct 7, 2013, at 4:04 PM, Quentin Colombet <qcolombet at apple.com> wrote:
> Hi Owen,
>
> Thanks for the review.
> See my comments inlined with yours.
>
> On Oct 7, 2013, at 3:47 PM, Owen Anderson <resistor at mac.com> wrote:
>
>>
>> On Oct 7, 2013, at 1:24 PM, Quentin Colombet <qcolombet at apple.com> wrote:
>>
>>> Hi Owen, hi Chandler,
>>>
>>> Owen, you may be a good match to review this patch, which is SDSel related and not instcombine related (as the subject says… it changes :)).
>>
>> I’m not sure I like the cost model this represents. It seems very x86-centric. For example, some RISCs (PowerPC and AArch64, offhand) can do single-instruction paired-register bitfield extracts, which means that original input sequence was better for them (1 load + 1 extract, instead of 2 loads).
> This shouldn’t happen unless the target specified that it supports paired load (the new target hook that this patch adds), which is false by default.
> In that case, you end up with 1 paired load instead of 1 load + 1 extract.
>
> Does it make sense?
The criteria for when a target should/should not implement the has-paired-load target hook. How is it different from supporting vector loads?
>> Even on x86, you’re probably best off trying to turn this into a <2 x i32> vector load + lane copy rather than splitting the load.
> My plan here was to turn this into two split loads, then having another dag combine for merging those loads into a vector load. I though maybe target supporting paired load already do this kind of target specific combine.
You certainly can go that way, but it sounds challenging to implement. My point was that it seems like you could achieve most of the same benefit with a DAG combine that turned this example into a <2 x float> vector load, with much less concern about the cost model, etc.
--Owen
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20131007/c020f56a/attachment.html>
More information about the llvm-commits
mailing list