<html><head><style type='text/css'>p { margin: 0; }</style></head><body><div style='font-family: arial,helvetica,sans-serif; font-size: 10pt; color: #000000'><br><hr id="zwchr"><blockquote id="DWT12381" style="border-left: 2px solid rgb(16, 16, 255); margin-left: 5px; padding-left: 5px; color: rgb(0, 0, 0); font-weight: normal; font-style: normal; text-decoration: none; font-family: Helvetica,Arial,sans-serif; font-size: 12pt;"><b>From: </b>"Mehdi Amini via llvm-dev" <llvm-dev@lists.llvm.org><br><b>To: </b>"Philip Reames" <listmail@philipreames.com>, "David Blaikie" <dblaikie@gmail.com><br><b>Cc: </b>"llvm-dev" <llvm-dev@lists.llvm.org>, uweigand@de.ibm.com, "Tom Stellard" <thomas.stellard@amd.com><br><b>Sent: </b>Tuesday, March 22, 2016 2:39:36 PM<br><b>Subject: </b>Re: [llvm-dev] RFC: A change in InstCombine canonical form<br><br>
I don't really mind, but the intermediate stage will not be very nice: that a lot of code / tests that needs to be written with bitcast, and all of that while they are deemed to disappear. The added value isn't clear to me considering the added work. I'm not sure it wouldn't add more work for all the cleanup required by the "typeless pointer", but I'm not sure what's involved here and if David thinks the intermediate steps of handling bit casts everywhere is not making it harder I'm fine with it.</blockquote>It is not clear to me that this is a particularly-large change. As I understand it, we're only talking about the canonicalization of small memcpys, so there are no other uses of the relevant values. Changing the canonical form here seems like a pure win, except for the fact that doing the load and store using integer registers is likely more efficient on some architectures than using floating-point registers. For those architectures, we'd want to fix this in the backend. This seems relatively easy to do in CGP (or DAGCombine) (and, in fact, doing it later would enable us to catch more load/store cases than just those that comes from memcpy expansions). Thoughts?<br><br> -Hal<br><blockquote style="border-left: 2px solid rgb(16, 16, 255); margin-left: 5px; padding-left: 5px; color: rgb(0, 0, 0); font-weight: normal; font-style: normal; text-decoration: none; font-family: Helvetica,Arial,sans-serif; font-size: 12pt;"><div class=""><br class=""></div><div class="">-- </div><div class="">Mehdi</div><div class=""><br class=""><div><blockquote class=""><div class="">On Mar 22, 2016, at 12:36 PM, Philip Reames <<a href="mailto:listmail@philipreames.com" class="" target="_blank">listmail@philipreames.com</a>> wrote:</div><br class="Apple-interchange-newline"><div class="">
<div class="">
I'd phrase this differently: being pointer-bitcast agnostic is a
step towards support typeless pointers. :) We can either become
bitcast agnostic all in one big change or incrementally.
Personally, I'd prefer the later since it reduces the risk
associated with enabling typeless pointers in the end.<br class="">
<br class="">
Philip<br class="">
<br class="">
<div class="moz-cite-prefix">On 03/22/2016 12:16 PM, Mehdi Amini via
llvm-dev wrote:<br class="">
</div>
<blockquote cite="mid:DCE36A26-6226-49B3-9826-95E25BFB4AA0@apple.com" class="">
I don't know enough about the tradeoff for 1, but 2 seems like a
bandaid for something that is not a correctness issue neither a
regression. I'm not sure it justifies "bandaid patches" while
there is a clear path forward, i.e. typeless pointers, unless
there is an acknowledgement that typeless pointers won't be there
before a couple of years.
<div class="">
<div class=""><br class="">
</div>
<div class="">-- </div>
<div class="">Mehdi</div>
<div class=""><br class="">
<div class="">
<div class="">
<blockquote class="">
<div class="">On Mar 22, 2016, at 11:32 AM, Ehsan Amiri
<<a href="mailto:ehsanamiri@gmail.com" class="" target="_blank">ehsanamiri@gmail.com</a>>
wrote:</div>
<br class="Apple-interchange-newline">
<div class="">
<div dir="ltr" class="">
<div class="">
<div class="">
<div class="">Back to the discussion on the RFC,
I still see some advantage in following the
proposed solution. I see two paths forward:<br class="">
<br class="">
</div>
1- Change canonical form, possibly lower memcpy
to non-integer load and store in InstCombine.
Then teach the backends to convert that to
integer load and store if that is more
profitable. Notice that we are talking about
loads that have no use other than store. So it
is a fairly simple change in the backends.<br class="">
<br class="">
</div>
2- Do not change the canonical form. Then for this
bug, we need to teach select speculation to see
through bitcasts. We will probably need to teach
other optimizations to see though bitcasts in the
future as problems are uncovered. That is until
typeless pointer work is complete. Once the
typeless pointer work is complete, we have some
extra code in each optimization for seeing through
bitcasts which is possibly no longer needed.<br class="">
<br class="">
</div>
<div class="">Based on this I think (1) is the right
thing to do. But there might be other reasons for
the current canonical form that I am not aware of.
Please let me know what you think.<br class="">
<br class="">
</div>
<div class="">Thanks<br class="">
</div>
Ehsan<br class="">
</div>
<div class="gmail_extra"><br class="">
<div class="gmail_quote">On Wed, Mar 16, 2016 at
2:13 PM, David Blaikie <span dir="ltr" class=""><<a href="mailto:dblaikie@gmail.com" target="_blank" class=""></a><a class="moz-txt-link-abbreviated" href="mailto:dblaikie@gmail.com" target="_blank">dblaikie@gmail.com</a>></span>
wrote:<br class="">
<blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">
<div dir="ltr" class=""><br class="">
<div class="gmail_extra"><br class="">
<div class="gmail_quote"><span class="">On
Wed, Mar 16, 2016 at 11:00 AM, Ehsan
Amiri <span dir="ltr" class=""><<a href="mailto:ehsanamiri@gmail.com" target="_blank" class=""></a><a class="moz-txt-link-abbreviated" href="mailto:ehsanamiri@gmail.com" target="_blank">ehsanamiri@gmail.com</a>></span>
wrote:<br class="">
<blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">
<div dir="ltr" class="">
<div class="">
<div class="">
<div class="">David,<br class="">
<br class="">
</div>
Could you give us an update on
the status of typeless pointer
work? How much work is left and
when you think it might be
ready?<br class="">
</div>
</div>
</div>
</blockquote>
<div class=""><br class="">
</div>
</span>
<div class="">It's a bit of an onion peel,
really - since it will eventually
involve generalizing/fixing every
optimization that's currently leaning on
typed pointers to keep the performance
while removing the crutch they're
currently leaning on. (in cases where
bitcasts are literally just getting in
the way, those won't require cleaning up
& should just become "free
performance wins" once we remove them,
though)<br class="">
<br class="">
At the moment we can roundtrip every
LLVM IR test case through bitcode and
textual IR (reading/writing both
formats) while using only a narrow
whitelist of places that request the
type of a pointer (things like the
verifier, the parser/printer where it
actually needs the typed pointer to
verify it matches the explicit type,
etc).<br class="">
<br class="">
The next thing on the list is probably
figuring out the byval/inalloca
representation (add an explicit pointee
type? just make the number of bytes
explicit with no type information?).<br class="">
<br class="">
Then start migrating optimizations over
- doing the same sort of testing I did
for the IR/bitcode roundtripping -
assert that the pointee type is not
accessed, whitelist places that need it
until the bitcasts go away, fix anything
else... it'll still be a fair bit of
work & I don't really know how much.
It should parallelize pretty well (doing
any of this work is really helpful, each
optimization is indepednent, etc) if
anyone wants to/is able to help.<br class="">
<br class="">
- Dave</div>
<div class="">
<div class="h5">
<div class=""> </div>
<blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">
<div dir="ltr" class="">
<div class="">
<div class=""><br class="">
</div>
Thanks<span class=""><font class="" color="#888888"><br class="">
</font></span></div>
<span class=""><font class="" color="#888888">Ehsan<br class="">
<br class="">
</font></span></div>
<div class="">
<div class="">
<div class="gmail_extra"><br class="">
<div class="gmail_quote">On
Wed, Mar 16, 2016 at 1:12
PM, David Blaikie <span dir="ltr" class=""><<a href="mailto:dblaikie@gmail.com" target="_blank" class=""></a><a class="moz-txt-link-abbreviated" href="mailto:dblaikie@gmail.com" target="_blank">dblaikie@gmail.com</a>></span>
wrote:<br class="">
<blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">
<div dir="ltr" class=""><br class="">
<div class="gmail_extra"><br class="">
<div class="gmail_quote"><span class="">On Wed,
Mar 16, 2016 at
8:34 AM, Mehdi
Amini via llvm-dev
<span dir="ltr" class=""><<a href="mailto:llvm-dev@lists.llvm.org" target="_blank" class=""></a><a class="moz-txt-link-abbreviated" href="mailto:llvm-dev@lists.llvm.org" target="_blank">llvm-dev@lists.llvm.org</a>></span>
wrote:<br class="">
<blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">
<div style="word-wrap: break-word;" class="">Hi,
<div class=""><br class="">
</div>
<div class="">How
do it interact
with the
"typeless
pointers"
work?</div>
</div>
</blockquote>
<div class=""><br class="">
</div>
</span>
<div class="">Right
- the goal of the
typeless pointer
work is to fix all
these bugs related
to "didn't look
through bitcasts"
in optimizations.
Sometimes that's
going to mean more
work (because the
code is leaning on
the absence of
bitcasts & the
presence of
convenient (but
not guaranteed)
type information
to inform
optimization
decisions) but if
we remove typed
pointers while
keeping
optimization
quality in the
cases we have
today, then we
should've also
fixed the cases
that were broken
because the type
information didn't
end up aligning to
produce the
optimal output.<br class="">
<br class="">
& I know I've
been off the
typeless pointer
stuff for a few
months working on
llvm-dwp - but
happy for any help
(the next
immediate piece is
probably figuring
out teh right
representation for
byval and inalloca
- there were some
code reviews sent
out for that that
I'll need to come
back around to -
but also any
optimizations
people want to
help
rework/improve
would be great too
& I can
provide some
techniques/tools
to help people
approach those)<br class="">
<br class="">
- Dave</div>
<div class="">
<div class="">
<div class=""> </div>
<blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">
<div style="word-wrap: break-word;" class="">
<div class=""><br class="">
</div>
<div class="">Thanks,</div>
<div class=""><br class="">
</div>
<div class="">-- </div>
<div class="">Mehdi</div>
<div class=""><br class="">
<div class="">
<blockquote class="">
<div class="">
<div class="">
<div class="">On
Mar 16, 2016,
at 6:41 AM,
Ehsan Amiri
via llvm-dev
<<a href="mailto:llvm-dev@lists.llvm.org" target="_blank" class=""></a><a class="moz-txt-link-abbreviated" href="mailto:llvm-dev@lists.llvm.org" target="_blank">llvm-dev@lists.llvm.org</a>>
wrote:</div>
<br class="">
</div>
</div>
<div class="">
<div class="">
<div class="">
<div dir="ltr" class="">===
PROBLEM ===
(See this bug
<a href="https://llvm.org/bugs/show_bug.cgi?id=26445" target="_blank" class=""></a><a class="moz-txt-link-freetext" href="https://llvm.org/bugs/show_bug.cgi?id=26445" target="_blank">https://llvm.org/bugs/show_bug.cgi?id=26445</a>)<br class="">
<br class="">
IR contains
code for
loading a
float from
float * and
storing it to
a float *
address. After
canonicalization
of load in
InstCombine
[1], new
bitcasts are
added to the
IR (see bottom
of the email
for code
samples). This
prevents
select
speculation in
SROA to work.
Also after
SROA we have
bitcasts from
int32 to
float.
(Whereas
originally
after
instCombine,
bitcasts are
only done on
pointer
types).<br class="">
<br class="">
=== PROPOSED
SOLUTION===<br class="">
<br class="">
[1] implies
that we need
load
canonicalization
when we load a
value only to
store it
again. The
reason is to
avoid
generating
slightly
different code
(due to
different ways
of adding
bitcasts), in
different
situations. In
all examples
presented in
[1] there is a
non-zero
number of
bitcasts. I
think when we
load a value
of type T from
a T* address
and store it
as a type T
value to one
or more T*
address (and
there is no
other use or
store), we can
redefine
canonical form
to mean there
should not be
any bitcasts.
So we still
have a
canonical
form, but its
definition is
slightly
different.<br class="">
<br class="">
=== REASONS
FOR /
AGAINST===<br class="">
<br class="">
Hal Finkel
warns that
while this may
be useful for
power pc, this
may hurt more
than one other
platform and
become a very
large project.
Despite this
he is fine
with bringing
up the issue
to the mailing
list to get
feedback,
mostly because
this seems
inline with
our future
direction of
having a
unique type
for all
pointers.
(Hal please
correct me if
I
misunderstood
your comment)<br class="">
<br class="">
This is a much
simpler fix
compared to
alternatives.
(ignoring
potential
regressions)<br class="">
<br class="">
===
ALTERNATIVE
SOLUTION ===<br class="">
<br class="">
Fix select
speculation in
SROA to see
through
bitcasts.
Handle
remaining
bitcasts
during code
gen. Other
alternative
solutions are
welcome.<br class="">
<br class="">
Should I
implement the
proposed
solution or is
it too risky?
I understand
that we may
need to undo
it if it
breaks too
many things.
Comments are
welcome.<br class="">
<br class="">
<br class="">
[1] <a href="http://lists.llvm.org/pipermail/llvm-dev/2015-January/080956.html" target="_blank" class=""></a><a class="moz-txt-link-freetext" href="http://lists.llvm.org/pipermail/llvm-dev/2015-January/080956.html" target="_blank">http://lists.llvm.org/pipermail/llvm-dev/2015-January/080956.html</a>
r226781 git
commit id:
b778cbc0c8<br class="">
<br class="">
<br class="">
<br class="">
Code Samples
(only relevant
part is
copied):<br class="">
<br class="">
--------------------
Before
Canonicalization
(contains call
to std::max):
--------------------
<br class="">
entry:<br class="">
%max_value =
alloca float,
align 4<br class="">
%1 = load
float, float*
%input, align
4, !tbaa !1<br class="">
store float
%1, float*
%max_value,
align 4, !tbaa
!1<br class="">
<br class="">
for.body:<br class="">
%call = call
dereferenceable(4)
float*
@_ZSt3maxIfERKT_S2_S2_(float*
dereferenceable(4)
%max_value,
float*
dereferenceable(4)
%arrayidx1)<br class="">
%3 = load
float, float*
%call, align
4, !tbaa !1<br class="">
store float
%3, float*
%max_value,
align 4, !tbaa
!1<br class="">
<br class="">
--------------------
After
Canonicalization
(contains call
to
std::max):--------------------
<br class="">
<br class="">
entry:<br class="">
%max_value =
alloca float,
align 4<br class="">
%1 = bitcast
float* %input
to i32*<br class="">
%2 = load
i32, i32* %1,
align 4, !tbaa
!1<br class="">
%3 = bitcast
float*
%max_value to
i32*<br class="">
store i32
%2, i32* %3,
align 4, !tbaa
!1<br class="">
<br class="">
for.body:<br class="">
%call = call
dereferenceable(4)
float*
@_ZSt3maxIfERKT_S2_S2_(float*
nonnull
dereferenceable(4)
%max_value,
float*
dereferenceable(4)
%arrayidx1)<br class="">
%5 = bitcast
float* %call
to i32*<br class="">
%6 = load
i32, i32* %5,
align 4, !tbaa
!1<br class="">
%7 = bitcast
float*
%max_value to
i32*<br class="">
store i32
%6, i32* %7,
align 4, !tbaa
!1<br class="">
<br class="">
--------------------
After SROA
(the call to
std::max is
inlined
now):--------------------
<br class="">
entry:<br class="">
%max_value.sroa.0
= alloca i32<br class="">
%0 = bitcast
float* %input
to i32*<br class="">
%1 = load
i32, i32* %0,
align 4, !tbaa
!1<br class="">
store i32
%1, i32*
%max_value.sroa.0<br class="">
<br class="">
for.body: <br class="">
%max_value.sroa.0.0.max_value.sroa.0.0.6
= load i32,
i32*
%max_value.sroa.0<br class="">
%3 = bitcast
i32
%max_value.sroa.0.0.max_value.sroa.0.0.6
to float<br class="">
%max_value.sroa.0.0.max_value.sroa_cast8
= bitcast i32*
%max_value.sroa.0
to float*<br class="">
%__b.__a.i =
select i1
%cmp.i, float*
%arrayidx1,
float*
%max_value.sroa.0.0.max_value.sroa_cast8<br class="">
%5 = bitcast
float*
%__b.__a.i to
i32*<br class="">
%6 = load
i32, i32* %5,
align 4, !tbaa
!1<br class="">
store i32
%6, i32*
%max_value.sroa.0<br class="">
<br class="">
--------------------
After SROA
when
Canonicalization
is turned
off--------------------
<br class="">
entry:<br class="">
%0 = load
float, float*
%input, align
4, !tbaa !1<br class="">
<br class="">
for.cond:
; preds =
%for.body,
%entry<br class="">
%max_value.0
= phi float [
%0, %entry ],
[
%.sroa.speculated,
%for.body ]<br class="">
<br class="">
for.body:<br class="">
%1 = load
float, float*
%arrayidx1,
align 4, !tbaa
!1<br class="">
%cmp.i =
fcmp olt float
%max_value.0,
%1<br class="">
%.sroa.speculate.load.true
= load float,
float*
%arrayidx1,
align 4, !tbaa
!1<br class="">
%.sroa.speculated
= select i1
%cmp.i, float
%.sroa.speculate.load.true,
float
%max_value.0<br class="">
</div>
</div>
</div>
_______________________________________________<br class="">
LLVM
Developers
mailing list<br class="">
<a href="mailto:llvm-dev@lists.llvm.org" target="_blank" class=""></a><a class="moz-txt-link-abbreviated" href="mailto:llvm-dev@lists.llvm.org" target="_blank">llvm-dev@lists.llvm.org</a><br class="">
<a href="http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev" target="_blank" class=""></a><a class="moz-txt-link-freetext" href="http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev" target="_blank">http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev</a><br class="">
</div>
</blockquote>
</div>
<br class="">
</div>
</div>
<br class="">
_______________________________________________<br class="">
LLVM
Developers
mailing list<br class="">
<a href="mailto:llvm-dev@lists.llvm.org" target="_blank" class=""></a><a class="moz-txt-link-abbreviated" href="mailto:llvm-dev@lists.llvm.org" target="_blank">llvm-dev@lists.llvm.org</a><br class="">
<a href="http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev" rel="noreferrer" target="_blank" class=""></a><a class="moz-txt-link-freetext" href="http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev" target="_blank">http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev</a><br class="">
<br class="">
</blockquote>
</div>
</div>
</div>
<br class="">
</div>
</div>
</blockquote>
</div>
<br class="">
</div>
</div>
</div>
</blockquote>
</div>
</div>
</div>
<br class="">
</div>
</div>
</blockquote>
</div>
<br class="">
</div>
</div>
</blockquote>
</div>
<br class="">
</div>
</div>
</div>
<br class="">
<fieldset class="mimeAttachmentHeader"></fieldset>
<br class="">
<pre class="">_______________________________________________<br>LLVM Developers mailing list<br><a class="moz-txt-link-abbreviated" href="mailto:llvm-dev@lists.llvm.org" target="_blank">llvm-dev@lists.llvm.org</a><br><a class="moz-txt-link-freetext" href="http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev" target="_blank">http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev</a><br></pre>
</blockquote>
<br class="">
</div>
</div></blockquote></div><br class=""></div><br>_______________________________________________<br>LLVM Developers mailing list<br>llvm-dev@lists.llvm.org<br>http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev<br></blockquote><br><br><br>-- <br><div><span name="x"></span>Hal Finkel<br>Assistant Computational Scientist<br>Leadership Computing Facility<br>Argonne National Laboratory<span name="x"></span><br></div></div></body></html>