<div dir="ltr">FYI, see also the previous discussion about "speculatable":<div><br></div><div> <a href="http://lists.cs.uiuc.edu/pipermail/llvmdev/2013-July/064426.html">http://lists.cs.uiuc.edu/pipermail/llvmdev/2013-July/064426.html</a></div>
<div><br></div><div>I think such an attribute should be added.</div><div><br></div><div>In the thread which lead up to that thread, I proposed using more fine-grained attributes and Michael rightly pointed out the problem with that: you'd need one for every possible form of undefined behaviour. You listed "nofptrap", "nodivtrap" and "nomemtrap", but you didn't say "nounreachabletrap". Whoops!</div>
<div><br></div><div>Nick<br><div class="gmail_extra"><br><br><div class="gmail_quote">On 31 October 2013 07:38, Pekka Jääskeläinen <span dir="ltr"><<a href="mailto:pekka.jaaskelainen@tut.fi" target="_blank">pekka.jaaskelainen@tut.fi</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">Hello,<br>
<br>
OpenCL C specifies that instructions should not trap (it is "discouraged"<br>
in the specs). If they do, it's vendor-specific how the hardware<br>
exceptions are handled.<br>
<br>
It might be also the case with some other (future) languages targeting "streamlined" parallel accelerators in an heterogeneous setting.<br>
At least CUDA comes to mind. What about OpenACC and the new OpenMP,<br>
does someone know offhand?<br>
<br>
It would help several optimizations if they could assume certain<br>
instructions do not trap. E.g., I was looking at the if-conversion of<br>
the loop vectorizer, and it seems to not support speculating stores,<br>
divs, etc. which could be done if we knew it's safe to speculatively<br>
execute them.<br>
<br>
[In this particular if-conversion case proper predicated execution<br>
(not speculative) would require predicates to be added for all LLVM<br>
instructions so they could be squashed. I think this was discussed<br>
several years ago in the context of a generic IR-level if-conversion<br>
pass, but it seems such a thing did not realize eventually.]<br>
<br>
Anyways, "speculative" if-conversion is just one example where knowing<br>
that traps need not to be considered in the function at hand<br>
would help the optimizations. Also other speculative code motion<br>
optimizations, e.g., LICM, could benefit from it.<br>
<br>
One way would be to introduce a new function attribute. Functions (e.g.,<br>
OpenCL C or CUDA kernels) could be marked with an attribute that states<br>
that the instructions can be assumed not to trap -- it's a programmer's or<br>
the runtime's mistake if they do. The runtime should change the fp<br>
computation mode to the non-trapping one before calling such<br>
a function (this is actually stated in the OpenCL specs). If such<br>
handling is not supported by the target, then the attribute should not<br>
be added the first place.<br>
<br>
The attribute could be called 'notrap' which would include the<br>
semantics of any trap caused by any instruction. Or that could be<br>
split, just in case the hardware is known not to support one of the<br>
features. Three could suffice: 'nofptrap' (no IEEE FP exceptions),<br>
'nodivtrap' (no divide by zero exceptions, undef value output instead),<br>
'nomemtrap' (no mem exceptions).<br>
<br>
What do you think of the general idea? Or is there something similar<br>
already that can accomplish this?<br>
<br>
Thanks in advance,<span class=""><font color="#888888"><br>
-- <br>
Pekka<br>
______________________________<u></u>_________________<br>
LLVM Developers mailing list<br>
<a href="mailto:LLVMdev@cs.uiuc.edu" target="_blank">LLVMdev@cs.uiuc.edu</a> <a href="http://llvm.cs.uiuc.edu" target="_blank">http://llvm.cs.uiuc.edu</a><br>
<a href="http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev" target="_blank">http://lists.cs.uiuc.edu/<u></u>mailman/listinfo/llvmdev</a><br>
</font></span></blockquote></div><br></div></div></div>