<div class="gmail_quote">On Wed, Nov 16, 2011 at 8:05 AM, Alberto Magni <span dir="ltr">&lt;<a href="mailto:alberto.magni86@gmail.com" target="_blank">alberto.magni86@gmail.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">


Dear Justin,<br>

<br>

I am trying to add support for some OpenCL builtin functions to<br>

the PTX backend.<br>

The attached file represents the first stub of a patch for the fmax<br>

builtin function.<br></blockquote><div><br></div><div>First off, thanks for helping to improve the PTX back-end!</div><div><br></div><div>There are really two main issues here.  First, OpenCL built-in functions do not belong in the PTX back-end.  These will be implemented in the libclc library (<a href="http://www.pcc.me.uk/~peter/libclc/" target="_blank">http://www.pcc.me.uk/~peter/libclc</a>).  The back-end will only implement PTX intrinsics, which may be used by the OpenCL built-in functions in libclc.  However, this particular function (max) corresponds to a PTX instruction, so it makes sense to implement it as an intrinsic in the back-end.</div>


<div><br></div><div>Second, intrinsic functions require a bit more work.  You're off to a great start, but intrinsics are implemented a bit differently.  It looks like LLVM does not have a max intrinsic, so we'll need to create one.  Have a look at include/llvm/IntrinsicsPTX.td.  This file defines the PTX-specific intrinsics.  You can add an intrinsic for max here, and then implement a pattern-match in the PTXInstrInfo.td file.  There is no need to create a new SDNode type for intrinsics, unless they require some special handling in the C++ code, which I do not see being the case here.</div>

<div><br></div><div>When you define a new intrinsic, use the following template as a name: int_ptx_max.  This will define the LLVM intrinsic as @llvm.ptx.max().  Please follow the same convention when naming the __builtin_* function.</div>
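<div><br></div><div>As a rough sketch of where this leads, assuming the intrinsic is added under the suggested name and takes two float operands, the test case quoted below would call the intrinsic instead of fmax.  Note that @llvm.ptx.max does not exist in LLVM yet, so the declaration here is hypothetical until the IntrinsicsPTX.td entry and the matching pattern are in place:</div>
<pre>
; Hypothetical: assumes int_ptx_max has been added to IntrinsicsPTX.td
; with the signature float (float, float).
define ptx_device float @f(float %x, float %y) {
entry:
  ; Call the PTX intrinsic rather than the external fmax function.
  %z = call float @llvm.ptx.max(float %x, float %y)
  ret float %z
}

; Intrinsics are declared, never defined, in the module.
declare float @llvm.ptx.max(float, float)
</pre>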

<div><br></div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">


<br>

The test case I am trying is the following:<br>

<br>

define ptx_device float @f(float %x, float %y) {<br>

entry:<br>

  %z = call float @fmax(float %x, float %y)<br>

  ret float %z<br>

}<br>

<br>

declare float @fmax(float, float)<br>

<br>

But at the moment llc crashes, saying that "calls are not supported".<br>

This does not happen with LLVM builtins like llvm.sqrt.f32.<br></blockquote><div><br></div><div>Which version of LLVM are you using?  Calls to PTX device functions have been implemented for a little while now, so I'm surprised to see that error.  Perhaps it's because the fmax function is not defined as ptx_device.</div>


<div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

<br>

Can you please give me a hint on what I am missing, or some general<br>

advice on how to add builtin functions?<br>

<br>

Thank you in advance,<br>

<br>

Alberto.<br>

<br></blockquote></div><br><br clear="all"><div><br></div>-- <br><br><div>Thanks,</div><div><br></div><div>Justin Holewinski</div><br>