[llvm-dev] Can I control HSA config generated by AMDGPU backend?
    Changdao Dong via llvm-dev 
    llvm-dev at lists.llvm.org
       
    Wed Sep  5 12:26:11 PDT 2018
    
    
  
The target algorithm is lyra2 and the target kernel is "search2" in
https://github.com/fancyIX/sgminer-phi2-branch/blob/master/kernel/phi2.cl.
The detail is implemented in
https://github.com/fancyIX/sgminer-phi2-branch/blob/master/kernel/lyra2mdz.cl
If you have time to take a look at the assembly, I can upload them later
today.
Thanks,
    Changdao
On Wed, Sep 5, 2018 at 11:32 AM Matt Arsenault <arsenm2 at gmail.com> wrote:
>
>
> On Sep 5, 2018, at 23:17, Changdao Dong via llvm-dev <
> llvm-dev at lists.llvm.org> wrote:
>
>
> Finally I kind of modified llvm to generate assembly that can run on
> AMDGPU pro drivers. One problem is the performance of the code generated by
> llvm is about 10% slower than amdgpu's online compiler. Anything I can tune
> the performance up the performance of llvm?\
>
> This is very dependent on the case you are looking at, so without a
> specific example or ISA comparison between the compilers there’s no guessing
>
> -Matt
>
-- 
DONG, Changdao
MP: 1-412-551-2330
dongchangdao at gmail.com <cddong at cmu.edu>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20180905/ed1da73f/attachment.html>
    
    
More information about the llvm-dev
mailing list