[llvm] [AMDGPU] - Generate s_bitreplicate_b64_b32 (PR #69209)
    Nicolai Hähnle via llvm-commits 
    llvm-commits at lists.llvm.org
       
    Sun Oct 22 18:36:28 PDT 2023
    
    
  
nhaehnle wrote:
The intrinsic is marked `convergent` precisely so that the waterfall loop isn't needed.
https://github.com/llvm/llvm-project/pull/69209
    
    
More information about the llvm-commits
mailing list