[llvm] [AMDGPU] - Generate s_bitreplicate_b64_b32 (PR #69209)
Nicolai Hähnle via llvm-commits
llvm-commits at lists.llvm.org
Sun Oct 22 18:36:28 PDT 2023
nhaehnle wrote:
The intrinsic is marked `convergent` precisely so that the waterfall loop isn't needed.
https://github.com/llvm/llvm-project/pull/69209
More information about the llvm-commits
mailing list