[llvm] [NVPTX] Stop using 16-bit CAS instructions from PTX (PR #120220)

Artem Belevich via llvm-commits llvm-commits at lists.llvm.org
Wed Jan 22 11:36:22 PST 2025


Artem-B wrote:

> So, for this common use case performing the emulation direclty in LLVM by using CAS 32-bit results in better codegen than generating these CAS loops using atom.cas.b16.

Fascinating. Thank you for the detailed explanation.

https://github.com/llvm/llvm-project/pull/120220


More information about the llvm-commits mailing list