[llvm] [NVPTX] fold movs into loads and stores (PR #144581)
Princeton Ferro via llvm-commits
llvm-commits at lists.llvm.org
Tue Jun 17 19:19:37 PDT 2025
Prince781 wrote:
> One thing I wonder about is how this changes the PTX semantics of a program. If we change from loading something as a b32 to a v2.b16 will this impact the memory consistency guarantees in PTX?
I'm not aware of anything in the spec that leads to different semantics. I would think since the underlying data being accessed and the alignment requirements are the same in both cases, they are essentially the "same operation" using different amounts of registers.
https://github.com/llvm/llvm-project/pull/144581
More information about the llvm-commits
mailing list