[llvm] [SLPVectorizer][NVPTX] Customize getBuildVectorCost for NVPTX (PR #128077)
via llvm-commits
llvm-commits at lists.llvm.org
Fri Feb 21 11:03:20 PST 2025
peterbell10 wrote:
I think your example is a bit misleading because it includes the argument-passing convention. If we read the values from gmem instead, the mov becomes a single SASS instruction:
https://godbolt.org/z/rKeGTrj96
I am a bit surprised, though, that ptxas keeps those extra `PRMT` instructions in the argument-passing case. AFAICT they zero out the top 16 bits, which are then immediately discarded.
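
To illustrate why that zeroing looks redundant, here is a rough C++ model of the bit manipulation, not actual PTX or SASS. It assumes each 16-bit argument sits in the low half of a 32-bit register and that the vector build only reads those low halves. All function names are hypothetical.

```cpp
#include <cstdint>

// Hypothetical model: clear the upper 16 bits of a 32-bit register,
// which is what the extra PRMT appears to do.
constexpr std::uint32_t zero_top16(std::uint32_t reg) {
    return reg & 0x0000FFFFu;
}

// Hypothetical model of building a 2 x 16-bit vector: only the low
// 16 bits of each input register end up in the result.
constexpr std::uint32_t pack_low_halves(std::uint32_t lo_reg,
                                        std::uint32_t hi_reg) {
    return (lo_reg & 0xFFFFu) | ((hi_reg & 0xFFFFu) << 16);
}

// Argument-passing shape: zero the top halves first, then pack.
constexpr std::uint32_t pack_with_zeroing(std::uint32_t a, std::uint32_t b) {
    return pack_low_halves(zero_top16(a), zero_top16(b));
}

// Same pack without the zeroing step.
constexpr std::uint32_t pack_without_zeroing(std::uint32_t a, std::uint32_t b) {
    return pack_low_halves(a, b);
}

// The upper halves (0xDEAD, 0xBEEF) never reach the result either way.
static_assert(pack_with_zeroing(0xDEAD1234u, 0xBEEF5678u) ==
                  pack_without_zeroing(0xDEAD1234u, 0xBEEF5678u),
              "zeroing the top 16 bits does not change the packed value");
```

Under these assumptions the two variants agree for every input, since the pack masks off the same bits the zeroing step clears.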
https://github.com/llvm/llvm-project/pull/128077