[llvm] [NVPTX] Preserve v16i8 vector loads when legalizing (PR #67322)
Drew Kersnar via llvm-commits
llvm-commits at lists.llvm.org
Thu Dec 5 10:15:21 PST 2024
dakersnar wrote:
Sounds good, I opened an issue here: https://github.com/llvm/llvm-project/issues/118851, we can continue the discussion there 👍.
One note I'll comment on here:
> Conversion often kill any benefit of vectorizing the wide load/store, so it may not be worth it in general case, where the rest of the code would use loaded data in element-wise fashion.
My understanding is that chunks of `load/store v4i8` gets lowered to `u32` currently, so the conversion/extraction is already happening regardless, but currently v8i8 gets lowered to two u32s instead of a v2.u32, which seems like a clear loss. See the repro in my issue.
https://github.com/llvm/llvm-project/pull/67322
More information about the llvm-commits
mailing list