[clang] [llvm] [SROA] Vector promote some memsets (PR #133301)

Nikita Popov via cfe-commits cfe-commits at lists.llvm.org
Tue Jun 17 15:20:03 PDT 2025


nikic wrote:

On x86, what we actually end up doing is to combine those to unaligned i64 loads (see https://godbolt.org/z/P5z674x4r), which is probably the best outcome if they are supported. I assume AMDGPU does not support unaligned loads, and that's why you want to have single element loads that get inserted into a vector and then perform sub-vector extracts on it?

https://github.com/llvm/llvm-project/pull/133301


More information about the cfe-commits mailing list