[llvm] [Vectorize] Vectorization for __builtin_prefetch (PR #66160)

Sun Jan 21 10:15:58 PST 2024

davemgreen wrote:

A couple of high level comments:
 - Is the main idea to make sure that we do vectorize the loop, or are the prefetches important for performance?
 - Under the assumption that prefetching works on cache lines we could consider scalarizing the prefetches instead, so just normal prefetches are emitted.
 - Or alternatively under the assumption that the users often put the prefetches in the wrong place, we could consider just dropping them from the vectorized loop.
 - Having said that, SVE does these instructions so it would make sense to add them.

I think it might make sense to have an initial patch that introduces the intrinsics+langref and adds the codegen support for them - both for genetic targets and for SVE. Once that is in it would make adding tests for vectorization easier.

https://github.com/llvm/llvm-project/pull/66160