[llvm] [SLPVectorizer][AMDGPU] Look-through vectorization of AMDGPU intrinsics (PR #175821)
via llvm-commits
llvm-commits at lists.llvm.org
Wed Apr 1 04:15:11 PDT 2026
xgxanq wrote:
> > > 3. `<user subtree>` is profitable, `<operand subtree>` is not profitable for the vectorization.I don't understand if mix the user subtree and operand subtree, the answer can also be not profitable or profitable. I can't understand broke the tree to consider is better.
>
> 3rd case can be handled by the tree trimming (throttling), but it affects compile time. That's another reason why the 2-independent-trees approach is better.
I still don't fully see why this would be better, but I'll keep learning and working to understand this part better.
https://github.com/llvm/llvm-project/pull/175821
More information about the llvm-commits
mailing list