[llvm] [SLP][AMDGPU] Vectorize operands of non-trivially-vectorizable intrinsic calls (PR #189784)
Alexey Bataev via llvm-commits
llvm-commits at lists.llvm.org
Sun Apr 5 10:17:11 PDT 2026
================
@@ -29477,6 +29507,28 @@ bool SLPVectorizerPass::vectorizeChainsInBlock(BasicBlock *BB, BoUpSLP &R) {
PostProcessCmps.insert(cast<CmpInst>(&*It));
}
+ // Collect operands of non-trivially vectorizable intrinsic calls (e.g.,
+ // llvm.amdgcn.exp2) and group by intrinsic ID, so their operands can be
+ // vectorized independently.
+ // FIXME: Extend for all non-vectorized functions.
+ SmallMapVector<std::pair<Intrinsic::ID, unsigned>, SmallVector<Value *, 4>, 4>
+ OpcodeGroups;
+
+ for (Instruction &I : *BB) {
----------------
alexey-bataev wrote:
Why in a separate loop?
https://github.com/llvm/llvm-project/pull/189784
More information about the llvm-commits
mailing list