https://github.com/wangpc-pp commented: I don't know if this is the right approach, but it really improves the codegen quality now. We still lack the abilities to handle vp intrinsics in instcombine and DAGCombiner. https://github.com/llvm/llvm-project/pull/126177