[llvm] [LV] Autovectorization for the all-in-one histogram intrinsic (PR #91458)
Graham Hunter via llvm-commits
llvm-commits at lists.llvm.org
Wed Jun 12 06:42:15 PDT 2024
================
@@ -0,0 +1,82 @@
+; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --version 3
+; RUN: opt < %s -passes=loop-vectorize -force-vector-interleave=1 -force-target-instruction-cost=1 -S | FileCheck %s
----------------
huntergr-arm wrote:
We can add a new flag for histcnt cost, but it's actually the gathers and scatters which prevent vectorization here. I've changed to use -sve-gather-overhead and -sve-scatter-overhead instead.
I do think we've made gathers and scatters too expensive by default for SVE, but we'll need to spend some time figuring out which loops get worse with it on and working out a better model.
https://github.com/llvm/llvm-project/pull/91458
More information about the llvm-commits
mailing list