[llvm] [RISCV] Tune flag for fast vrgather.vv (PR #124664)
Craig Topper via llvm-commits
llvm-commits at lists.llvm.org
Tue Feb 11 11:15:47 PST 2025
================
@@ -1365,6 +1365,10 @@ def FeatureUnalignedVectorMem
"true", "Has reasonably performant unaligned vector "
"loads and stores">;
+def TuneFastVRGather
+ : SubtargetFeature<"fast-vrgather", "HasFastVRGather",
+ "true", "Has vrgather.vv with LMUL*log2(LMUL) latency">;
----------------
topperc wrote:
x280 cost is linear in VL. Later x280 is constant time for VL<= (VLEN/2)/SEW (single DLEN), but linear in VL for everything else. Later x280 is quadratic in LMUL*2 except for fractional LMUL or when element at a time would be faster (i.e. there are less elements in a destination DLEN than the number of possible source DLENs for them).
https://github.com/llvm/llvm-project/pull/124664
More information about the llvm-commits
mailing list