[PATCH] D57059: [SLP] Initial support for the vectorization of the non-power-of-2 vectors.
Alexey Bataev via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Feb 10 06:05:41 PST 2021
ABataev added a comment.
Extra numbers:
**AVX512, O3 <https://reviews.llvm.org/owners/package/3/>+LTO, -march=native**
Metric: SLP.NumVectorInstructions
Program lhs rhs diff
test-suite :: External/SPEC/CINT2017rate/500.perlbench_r/500.perlbench_r.test 146.00 148.00 1.4%
test-suite :: External/SPEC/CINT2017speed/600.perlbench_s/600.perlbench_s.test 146.00 148.00 1.4%
test-suite :: External/SPEC/CINT2017speed/631.deepsjeng_s/631.deepsjeng_s.test 34.00 34.00 0.0%
test-suite :: External/SPEC/CINT2017speed/605.mcf_s/605.mcf_s.test 11.00 11.00 0.0%
test-suite :: External/SPEC/CINT2017rate/531.deepsjeng_r/531.deepsjeng_r.test 34.00 34.00 0.0%
test-suite :: External/SPEC/CINT2017rate/505.mcf_r/505.mcf_r.test 11.00 11.00 0.0%
test-suite :: External/SPEC/CINT2017rate/523.xalancbmk_r/523.xalancbmk_r.test 5587.00 5560.00 -0.5%
test-suite :: External/SPEC/CINT2017speed/623.xalancbmk_s/623.xalancbmk_s.test 5587.00 5560.00 -0.5%
test-suite :: External/SPEC/CFP2017rate/508.namd_r/508.namd_r.test 7384.00 7341.00 -0.6%
test-suite :: External/SPEC/CFP2017rate/510.parest_r/510.parest_r.test 9607.00 9359.00 -2.6%
test-suite :: External/SPEC/CFP2017rate/511.povray_r/511.povray_r.test 5340.00 5178.00 -3.0%
test-suite :: External/SPEC/CINT2017rate/525.x264_r/525.x264_r.test 1053.00 1006.00 -4.5%
test-suite :: External/SPEC/CINT2017speed/625.x264_s/625.x264_s.test 1053.00 1006.00 -4.5%
test-suite :: External/SPEC/CINT2017rate/541.leela_r/541.leela_r.test 141.00 134.00 -5.0%
test-suite :: External/SPEC/CINT2017speed/641.leela_s/641.leela_s.test 141.00 134.00 -5.0%
test-suite :: External/SPEC/CFP2017speed/638.imagick_s/638.imagick_s.test 3996.00 3563.00 -10.8%
test-suite :: External/SPEC/CFP2017rate/538.imagick_r/538.imagick_r.test 3996.00 3563.00 -10.8%
test-suite :: External/SPEC/CINT2017rate/502.gcc_r/502.gcc_r.test 862.00 767.00 -11.0%
test-suite :: External/SPEC/CINT2017speed/602.gcc_s/602.gcc_s.test 862.00 767.00 -11.0%
test-suite :: External/SPEC/CINT2017rate/520.omnetpp_r/520.omnetpp_r.test 524.00 463.00 -11.6%
test-suite :: External/SPEC/CINT2017speed/620.omnetpp_s/620.omnetpp_s.test 524.00 463.00 -11.6%
test-suite :: External/SPEC/CFP2017speed/644.nab_s/644.nab_s.test 426.00 370.00 -13.1%
test-suite :: External/SPEC/CFP2017rate/544.nab_r/544.nab_r.test 426.00 370.00 -13.1%
test-suite :: External/SPEC/CFP2017rate/526.blender_r/526.blender_r.test 15945.00 12573.00 -21.1%
test-suite :: External/SPEC/CFP2017rate/519.lbm_r/519.lbm_r.test NaN 16.00 nan%
test-suite :: External/SPEC/CFP2017speed/619.lbm_s/619.lbm_s.test NaN 16.00 nan%
Geomean difference nan%
**AVX512, O3 <https://reviews.llvm.org/owners/package/3/>+LTO**
Metric: SLP.NumVectorInstructions
Program lhs rhs diff
test-suite :: External/SPEC/CINT2017speed/641.leela_s/641.leela_s.test 22.00 60.00 172.7%
test-suite :: External/SPEC/CINT2017rate/541.leela_r/541.leela_r.test 22.00 60.00 172.7%
test-suite :: External/SPEC/CINT2017speed/600.perlbench_s/600.perlbench_s.test 68.00 72.00 5.9%
test-suite :: External/SPEC/CINT2017rate/500.perlbench_r/500.perlbench_r.test 68.00 72.00 5.9%
test-suite :: External/SPEC/CINT2017rate/505.mcf_r/505.mcf_r.test 11.00 11.00 0.0%
test-suite :: External/SPEC/CINT2017speed/631.deepsjeng_s/631.deepsjeng_s.test 10.00 10.00 0.0%
test-suite :: External/SPEC/CINT2017speed/623.xalancbmk_s/623.xalancbmk_s.test 3396.00 3396.00 0.0%
test-suite :: External/SPEC/CINT2017speed/605.mcf_s/605.mcf_s.test 11.00 11.00 0.0%
test-suite :: External/SPEC/CINT2017rate/531.deepsjeng_r/531.deepsjeng_r.test 10.00 10.00 0.0%
test-suite :: External/SPEC/CINT2017rate/523.xalancbmk_r/523.xalancbmk_r.test 3396.00 3396.00 0.0%
test-suite :: External/SPEC/CINT2017speed/620.omnetpp_s/620.omnetpp_s.test 499.00 497.00 -0.4%
test-suite :: External/SPEC/CINT2017rate/520.omnetpp_r/520.omnetpp_r.test 499.00 497.00 -0.4%
test-suite :: External/SPEC/CINT2017rate/525.x264_r/525.x264_r.test 838.00 826.00 -1.4%
test-suite :: External/SPEC/CINT2017speed/625.x264_s/625.x264_s.test 838.00 826.00 -1.4%
test-suite :: External/SPEC/CFP2017rate/511.povray_r/511.povray_r.test 6090.00 5906.00 -3.0%
test-suite :: External/SPEC/CFP2017speed/619.lbm_s/619.lbm_s.test 131.00 127.00 -3.1%
test-suite :: External/SPEC/CFP2017rate/519.lbm_r/519.lbm_r.test 131.00 127.00 -3.1%
test-suite :: External/SPEC/CFP2017rate/508.namd_r/508.namd_r.test 8815.00 8452.00 -4.1%
test-suite :: External/SPEC/CFP2017speed/638.imagick_s/638.imagick_s.test 2864.00 2712.00 -5.3%
test-suite :: External/SPEC/CFP2017rate/538.imagick_r/538.imagick_r.test 2864.00 2712.00 -5.3%
test-suite :: External/SPEC/CFP2017rate/526.blender_r/526.blender_r.test 16049.00 14753.00 -8.1%
test-suite :: External/SPEC/CINT2017rate/502.gcc_r/502.gcc_r.test 686.00 621.00 -9.5%
test-suite :: External/SPEC/CINT2017speed/602.gcc_s/602.gcc_s.test 686.00 621.00 -9.5%
test-suite :: External/SPEC/CFP2017speed/644.nab_s/644.nab_s.test 551.00 473.00 -14.2%
test-suite :: External/SPEC/CFP2017rate/544.nab_r/544.nab_r.test 551.00 473.00 -14.2%
test-suite :: External/SPEC/CFP2017rate/510.parest_r/510.parest_r.test 16240.00 13941.00 -14.2%
Geomean difference 4.3%
**AVX2, O3 <https://reviews.llvm.org/owners/package/3/>+LTO, -march=native**
Metric: SLP.NumVectorInstructions
Program lhs rhs diff
test-suite :: External/SPEC/CFP2017rate/508.namd_r/508.namd_r.test 7309.00 7341.00 0.4%
test-suite :: External/SPEC/CINT2017rate/505.mcf_r/505.mcf_r.test 11.00 11.00 0.0%
test-suite :: External/SPEC/CINT2017speed/631.deepsjeng_s/631.deepsjeng_s.test 34.00 34.00 0.0%
test-suite :: External/SPEC/CINT2017speed/623.xalancbmk_s/623.xalancbmk_s.test 5490.00 5490.00 0.0%
test-suite :: External/SPEC/CINT2017speed/605.mcf_s/605.mcf_s.test 11.00 11.00 0.0%
test-suite :: External/SPEC/CINT2017rate/531.deepsjeng_r/531.deepsjeng_r.test 34.00 34.00 0.0%
test-suite :: External/SPEC/CINT2017rate/523.xalancbmk_r/523.xalancbmk_r.test 5490.00 5490.00 0.0%
test-suite :: External/SPEC/CINT2017speed/620.omnetpp_s/620.omnetpp_s.test 462.00 455.00 -1.5%
test-suite :: External/SPEC/CINT2017rate/520.omnetpp_r/520.omnetpp_r.test 462.00 455.00 -1.5%
test-suite :: External/SPEC/CFP2017rate/510.parest_r/510.parest_r.test 9508.00 9347.00 -1.7%
test-suite :: External/SPEC/CFP2017rate/511.povray_r/511.povray_r.test 5393.00 5190.00 -3.8%
test-suite :: External/SPEC/CINT2017rate/525.x264_r/525.x264_r.test 1066.00 968.00 -9.2%
test-suite :: External/SPEC/CINT2017speed/625.x264_s/625.x264_s.test 1066.00 968.00 -9.2%
test-suite :: External/SPEC/CINT2017rate/541.leela_r/541.leela_r.test 151.00 134.00 -11.3%
test-suite :: External/SPEC/CINT2017speed/641.leela_s/641.leela_s.test 151.00 134.00 -11.3%
test-suite :: External/SPEC/CINT2017rate/500.perlbench_r/500.perlbench_r.test 160.00 141.00 -11.9%
test-suite :: External/SPEC/CINT2017speed/600.perlbench_s/600.perlbench_s.test 160.00 141.00 -11.9%
test-suite :: External/SPEC/CINT2017speed/602.gcc_s/602.gcc_s.test 820.00 722.00 -12.0%
test-suite :: External/SPEC/CINT2017rate/502.gcc_r/502.gcc_r.test 820.00 722.00 -12.0%
test-suite :: External/SPEC/CFP2017rate/538.imagick_r/538.imagick_r.test 3605.00 3173.00 -12.0%
test-suite :: External/SPEC/CFP2017speed/638.imagick_s/638.imagick_s.test 3605.00 3173.00 -12.0%
test-suite :: External/SPEC/CFP2017rate/544.nab_r/544.nab_r.test 438.00 370.00 -15.5%
test-suite :: External/SPEC/CFP2017speed/644.nab_s/644.nab_s.test 438.00 370.00 -15.5%
test-suite :: External/SPEC/CFP2017rate/526.blender_r/526.blender_r.test 14842.00 12463.00 -16.0%
test-suite :: External/SPEC/CFP2017speed/619.lbm_s/619.lbm_s.test 106.00 79.00 -25.5%
test-suite :: External/SPEC/CFP2017rate/519.lbm_r/519.lbm_r.test 106.00 79.00 -25.5%
Geomean difference -8.7%
**AVX2, O3 <https://reviews.llvm.org/owners/package/3/>+LTO**
Metric: SLP.NumVectorInstructions
Program lhs rhs diff
test-suite :: External/SPEC/CINT2017speed/641.leela_s/641.leela_s.test 22.00 60.00 172.7%
test-suite :: External/SPEC/CINT2017rate/541.leela_r/541.leela_r.test 22.00 60.00 172.7%
test-suite :: External/SPEC/CINT2017speed/600.perlbench_s/600.perlbench_s.test 68.00 72.00 5.9%
test-suite :: External/SPEC/CINT2017rate/500.perlbench_r/500.perlbench_r.test 68.00 72.00 5.9%
test-suite :: External/SPEC/CINT2017rate/505.mcf_r/505.mcf_r.test 11.00 11.00 0.0%
test-suite :: External/SPEC/CINT2017speed/631.deepsjeng_s/631.deepsjeng_s.test 10.00 10.00 0.0%
test-suite :: External/SPEC/CINT2017speed/623.xalancbmk_s/623.xalancbmk_s.test 3396.00 3396.00 0.0%
test-suite :: External/SPEC/CINT2017speed/605.mcf_s/605.mcf_s.test 11.00 11.00 0.0%
test-suite :: External/SPEC/CINT2017rate/531.deepsjeng_r/531.deepsjeng_r.test 10.00 10.00 0.0%
test-suite :: External/SPEC/CINT2017rate/523.xalancbmk_r/523.xalancbmk_r.test 3396.00 3396.00 0.0%
test-suite :: External/SPEC/CINT2017speed/620.omnetpp_s/620.omnetpp_s.test 499.00 497.00 -0.4%
test-suite :: External/SPEC/CINT2017rate/520.omnetpp_r/520.omnetpp_r.test 499.00 497.00 -0.4%
test-suite :: External/SPEC/CINT2017rate/525.x264_r/525.x264_r.test 838.00 826.00 -1.4%
test-suite :: External/SPEC/CINT2017speed/625.x264_s/625.x264_s.test 838.00 826.00 -1.4%
test-suite :: External/SPEC/CFP2017speed/619.lbm_s/619.lbm_s.test 131.00 127.00 -3.1%
test-suite :: External/SPEC/CFP2017rate/519.lbm_r/519.lbm_r.test 131.00 127.00 -3.1%
test-suite :: External/SPEC/CFP2017rate/511.povray_r/511.povray_r.test 6094.00 5906.00 -3.1%
test-suite :: External/SPEC/CFP2017rate/508.namd_r/508.namd_r.test 8734.00 8452.00 -3.2%
test-suite :: External/SPEC/CFP2017speed/638.imagick_s/638.imagick_s.test 2867.00 2712.00 -5.4%
test-suite :: External/SPEC/CFP2017rate/538.imagick_r/538.imagick_r.test 2867.00 2712.00 -5.4%
test-suite :: External/SPEC/CFP2017rate/526.blender_r/526.blender_r.test 16026.00 14753.00 -7.9%
test-suite :: External/SPEC/CINT2017rate/502.gcc_r/502.gcc_r.test 686.00 621.00 -9.5%
test-suite :: External/SPEC/CINT2017speed/602.gcc_s/602.gcc_s.test 686.00 621.00 -9.5%
test-suite :: External/SPEC/CFP2017rate/510.parest_r/510.parest_r.test 16241.00 13941.00 -14.2%
test-suite :: External/SPEC/CFP2017speed/644.nab_s/644.nab_s.test 559.00 473.00 -15.4%
test-suite :: External/SPEC/CFP2017rate/544.nab_r/544.nab_r.test 559.00 473.00 -15.4%
Geomean difference 4.2%
Will update the patch soon.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D57059/new/
https://reviews.llvm.org/D57059
More information about the llvm-commits
mailing list