[llvm] [X86] shouldReduceLoadWidth - don't split loads if ANY uses are a extract+store or a full width legal binop (PR #129695)

Phoebe Wang via llvm-commits llvm-commits at lists.llvm.org
Wed Mar 5 01:08:33 PST 2025


================
@@ -4228,7 +4228,7 @@ define <4 x float> @uitofp_load_4i64_to_4f32(ptr%a) {
 ; AVX1:       # %bb.0:
 ; AVX1-NEXT:    vmovdqa (%rdi), %ymm0
 ; AVX1-NEXT:    vpsrlq $1, %xmm0, %xmm1
-; AVX1-NEXT:    vmovdqa 16(%rdi), %xmm2
+; AVX1-NEXT:    vextractf128 $1, %ymm0, %xmm2
----------------
phoebewang wrote:

Considering data is already in cache, is vextractf128 better than vmovdqa? In any targets?

https://github.com/llvm/llvm-project/pull/129695


More information about the llvm-commits mailing list