[PATCH] D110524: [AArch64ISelLowering] Avoid duplane in some cases when sve enabled

David Sherwood via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Sep 28 05:34:04 PDT 2021


david-arm added a comment.

Hi @guopeilin, thanks for making the changes. I'm still a little concerned about the test case though as I'm not sure how reliable it will be over time, especially with the `undef` values present in the code. I did manage to reduce this test case to:

  ; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
  ; RUN: llc -mattr=+sve -aarch64-sve-vector-bits-min=256 < %s | FileCheck %s
  target datalayout = "e-m:e-i8:8:32-i16:16:32-i64:64-i128:128-n32:64-S128"
  target triple = "aarch64-unknown-linux-gnu"
  
  define <4 x i32> @test_work_knt_val(<16 x i32>* %arg) {
  entry:
    %0 = load <16 x i32>, <16 x i32>* %arg, align 256
    %shvec = shufflevector <16 x i32> %0, <16 x i32> undef, <4 x i32> <i32 14, i32 14, i32 14, i32 14>
    %1 = add <16 x i32> %0, %0
    store <16 x i32> %1, <16 x i32>* %arg, align 256
    ret <4 x i32> %shvec
  }

I think the problem is that there has to be multiple uses of the loaded value (`%0`) in order for the DAG combine to trigger.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D110524/new/

https://reviews.llvm.org/D110524



More information about the llvm-commits mailing list