[PATCH] D81172: [AMDGPU] Implement hardware bug workaround for image instructions
    Rodrigo Dominguez via Phabricator via llvm-commits 
    llvm-commits at lists.llvm.org
       
    Thu Sep 10 06:55:53 PDT 2020
    
    
  
rdomingu added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp:3411-3415
+      auto Unmerge = B.buildUnmerge(S16, Reg);
+      for (int I = 0, E = Unmerge->getNumOperands() - 1; I != E; ++I)
+        PackedRegs.push_back(Unmerge.getReg(I));
+      PackedRegs.resize(8, B.buildUndef(S16).getReg(0));
+      Reg = B.buildBuildVector(LLT::vector(8, S16), PackedRegs).getReg(0);
----------------
arsenm wrote:
> It would be preferable to emit a concat_vectors of <2 x s16> pieces here
Sorry, I'm new to this. Why would concat_vectors be preferable than build_vector? Could you please elaborate?
Repository:
  rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D81172/new/
https://reviews.llvm.org/D81172
    
    
More information about the llvm-commits
mailing list