[PATCH] D81172: [AMDGPU] Implement hardware bug workaround for image instructions
Rodrigo Dominguez via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Sep 10 06:55:53 PDT 2020
rdomingu added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp:3411-3415
+ auto Unmerge = B.buildUnmerge(S16, Reg);
+ for (int I = 0, E = Unmerge->getNumOperands() - 1; I != E; ++I)
+ PackedRegs.push_back(Unmerge.getReg(I));
+ PackedRegs.resize(8, B.buildUndef(S16).getReg(0));
+ Reg = B.buildBuildVector(LLT::vector(8, S16), PackedRegs).getReg(0);
----------------
arsenm wrote:
> It would be preferable to emit a concat_vectors of <2 x s16> pieces here
Sorry, I'm new to this. Why would concat_vectors be preferable than build_vector? Could you please elaborate?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D81172/new/
https://reviews.llvm.org/D81172
More information about the llvm-commits
mailing list