[PATCH] D109889: AMDGPU: Broadcast scalar boolean to vector boolean explicitly

Jay Foad via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Sep 28 03:16:53 PDT 2021


foad accepted this revision.
foad added a comment.
This revision is now accepted and ready to land.

I think this is OK given that it fixes a bug, and it moves us in the direction of generating explicit bit-broadcasting code instead of relying on the behaviour of i1 COPY instructions.



================
Comment at: llvm/lib/Target/AMDGPU/SIISelLowering.cpp:4173
+        MRI.getRegClass(CarryDest.getReg());
+    unsigned SelOpc = (TRI->getRegSizeInBits(*CarryDestRC) == 64)
+                          ? AMDGPU::S_CSELECT_B64
----------------
Maybe remember the reg size calculated on line 4141, and reuse it here?


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D109889/new/

https://reviews.llvm.org/D109889



More information about the llvm-commits mailing list