[PATCH] D144729: [AMDGPU] Select v_sat_pk_u8_i16

Thu Mar 30 01:52:45 PDT 2023

Pierre-vh added inline comments.

================
Comment at: llvm/lib/Target/AMDGPU/SIInstructions.td:2931
+  def: GCNPat<
+    (v2i16 (DivergentBinFrag<build_vector> (clamp_s16_u8 i16:$lo), (clamp_s16_u8 i16:$hi))),
+    (inst
----------------
foad wrote:
> Looking at this again, I don't think these patterns match what the instruction does. The instruction puts the two 8-bit results in bits [15..8] and [7..0], not in bits [23..16] and [7..0].
Oh I see, you're right. In that case all of the patterns are wrong, and I'm not sure what the correct pattern should be.
Maybe it needs to match an additional trunc to v2i8 after the build_vector?
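
To make the layout mismatch concrete, here is a small standalone C++ sketch (not LLVM code). It assumes the instruction behaves as described in the quoted comment, with the two saturated bytes landing in bits [15..8] and [7..0]. The helper names `clampS16ToU8`, `patternLayout` and `instructionLayout` are made up for illustration.

```cpp
#include <cstdint>

// Saturate a signed 16-bit value to the unsigned 8-bit range [0, 255].
constexpr uint32_t clampS16ToU8(int16_t v) {
  return v < 0 ? 0u : (v > 255 ? 255u : static_cast<uint32_t>(v));
}

// Layout implied by the current pattern: a v2i16 build_vector keeps each
// clamped value in its own 16-bit lane, so the bytes sit in bits [7..0]
// and [23..16] of the 32-bit register.
constexpr uint32_t patternLayout(int16_t lo, int16_t hi) {
  return clampS16ToU8(lo) | (clampS16ToU8(hi) << 16);
}

// Layout described in the quoted comment (assumed here): the two bytes are
// packed next to each other in bits [7..0] and [15..8].
constexpr uint32_t instructionLayout(int16_t lo, int16_t hi) {
  return clampS16ToU8(lo) | (clampS16ToU8(hi) << 8);
}

// lo = 0x12 stays 0x12; hi = 0x134 saturates to 0xFF.
static_assert(patternLayout(0x12, 0x134) == 0x00FF0012u, "16-bit lanes");
static_assert(instructionLayout(0x12, 0x134) == 0x0000FF12u, "packed bytes");
```

The two results only agree when the clamped high value is zero, which is why a pattern that produces a plain v2i16 cannot be selected to this instruction as-is.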

Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D144729/new/

https://reviews.llvm.org/D144729