[PATCH] D74629: AMDGPU/GlobalISel: Improve 16-bit bswap

Jay Foad via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Sun Feb 16 01:14:54 PST 2020


foad added inline comments.


================
Comment at: llvm/test/CodeGen/AMDGPU/GlobalISel/bswap.ll:348
+; GFX7-NEXT:    s_or_b32 s0, s0, s1
+; GFX7-NEXT:    s_bfe_u32 s0, s0, 0x100000
 ; GFX7-NEXT:    ; return to shader part epilog
----------------
Why do we get masking both before and after the operation (s_and and s_bfe)? It seems like only one or the other should be required, depending on whether the upper bits of the register are undefined or defined to be zero.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D74629/new/

https://reviews.llvm.org/D74629





More information about the llvm-commits mailing list