[all-commits] [llvm/llvm-project] 4af8db: [AMDGPU] performCvtF32UByteNCombine - add SHL and ...

Wed Feb 19 03:53:22 PST 2020

  Branch: refs/heads/master
  Home:   https://github.com/llvm/llvm-project
  Commit: 4af8db317d2ac3f21b859e43a3240c024457a95d
      https://github.com/llvm/llvm-project/commit/4af8db317d2ac3f21b859e43a3240c024457a95d
  Author: Simon Pilgrim <llvm-dev at redking.me.uk>
  Date:   2020-02-19 (Wed, 19 Feb 2020)

  Changed paths:
    M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
    M llvm/test/CodeGen/AMDGPU/cvt_f32_ubyte.ll
    M llvm/test/CodeGen/AMDGPU/udiv64.ll

  Log Message:
  -----------
  [AMDGPU] performCvtF32UByteNCombine - add SHL and SimplifyMultipleUseDemandedBits support

This is part of the work to remove SelectionDAG::GetDemandedBits and just use SimplifyMultipleUseDemandedBits.

Recent experiments raised some v_cvt_f32_ubyte*_e32 regressions, so I've added some additional abilities to performCvtF32UByteNCombine to help unpack byte data more aggressively.

We still don't remove all OR(SHL,SRL) patterns as some of the regenerated nodes don't get combined again, but we are getting closer.

Differential Revision: https://reviews.llvm.org/D74786