[all-commits] [llvm/llvm-project] 4756a2: [X86] Insert FMUL for estimated non reciprocal SQR...

Phoebe Wang via All-commits all-commits at lists.llvm.org
Wed Dec 1 21:53:02 PST 2021


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 4756a2f1575cd78197f00ffdaa1491f9ec78c576
      https://github.com/llvm/llvm-project/commit/4756a2f1575cd78197f00ffdaa1491f9ec78c576
  Author: Phoebe Wang <phoebe.wang at intel.com>
  Date:   2021-12-02 (Thu, 02 Dec 2021)

  Changed paths:
    M llvm/lib/Target/X86/X86ISelLowering.cpp
    M llvm/test/CodeGen/X86/sqrt-fastmath.ll

  Log Message:
  -----------
  [X86] Insert FMUL for estimated non reciprocal SQRT when `RefinementSteps` = 0

Reviewed By: spatel

Differential Revision: https://reviews.llvm.org/D114843
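
The identity behind the extra multiply can be sketched in plain C++. This is an
illustration only, not the code in X86ISelLowering.cpp: `rsqrt_estimate` is a
hypothetical stand-in for a hardware reciprocal-square-root estimate, modelled
here as the exact value with an assumed small relative error, and
`sqrt_from_rsqrt` is a hypothetical name. With zero refinement steps, the
estimate of sqrt(x) is obtained as x * rsqrt(x), which is where the FMUL comes
from. The sketch assumes x > 0.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical stand-in for a hardware rsqrt estimate: the exact 1/sqrt(x)
// perturbed by an assumed relative error of 1e-4 (not a real instruction's
// documented error bound).
static float rsqrt_estimate(float x) {
  assert(x > 0.0f && "sketch only covers positive inputs");
  return (1.0f / std::sqrt(x)) * (1.0f + 1.0e-4f);
}

// Non-reciprocal sqrt from the estimate with zero refinement steps:
// sqrt(x) == x * (1 / sqrt(x)), so one multiply is still required.
static float sqrt_from_rsqrt(float x) {
  return x * rsqrt_estimate(x);
}
```

Calling `sqrt_from_rsqrt(4.0f)` returns a value close to 2, carrying the same
relative error as the underlying estimate.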


  Commit: f13b43d5702bed381bb1717e2336f7f23ae722b4
      https://github.com/llvm/llvm-project/commit/f13b43d5702bed381bb1717e2336f7f23ae722b4
  Author: Phoebe Wang <phoebe.wang at intel.com>
  Date:   2021-12-02 (Thu, 02 Dec 2021)

  Changed paths:
    M llvm/lib/Target/X86/X86ISelLowering.cpp
    M llvm/test/CodeGen/X86/avx512fp16-intrinsics.ll
    M llvm/test/CodeGen/X86/avx512fp16vl-intrinsics.ll

  Log Message:
  -----------
  [X86][FP16] Only generate approximate rsqrt when Reciprocal is true for half type

We have a reasonably fast sqrt and an accurate rsqrt for the half type
because of its limited fraction bits. So we need neither multi-step
refinement for rsqrt nor replacement of sqrt by rsqrt.

Reviewed By: RKSimon

Differential Revision: https://reviews.llvm.org/D114844
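
For context, "refinement" of an rsqrt estimate usually means Newton-Raphson
iteration. The sketch below shows the standard step in plain C++; it is an
assumption that this is the form meant here, and the function names
`refine_rsqrt` and `rsqrt_refined` are hypothetical, not LLVM APIs. Each step
roughly squares the relative error of the estimate, which is why an estimate
that is already accurate to the precision of the type gains nothing from
further steps.

```cpp
#include <cmath>

// One Newton-Raphson step for y ~ 1/sqrt(x):
//   y1 = y0 * (1.5 - 0.5 * x * y0 * y0)
static float refine_rsqrt(float x, float y0) {
  return y0 * (1.5f - 0.5f * x * y0 * y0);
}

// Apply `steps` refinement iterations to an initial estimate y0.
// With steps == 0 the estimate is returned unchanged.
static float rsqrt_refined(float x, float y0, int steps) {
  float y = y0;
  for (int i = 0; i < steps; ++i)
    y = refine_rsqrt(x, y);
  return y;
}
```

For example, starting from an estimate of 0.505 for 1/sqrt(4), a single step
moves the result much closer to 0.5.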


Compare: https://github.com/llvm/llvm-project/compare/fcd2d85cc943...f13b43d5702b

