[all-commits] [llvm/llvm-project] 2a0db8: AMDGPU: Use more accurate fast f64 fdiv

Matt Arsenault via All-commits all-commits at lists.llvm.org
Thu Jan 21 07:52:06 PST 2021


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 2a0db8d70eeb0c4c09e4c91b365630eefbbf3993
      https://github.com/llvm/llvm-project/commit/2a0db8d70eeb0c4c09e4c91b365630eefbbf3993
  Author: Matt Arsenault <Matthew.Arsenault at amd.com>
  Date:   2021-01-21 (Thu, 21 Jan 2021)

  Changed paths:
    M llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp
    M llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp
    M llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.h
    M llvm/lib/Target/AMDGPU/SIISelLowering.cpp
    M llvm/lib/Target/AMDGPU/SIISelLowering.h
    M llvm/lib/Target/AMDGPU/SIInstructions.td
    M llvm/test/CodeGen/AMDGPU/GlobalISel/fdiv.f64.ll
    M llvm/test/CodeGen/AMDGPU/GlobalISel/frem.ll
    M llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-fdiv.mir
    M llvm/test/CodeGen/AMDGPU/fdiv.f64.ll
    M llvm/test/CodeGen/AMDGPU/frem.ll
    M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.rcp.ll
    M llvm/test/CodeGen/AMDGPU/rsq.ll

  Log Message:
  -----------
  AMDGPU: Use more accurate fast f64 fdiv

A raw v_rcp_f64 isn't accurate enough, so start applying correction.
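
As an illustration of what "applying correction" to a reciprocal estimate can look like, here is a minimal sketch of the general technique: Newton-Raphson refinement of the reciprocal, followed by a residual fix-up of the quotient. It is not taken from the commit's diff. The names `approx_rcp` and `fast_fdiv`, the 26-bit precision of the simulated estimate, and the number of refinement steps are all assumptions made for the example, and plain multiply/add stands in for the fused multiply-add a real lowering would likely use.

```python
import math


def approx_rcp(y: float, bits: int = 26) -> float:
    """Hypothetical stand-in for a low-accuracy hardware reciprocal.

    Computes 1/y and truncates the mantissa to `bits` bits. The 26-bit
    figure is an assumption for illustration, not a hardware spec.
    No special handling for y == 0, infinities, or NaN.
    """
    r = 1.0 / y
    m, e = math.frexp(r)            # r == m * 2**e, with 0.5 <= |m| < 1
    scale = 2.0 ** bits
    return math.ldexp(math.trunc(m * scale) / scale, e)


def fast_fdiv(x: float, y: float, steps: int = 2) -> float:
    """Approximate x / y from a rough reciprocal plus correction."""
    r = approx_rcp(y)
    for _ in range(steps):
        err = 1.0 - y * r           # how far y*r is from 1
        r = r + r * err             # Newton-Raphson step for 1/y
    q = x * r                       # first quotient estimate
    rem = x - y * q                 # residual of that estimate
    return q + rem * r              # final correction of the quotient


if __name__ == "__main__":
    x, y = 1.0, 3.0
    exact = x / y
    raw = x * approx_rcp(y)
    print("raw relative error:      ", abs(raw - exact) / abs(exact))
    print("corrected relative error:", abs(fast_fdiv(x, y) - exact) / abs(exact))
```

Each Newton-Raphson step roughly squares the relative error of the reciprocal, which is why a small, fixed number of steps is enough to bring a coarse estimate close to full double precision.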


  Commit: 94375d1083ccc9187c2502894f1dad62d9dd92b9
      https://github.com/llvm/llvm-project/commit/94375d1083ccc9187c2502894f1dad62d9dd92b9
  Author: Matt Arsenault <Matthew.Arsenault at amd.com>
  Date:   2021-01-21 (Thu, 21 Jan 2021)

  Changed paths:
    M llvm/lib/Target/AMDGPU/SIInstructions.td
    M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.rcp.ll
    M llvm/test/CodeGen/AMDGPU/rsq.ll

  Log Message:
  -----------
  AMDGPU: Remove v_rsq_f64 patterns

This isn't accurate enough without correction.


Compare: https://github.com/llvm/llvm-project/compare/48c54f0f6234...94375d1083cc

