[all-commits] [llvm/llvm-project] a738bd: AMDGPU: Permit more rsq formation in AMDGPUCodeGen...
Matt Arsenault via All-commits
all-commits at lists.llvm.org
Wed Aug 23 17:07:05 PDT 2023
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: a738bdf35eaa3bb48b7d8022e2aaa352cff909ba
https://github.com/llvm/llvm-project/commit/a738bdf35eaa3bb48b7d8022e2aaa352cff909ba
Author: Matt Arsenault <Matthew.Arsenault at amd.com>
Date: 2023-08-23 (Wed, 23 Aug 2023)
Changed paths:
M llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp
M llvm/test/CodeGen/AMDGPU/amdgpu-codegenprepare-fdiv.ll
M llvm/test/CodeGen/AMDGPU/fdiv_flags.f32.ll
M llvm/test/CodeGen/AMDGPU/rsq.f32.ll
Log Message:
-----------
AMDGPU: Permit more rsq formation in AMDGPUCodeGenPrepare
We were basing the decision to defer the fast case to codegen on the
fdiv itself, and not looking for a foldable sqrt input.
https://reviews.llvm.org/D158127
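For illustration only, the source-level shape of the pattern discussed here is a division whose denominator is a square root. The sketch below is a minimal C++ example; the helper name inv_sqrt is hypothetical and not part of the commit. Whether the backend actually folds this into a single rsq instruction depends on the floating-point flags and accuracy requirements in effect, which this message does not spell out.

```cpp
#include <cmath>

// Hypothetical helper, not taken from the commit: a reciprocal square root
// written as an fdiv whose denominator is a sqrt. This is the kind of
// "foldable sqrt input" an fdiv can have; on AMDGPU it is a candidate for
// rsq formation when the floating-point semantics permit it (assumption).
float inv_sqrt(float x) {
    return 1.0f / std::sqrt(x);
}
```

Evaluated on the host, inv_sqrt(4.0f) simply computes 1 / 2; the sketch says nothing about the precision of the hardware instruction.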