[PATCH] D154517: AMDGPU: Always use v_rcp_f16 and v_rsq_f16

Matt Arsenault via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Jul 6 14:40:29 PDT 2023


arsenm added a comment.

In D154517#4476671 <https://reviews.llvm.org/D154517#4476671>, @foad wrote:

>> Brute force produces identical values compared to a reference host implementation for all values.
>
> Have you tested v_sqrt_f16 or any other f16 trans instructions?

Haven't gotten there yet


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D154517/new/

https://reviews.llvm.org/D154517



More information about the llvm-commits mailing list