[PATCH] D153472: AMDGPU: Correctly expand f64 sqrt intrinsic
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Jul 21 04:54:45 PDT 2023
arsenm added inline comments.
================
Comment at: llvm/test/CodeGen/AMDGPU/fsqrt.f64.ll:44
-; GCN-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
-; GCN-NEXT: v_sqrt_f64_e64 v[0:1], -|v[0:1]|
-; GCN-NEXT: s_setpc_b64 s[30:31]
----------------
Pierre-vh wrote:
> I think I need some context, why is `v_sqrt_f64` so bad that this expansion is preferred? Accuracy/semantics?
It's nowhere near accurate enough; I don't really know what the point of the instruction is. The raw instruction definitely fails OpenCL conformance.
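
For readers who want a concrete picture of the accuracy concern, here is a minimal sketch. It is not the expansion implemented in this patch and it does not model the real `v_sqrt_f64` hardware. The names `approx_sqrt`, `refine_sqrt` and `ulp_error` are hypothetical, and the low-accuracy instruction is only simulated by rounding a result to single precision. It shows how a rough square root estimate can be many f64 ULPs off, and how a Newton (Heron) step in software pulls it back towards the correctly rounded value.

```python
import math
import struct


def approx_sqrt(x: float) -> float:
    """Hypothetical stand-in for a low-accuracy sqrt instruction.

    Assumption: we simulate the inaccuracy by rounding the result to
    float32. This is NOT how v_sqrt_f64 behaves; it only gives us an
    estimate that is far from correctly rounded in double precision.
    """
    return struct.unpack("<f", struct.pack("<f", math.sqrt(x)))[0]


def refine_sqrt(x: float, y: float) -> float:
    """One Newton (Heron) step for sqrt: y' = y + (x - y*y) / (2*y)."""
    return y + (x - y * y) / (2.0 * y)


def ulp_error(x: float, y: float) -> float:
    """Distance of y from Python's math.sqrt(x), measured in f64 ULPs."""
    ref = math.sqrt(x)
    return abs(y - ref) / math.ulp(ref)


if __name__ == "__main__":
    x = 2.0
    rough = approx_sqrt(x)
    once = refine_sqrt(x, rough)
    twice = refine_sqrt(x, once)
    print("rough estimate :", ulp_error(x, rough), "ULP")
    print("after 1 step   :", ulp_error(x, once), "ULP")
    print("after 2 steps  :", ulp_error(x, twice), "ULP")
```

A conformance-style test would compare the result against a reference in ULPs in much the same way as `ulp_error` does here; the tolerance it has to meet is set by the OpenCL specification, not by this sketch.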
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D153472/new/
https://reviews.llvm.org/D153472