[llvm] [AMDGPU] Improve AMDGPU sqrt and inverse sqrt handling for bf16 (PR #180291)

Matt Arsenault via llvm-commits llvm-commits at lists.llvm.org
Mon Feb 9 08:41:37 PST 2026


================
@@ -13184,6 +13187,45 @@ SDValue SITargetLowering::lowerFSQRTF64(SDValue Op, SelectionDAG &DAG) const {
                      Flags);
 }
 
+SDValue SITargetLowering::lowerFSQRTBF16(SDValue Op, SelectionDAG &DAG) const {
----------------
arsenm wrote:

This mostly duplicates the f32 path. Can you merge the two into one function?

https://github.com/llvm/llvm-project/pull/180291

