[all-commits] [llvm/llvm-project] 375b70: [libclc] Improve performance and precision of reci...

Joseph Huber via All-commits all-commits at lists.llvm.org
Sun Jun 14 18:32:21 PDT 2026


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 375b70bafedafa71fabfb0e8b1e759daeef0f703
      https://github.com/llvm/llvm-project/commit/375b70bafedafa71fabfb0e8b1e759daeef0f703
  Author: Joseph Huber <huberjn at outlook.com>
  Date:   2026-06-14 (Sun, 14 Jun 2026)

  Changed paths:
    M libclc/clc/lib/amdgpu/CMakeLists.txt
    A libclc/clc/lib/amdgpu/math/clc_recip_fast.cl
    A libclc/clc/lib/amdgpu/math/clc_recip_fast.inc
    M libclc/clc/lib/generic/CMakeLists.txt

  Log Message:
  -----------
  [libclc] Improve performance and precision of reciprocal functions (#203805)

Summary:
Small change to improve the performance and output of functions using
the reciprocal. This makes these functions *byte-for-byte* identical
with their OCML counterparts in ROCm.

Additionally, ensure tanpi is correctly rounded to match ROCm.


