[libc-commits] [PATCH] D153395: Populating 'libmgpu.a' for math on the GPU

Joseph Huber via Phabricator via libc-commits libc-commits at lists.llvm.org
Mon Jul 31 16:56:39 PDT 2023


jhuber6 added a subscriber: lntue.
jhuber6 added a comment.

In D153395#4548963 <https://reviews.llvm.org/D153395#4548963>, @arsenm wrote:

> Looks a branchy/early-returny in ways the optimizer isn't aggressive enough at speculating

Yeah, probably something we could ask @lntue about. Looking at https://github.com/RadeonOpenCompute/ROCm-Device-Libs/blob/c1a736ae458f49e526932b3da611f6bd571a1c47/ocml/src/nextafterD.cl#L4 it looks much less branchy, presumably all the ternaries will get put into predicate registers for AMDGPU / NVPTX.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D153395/new/

https://reviews.llvm.org/D153395



More information about the libc-commits mailing list