[PATCH] D140846: [NVPTX] Fix NVPTX lowering of frem when denominator is infinite.

Benjamin Chetioui via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Jan 4 01:00:41 PST 2023


bchetioui marked 2 inline comments as done.
bchetioui added a comment.

Thanks for the review, @tra and @herhut!



================
Comment at: llvm/lib/Target/NVPTX/NVPTXInstrInfo.td:604
+
+// FIXME: Missing slct
+
----------------
tra wrote:
> Nit: It's more of a TODO, IMO. :-)
> 
> I wonder if the instruction actually provides any benefit over `cmp`+`selp` on the SASS level. I suspect that it probably does not, and implementing it would just give us a bit nicer PTX w/o much of an effect on the actual GPU code.
Done. (Though this is not really part of the change, I moved this block from below to satisfy requirements re. definition order. :-))


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D140846/new/

https://reviews.llvm.org/D140846



More information about the llvm-commits mailing list