[PATCH] D126158: [MLIR][GPU] Replace fdiv on fp16 with promoted (fp32) multiplication with reciprocal plus one (conditional) Newton iteration.
Stephan Herhut via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Tue May 31 01:42:02 PDT 2022
herhut accepted this revision.
herhut added a comment.
This revision is now accepted and ready to land.
Separate pass works for me.
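For readers skimming the thread, here is a rough sketch of the rewrite named in the revision title, emulated in plain Python. It is not the patch's code. The helper names (`to_f32`, `to_f16`, `approx_rcp`, `fdiv_f16`) are made up for illustration, `approx_rcp` is only a stand-in for a hardware approximate reciprocal, and the condition for skipping the Newton step (result exponent all zeros or all ones) is my assumption about what "conditional" means here. The two fused multiply-adds are emulated in double precision, so rounding may differ slightly from a real fma.

```python
import math
import struct

EXPONENT_MASK = 0x7F800000  # exponent bits of an IEEE binary32 value


def to_f32(x):
    """Round a Python float to binary32 (overflow becomes +/-inf)."""
    try:
        return struct.unpack("<f", struct.pack("<f", x))[0]
    except OverflowError:
        return math.copysign(math.inf, x)


def to_f16(x):
    """Round a Python float to binary16 (overflow becomes +/-inf)."""
    try:
        return struct.unpack("<e", struct.pack("<e", x))[0]
    except OverflowError:
        return math.copysign(math.inf, x)


def f32_bits(x):
    """Return the binary32 bit pattern of x as an unsigned integer."""
    return struct.unpack("<I", struct.pack("<f", x))[0]


def approx_rcp(b):
    """Hypothetical stand-in for an approximate fp32 reciprocal."""
    if b == 0.0:
        return math.copysign(math.inf, b)
    return to_f32(1.0 / b)


def fdiv_f16(a, b):
    """Divide two fp16 values via fp32 multiply-by-reciprocal."""
    # Promote: every fp16 value is exactly representable in fp32.
    lhs, rhs = to_f16(a), to_f16(b)
    rcp = approx_rcp(rhs)
    approx = to_f32(lhs * rcp)

    exp_bits = f32_bits(approx) & EXPONENT_MASK
    if exp_bits == 0 or exp_bits == EXPONENT_MASK:
        # Assumed condition: zero/denormal or inf/NaN results skip refinement.
        result = approx
    else:
        # One Newton step: residual, then correction scaled by the reciprocal.
        err = to_f32(lhs - approx * rhs)     # emulates fma(approx, -rhs, lhs)
        result = to_f32(approx + err * rcp)  # emulates fma(err, rcp, approx)
    return to_f16(result)
```

For example, `fdiv_f16(1.0, 3.0)` gives the same fp16 value as rounding `1.0 / 3.0` directly, and `fdiv_f16(1.0, 0.0)` gives infinity without entering the refinement branch.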
================
Comment at: mlir/include/mlir/Dialect/LLVMIR/Transforms/Passes.td:19
+def NVVMOptimize : Pass<"nvvm-optimize"> {
+ let summary = "Optimize NVVM IR";
----------------
Maybe `llvm-optimize-for-nvvm`? Or even `llvm-optimize-for-nvvm-target`?
This pass does not really optimize `nvvm`; it rewrites `llvm` IR.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D126158/new/
https://reviews.llvm.org/D126158