[PATCH] D51042: [NVPTX] Remove ftz variants of cvt with rounding mode

Benjamin Kramer via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Aug 21 10:49:41 PDT 2018


bkramer added a comment.

In https://reviews.llvm.org/D51042#1207769, @tra wrote:

> This is a surprise. PTX ISA does not mention that .ftz is not applicable to `cvt.*.f16.*` instructions. 
>  Is it only `cvt` that does not support .ftz or does it impact other instructions?  PTX spec has add/sub/mul/fma/set/setp instructions that support f16 and have .ftz variant.


It's only cvt with an explicit rounding mode. I actually ran the output of f16-instructions.ll with FTZ through ptxas and removed instructions until it compiled it. This might even be a bug in ptxas.


Repository:
  rL LLVM

https://reviews.llvm.org/D51042





More information about the llvm-commits mailing list