[PATCH] D11089: [NVPTX] declare no vector registers

Justin Holewinski jholewinski at nvidia.com
Thu Jul 9 16:33:30 PDT 2015


jholewinski added a comment.

The short answer is that ptxas doesn't handle vector registers very well. It may be good to revisit this, but ptxas currently prefers scalar ops.

Last time I looked into this, the implementation cost greatly outweighed any potential benefits. At the SASS level, we don't have vector fp ops anyway.


http://reviews.llvm.org/D11089







More information about the llvm-commits mailing list