[PATCH] D11089: [NVPTX] declare no vector registers
Justin Holewinski
jholewinski at nvidia.com
Thu Jul 9 16:33:30 PDT 2015
jholewinski added a comment.
The short answer is that ptxas doesn't handle vector registers very well. It may be good to revisit this, but ptxas currently prefers scalar ops.
Last time I looked into this, the implementation cost greatly outweighed any potential benefits. At the SASS level, we don't have vector fp ops anyway.
http://reviews.llvm.org/D11089
More information about the llvm-commits
mailing list