[PATCH] D11926: [NVPTX] Use 32-bit divides instead of 64-bit divides where possible

Jingyue Wu via llvm-commits llvm-commits at lists.llvm.org
Mon Aug 10 20:25:18 PDT 2015


jingyue added a comment.

Does any Eigen3 kernel (https://bitbucket.org/eigen/eigen/src/890ac1744b090c8de30aba2a33f4393e049d1559/unsupported/Eigen/CXX11/src/Tensor/?at=default) benefit from this improvement? If so, we can report some numbers there, so that people can understand how important this is for real-world CUDA programs.

Also, can you come up with some llc tests?

Thanks!


http://reviews.llvm.org/D11926





More information about the llvm-commits mailing list