[PATCH] D19990: [CUDA] Implement __ldg using intrinsics.
Justin Lebar via cfe-commits
cfe-commits at lists.llvm.org
Tue May 17 10:49:26 PDT 2016
jlebar added a comment.
Friendly ping. This is a big help with some Tensorflow benchmarks.
http://reviews.llvm.org/D19990
More information about the cfe-commits
mailing list