[PATCH] D38645: [NVPTX] Implemented wmma intrinsics and instructions.
Yuan Lin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Oct 11 12:01:51 PDT 2017
YuanLin added a comment.
We took this approach to reduce the number of intrinsic functions that opt and code-gen have to deal with, for example having one ld_a_f16 instead of 12. It simplifies our code logic. Take address space optimization as an example: when we translate a generic load to a specific load, we can just change the pointer type. The rest is just copied over.
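To illustrate the point about address space optimization, here is a minimal, self-contained sketch. It is a toy model and not the code in this patch or LLVM's API: the names WmmaLoadCall, AddrSpace, mangledName and specialize, as well as the ".global"/".shared" suffixes, are assumptions made up for the example. It only shows that when a single load is overloaded on its pointer type, specializing a generic load comes down to changing one field.

```cpp
#include <cassert>
#include <string>

// Hypothetical model (not LLVM's API): the address space carried by the
// pointer operand's type.
enum class AddrSpace { Generic, Global, Shared };

// A toy "intrinsic call": one base name, overloaded on the pointer type.
struct WmmaLoadCall {
    std::string base;    // e.g. "ld_a_f16", the name used in the comment above
    std::string layout;  // stands in for all remaining operands (opaque here)
    AddrSpace ptrSpace;  // part of the pointer operand's type
};

// Suffix derived from the pointer type; the spelling is an assumption.
inline std::string suffix(AddrSpace as) {
    switch (as) {
    case AddrSpace::Generic: return "";
    case AddrSpace::Global:  return ".global";
    case AddrSpace::Shared:  return ".shared";
    }
    return "";
}

// The overloaded name is computed from the base name plus the pointer type,
// so no separate per-address-space intrinsic has to be declared.
inline std::string mangledName(const WmmaLoadCall &c) {
    return c.base + suffix(c.ptrSpace);
}

// Address space specialization: only the pointer type changes, every other
// field is copied over unchanged.
inline WmmaLoadCall specialize(const WmmaLoadCall &c, AddrSpace inferred) {
    assert(c.ptrSpace == AddrSpace::Generic && "only generic loads are specialized");
    WmmaLoadCall out = c;     // copy everything
    out.ptrSpace = inferred;  // change just the pointer type
    return out;
}
```

With separate intrinsics per address space, the same rewrite would instead need a lookup from each generic intrinsic to its specific counterpart.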
https://reviews.llvm.org/D38645