[PATCH] D38645: [NVPTX] Implemented wmma intrinsics and instructions.

Yuan Lin via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Oct 11 12:01:51 PDT 2017


YuanLin added a comment.

We took this approach to reduce the number of intrinsic functions that opt and code-gen has to deal with, for example to have one ld_a_f16 instead of 12. It simplifies our code logic. Take the address space optimization for an example, when we translate a generic load to specific load, we can just change the pointer type. The rests are just copied over.


https://reviews.llvm.org/D38645





More information about the llvm-commits mailing list