[llvm] Reland "[NVPTX] Add folding for cvt.rn.bf16x2.f32" (PR #116417)
Artem Belevich via llvm-commits
llvm-commits at lists.llvm.org
Mon Nov 18 10:53:19 PST 2024
Artem-B wrote:
> Yea, inconsistent argument ordering in PTX does make things tricky. I've switched the patterns in this MR to use `lo` and `hi` as you suggested.
Tests could use the same treatment -- without looking up the docs for nvvm intrinsics it's hard to tell where a/b or f1/f2 are supposed to end up in the instruction arguments. While now I can be confident that the tablegen pattern is correct, I still would have hard time telling if the test patterns check what we want them to check.
https://github.com/llvm/llvm-project/pull/116417
More information about the llvm-commits
mailing list