[llvm] [NVPTX] support packed f32 instructions for sm_100+ (PR #126337)
Princeton Ferro via llvm-commits
llvm-commits at lists.llvm.org
Sat Feb 8 13:30:44 PST 2025
Prince781 wrote:
@AlexMaclean what led me to this implementation (and I did try it the other way) is that v2f16 and v2bf16 are supported by many kinds of instructions, so it makes more sense to legalize these types than v2f32. My concern is whether this feature should change code that uses f32 vectors but avoids these operations. Legalizing this type requires me to change some things in how we lower instructions, like loads and stores (for example, if we don't want `ld.v2.f32` to always become `ld.b64`), override a few things in TLI, etc.
See the test cases for more examples. If this is not a concern, then I can implement it the other way.
https://github.com/llvm/llvm-project/pull/126337
More information about the llvm-commits
mailing list