[llvm] [NVPTX] support packed f32 instructions for sm_100+ (PR #126337)

Princeton Ferro via llvm-commits llvm-commits at lists.llvm.org
Tue Jul 22 11:36:53 PDT 2025


Prince781 wrote:

@AlexMaclean I don't think we need to revert, as this change exposes code that was technically already broken. We need to improve the semantics of `wgmma.mma_async` with an intrinsic. For now, there are source-level workarounds that other library vendors use.

https://github.com/llvm/llvm-project/pull/126337

