[llvm] [NVPTX] support packed f32 instructions for sm_100+ (PR #126337)

Alex MacLean via llvm-commits llvm-commits at lists.llvm.org
Tue Jun 10 09:30:41 PDT 2025


================
@@ -1245,7 +1251,9 @@ bool NVPTXDAGToDAGISel::tryLDGLDU(SDNode *N) {
     EltVT = EltVT.getVectorElementType();
     // vectors of 8/16bits type are loaded/stored as multiples of v4i8/v2x16
     // elements.
-    if ((EltVT == MVT::f16 && OrigType == MVT::v2f16) ||
+    // Packed vector types are loaded/stored in a single register.
----------------
AlexMaclean wrote:

I suppose this is fine but it seems like the whole chunk of code that this is part of should be removed. We should be doing this more like how we select loads and stores where the number of elements and type are derived from the types the node produces.

https://github.com/llvm/llvm-project/pull/126337


More information about the llvm-commits mailing list