[llvm] [AMDGPU] Support D16 folding for image.sample with multiple extractelement and fptrunc users (PR #141758)
Harrison Hao via llvm-commits
llvm-commits at lists.llvm.org
Mon Jun 16 23:28:42 PDT 2025
================
@@ -270,6 +270,62 @@ simplifyAMDGCNImageIntrinsic(const GCNSubtarget *ST,
});
}
}
+
+ // Only perform D16 folding if every user of the image sample is
+ // an ExtractElementInst immediately followed by an FPTrunc to half.
+ SmallVector<ExtractElementInst *, 4> Extracts;
+ SmallVector<FPTruncInst *, 4> Truncs;
----------------
harrisonGPU wrote:
Thanks, I have updated it.
https://github.com/llvm/llvm-project/pull/141758
More information about the llvm-commits
mailing list