[clang] [llvm] [mlir] [AMDGPU] add clamp immediate operand to WMMA iu8 intrinsic (PR #171069)
Muhammad Abdul via llvm-commits
llvm-commits at lists.llvm.org
Tue Dec 9 07:59:10 PST 2025
0xzre wrote:
> This can't just break existing uses. Needs to implement bitcode autoupgrade, and some compatibility on the builtin signature.
>
> Also, can we just infer clamping as an optimization fold based on the use context?
Good catch. I'll add a bitcode autoupgrade for the old 4-operand WMMA IU8 intrinsic that appends clamp=0, plus a lit bitcode test. For source compatibility, the existing builtin spelling will stay mapped to clamp=0, and I'll add the 5-operand form alongside it. Clamp inference can be a follow-up combine, I think; this patch stays focused on correctness and compatibility. Would that be okay, @arsenm?
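
Roughly, the upgrade rule I have in mind looks like the toy model below. It is only a sketch: `ToyCall` and `upgradeToyWmmaIu8` are hypothetical stand-ins and not LLVM APIs, the operand counts are the 4 and 5 mentioned above, and the real change would also match on the intrinsic's name and rebuild the call through LLVM's own autoupgrade machinery.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for an intrinsic call site; not an LLVM class.
struct ToyCall {
  std::vector<std::int64_t> operands;
};

// Operand counts as described in this comment (old form vs. new form).
constexpr std::size_t kOldOperandCount = 4;
constexpr std::size_t kNewOperandCount = 5;

// Appends clamp=0 to a call that still uses the old operand list.
// Returns true if the call was rewritten, false if it was left alone.
bool upgradeToyWmmaIu8(ToyCall &call) {
  if (call.operands.size() != kOldOperandCount)
    return false; // already in the new form (or not something we handle)
  call.operands.push_back(0); // clamp = 0, assumed to match the old behavior
  return call.operands.size() == kNewOperandCount;
}
```

Calls that already carry an explicit clamp operand pass through untouched, so only old bitcode is rewritten.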
https://github.com/llvm/llvm-project/pull/171069