[llvm] [AMDGPU][True16] Generate correct reg size for reg_sequence16 in wmma src mod select (PR #187629)
Guo Chen via llvm-commits
llvm-commits at lists.llvm.org
Fri Mar 20 08:40:03 PDT 2026
================
@@ -3752,22 +3753,42 @@ static MachineSDNode *buildRegSequence16(SmallVectorImpl<SDValue> &Elts,
if (isExtractHiElt(Elts[i + 1], HiSrc) && LoSrc == HiSrc) {
PackedElts.push_back(HiSrc);
} else {
+ if (Subtarget->useRealTrue16Insts()) {
+ // FIXME-TRUE16. use reg_sequence to replace v_perm_b32
----------------
broxigarchen wrote:
Updated the comment to make it more clear.
Factor it to be a seperate function. Does this looks good to you?
https://github.com/llvm/llvm-project/pull/187629
More information about the llvm-commits
mailing list