[llvm] ebe7265 - [Mips] Fix fast isel for i16 bswap. (#103398)
via llvm-commits
llvm-commits at lists.llvm.org
Fri Aug 16 14:54:54 PDT 2024
Author: Craig Topper
Date: 2024-08-16T14:54:51-07:00
New Revision: ebe7265b142f370f0a563fece5db22f57383ba2d
URL: https://github.com/llvm/llvm-project/commit/ebe7265b142f370f0a563fece5db22f57383ba2d
DIFF: https://github.com/llvm/llvm-project/commit/ebe7265b142f370f0a563fece5db22f57383ba2d.diff
LOG: [Mips] Fix fast isel for i16 bswap. (#103398)
We need to mask the SRL result to 8 bits before ORing in the SLL. This
is needed in case bits 23:16 of the input aren't zero. They will have
been shifted into bits 15:8.
We don't need to AND the result with 0xffff. It's ok if the upper 16
bits of the register are garbage.
Fixes #103035.
Added:
Modified:
llvm/lib/Target/Mips/MipsFastISel.cpp
llvm/test/CodeGen/Mips/Fast-ISel/bswap1.ll
Removed:
################################################################################
diff --git a/llvm/lib/Target/Mips/MipsFastISel.cpp b/llvm/lib/Target/Mips/MipsFastISel.cpp
index 3485825b77627e..7d8278c8ca3cf5 100644
--- a/llvm/lib/Target/Mips/MipsFastISel.cpp
+++ b/llvm/lib/Target/Mips/MipsFastISel.cpp
@@ -1608,8 +1608,8 @@ bool MipsFastISel::fastLowerIntrinsicCall(const IntrinsicInst *II) {
}
emitInst(Mips::SLL, TempReg[0]).addReg(SrcReg).addImm(8);
emitInst(Mips::SRL, TempReg[1]).addReg(SrcReg).addImm(8);
- emitInst(Mips::OR, TempReg[2]).addReg(TempReg[0]).addReg(TempReg[1]);
- emitInst(Mips::ANDi, DestReg).addReg(TempReg[2]).addImm(0xFFFF);
+ emitInst(Mips::ANDi, TempReg[2]).addReg(TempReg[1]).addImm(0xFF);
+ emitInst(Mips::OR, DestReg).addReg(TempReg[0]).addReg(TempReg[2]);
updateValueMap(II, DestReg);
return true;
}
diff --git a/llvm/test/CodeGen/Mips/Fast-ISel/bswap1.ll b/llvm/test/CodeGen/Mips/Fast-ISel/bswap1.ll
index bd762a0e1d741f..ce664c78e86c2a 100644
--- a/llvm/test/CodeGen/Mips/Fast-ISel/bswap1.ll
+++ b/llvm/test/CodeGen/Mips/Fast-ISel/bswap1.ll
@@ -21,8 +21,8 @@ define void @b16() {
; 32R1: sll $[[TMP1:[0-9]+]], $[[A_VAL]], 8
; 32R1: srl $[[TMP2:[0-9]+]], $[[A_VAL]], 8
- ; 32R1: or $[[TMP3:[0-9]+]], $[[TMP1]], $[[TMP2]]
- ; 32R1: andi $[[TMP4:[0-9]+]], $[[TMP3]], 65535
+ ; 32R1: andi $[[TMP3:[0-9]+]], $[[TMP2]], 255
+ ; 32R1: or $[[RESULT:[0-9]+]], $[[TMP1]], $[[TMP3]]
; 32R2: wsbh $[[RESULT:[0-9]+]], $[[A_VAL]]
More information about the llvm-commits
mailing list