[llvm] r355201 - [ARM] Fix FP16 stack loads/stores for Thumb2 with frame pointer
Oliver Stannard via llvm-commits
llvm-commits at lists.llvm.org
Fri Mar 1 06:20:28 PST 2019
Author: olista01
Date: Fri Mar 1 06:20:28 2019
New Revision: 355201
URL: http://llvm.org/viewvc/llvm-project?rev=355201&view=rev
Log:
[ARM] Fix FP16 stack loads/stores for Thumb2 with frame pointer
The new addressing mode added for the v8.2A FP16 instructions uses bit 8 of the
immediate to encode the sign of the offset, like the other FP loads/stores, so
need to be treated the same way.
Differential revision: https://reviews.llvm.org/D58816
Added:
llvm/trunk/test/CodeGen/ARM/fp16-frame-lowering.ll
Modified:
llvm/trunk/lib/Target/ARM/Thumb2InstrInfo.cpp
Modified: llvm/trunk/lib/Target/ARM/Thumb2InstrInfo.cpp
URL: http://llvm.org/viewvc/llvm-project/llvm/trunk/lib/Target/ARM/Thumb2InstrInfo.cpp?rev=355201&r1=355200&r2=355201&view=diff
==============================================================================
--- llvm/trunk/lib/Target/ARM/Thumb2InstrInfo.cpp (original)
+++ llvm/trunk/lib/Target/ARM/Thumb2InstrInfo.cpp Fri Mar 1 06:20:28 2019
@@ -638,7 +638,7 @@ bool llvm::rewriteT2FrameIndex(MachineIn
// Replace the FrameIndex with fp/sp
MI.getOperand(FrameRegIdx).ChangeToRegister(FrameReg, false);
if (isSub) {
- if (AddrMode == ARMII::AddrMode5)
+ if (AddrMode == ARMII::AddrMode5 || AddrMode == ARMII::AddrMode5FP16)
// FIXME: Not consistent.
ImmedOffset |= 1 << NumBits;
else
@@ -652,7 +652,7 @@ bool llvm::rewriteT2FrameIndex(MachineIn
// Otherwise, offset doesn't fit. Pull in what we can to simplify
ImmedOffset = ImmedOffset & Mask;
if (isSub) {
- if (AddrMode == ARMII::AddrMode5)
+ if (AddrMode == ARMII::AddrMode5 || AddrMode == ARMII::AddrMode5FP16)
// FIXME: Not consistent.
ImmedOffset |= 1 << NumBits;
else {
Added: llvm/trunk/test/CodeGen/ARM/fp16-frame-lowering.ll
URL: http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/ARM/fp16-frame-lowering.ll?rev=355201&view=auto
==============================================================================
--- llvm/trunk/test/CodeGen/ARM/fp16-frame-lowering.ll (added)
+++ llvm/trunk/test/CodeGen/ARM/fp16-frame-lowering.ll Fri Mar 1 06:20:28 2019
@@ -0,0 +1,22 @@
+; RUN: llc < %s -mtriple armv8a--none-eabi -mattr=+fullfp16 | FileCheck %s
+; RUN: llc < %s -mtriple armv8a--none-eabi -mattr=+fullfp16,+thumb-mode | FileCheck %s
+
+; Check that frame lowering for the fp16 instructions works correctly with
+; negative offsets (which happens when using the frame pointer).
+
+define void @foo(i32 %count) {
+entry:
+ %half_alloca = alloca half, align 2
+; CHECK: vstr.16 {{s[0-9]+}}, [{{r[0-9]+}}, #-10]
+ store half 0.0, half* %half_alloca
+ call void @bar(half* %half_alloca)
+
+ ; A variable-sized alloca to force the above store to use the frame pointer
+ ; instead of the stack pointer, and so need a negative offset.
+ %var_alloca = alloca i32, i32 %count
+ call void @baz(i32* %var_alloca)
+ ret void
+}
+
+declare void @bar(half*)
+declare void @baz(i32*)
More information about the llvm-commits
mailing list