[PATCH] D42743: [ARM] FullFP16 LowerReturn Fix

Sjoerd Meijer via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Jan 31 09:17:59 PST 2018


SjoerdMeijer created this revision.
SjoerdMeijer added a reviewer: olista01.
Herald added subscribers: kristof.beyls, javed.absar, aemerson.

Commit r323512 introduced an optimisation in LowerReturn for half-precision
return values. A missing check caused a crash when the return value is "undef"
(i.e. a node that has no operands)


https://reviews.llvm.org/D42743

Files:
  lib/Target/ARM/ARMISelLowering.cpp
  test/CodeGen/ARM/fp16-instructions.ll


Index: test/CodeGen/ARM/fp16-instructions.ll
===================================================================
--- test/CodeGen/ARM/fp16-instructions.ll
+++ test/CodeGen/ARM/fp16-instructions.ll
@@ -11,6 +11,18 @@
 ; RUN: llc < %s -mtriple=arm-none-eabihf -mattr=+vfp4      | FileCheck %s --check-prefix=CHECK-HARDFP-FP16
 ; RUN: llc < %s -mtriple=arm-none-eabihf -mattr=+fullfp16  | FileCheck %s --check-prefix=CHECK-HARDFP-FULLFP16
 
+
+define float @RetValBug(float %A.coerce) local_unnamed_addr {
+entry:
+  ret float bitcast (i32 zext (i16 bitcast (half fsub (half undef, half 0xH2FE6) to i16) to i32) to float)
+; This expression is optimised away due to the undef value. Check that
+; LowerReturn can handle undef nodes (i.e. nodes which do not have any
+; operands) when FullFP16 is enabled.
+;
+; CHECK-HARDFP-FULLFP16:  mov pc, lr
+}
+
+
 define float @Add(float %a.coerce, float %b.coerce) local_unnamed_addr {
 entry:
   %0 = bitcast float %a.coerce to i32
Index: lib/Target/ARM/ARMISelLowering.cpp
===================================================================
--- lib/Target/ARM/ARMISelLowering.cpp
+++ lib/Target/ARM/ARMISelLowering.cpp
@@ -2491,12 +2491,12 @@
       // t11 f16 = fadd ...
       // t12: i16 = bitcast t11
       //   t13: i32 = zero_extend t12
-      // t14: f32 = bitcast t13
+      // t14: f32 = bitcast t13  <~~~~~~~ Arg
       //
       // to avoid code generation for bitcasts, we simply set Arg to the node
       // that produces the f16 value, t11 in this case.
       //
-      if (Arg.getValueType() == MVT::f32) {
+      if (Arg.getValueType() == MVT::f32 && Arg.getOpcode() == ISD::BITCAST) {
         SDValue ZE = Arg.getOperand(0);
         if (ZE.getOpcode() == ISD::ZERO_EXTEND && ZE.getValueType() == MVT::i32) {
           SDValue BC = ZE.getOperand(0);


-------------- next part --------------
A non-text attachment was scrubbed...
Name: D42743.132196.patch
Type: text/x-patch
Size: 1812 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20180131/991dcd2e/attachment.bin>


More information about the llvm-commits mailing list