[PATCH] D11477: fix invalid load folding with SSE/AVX FP logical instructions (PR22371)

Sanjay Patel spatel at rotateright.com
Mon Jul 27 17:37:45 PDT 2015


spatel added inline comments.

================
Comment at: test/CodeGen/X86/pr2656.ll:9-10
@@ -11,1 +8,4 @@
 
+; We can not fold either stack load into an 'xor' intruction because that
+; would change what should be a 4-byte load into a 16-byte load.
+
----------------
chandlerc wrote:
> I'd specifically call out that we *can* do a 16-byte constant pool load for the xorps mask used to negate these values, it just isn't folded because it is used twice. Otherwise it's a bit confusing to read the comment followed by this particular example.
Yes, that is confusing on 2nd look. I'll fix that and get this checked in.

Thanks for the prompt review!


http://reviews.llvm.org/D11477







More information about the llvm-commits mailing list