[all-commits] [llvm/llvm-project] c2e8a4: [X86] Don't widen 128/256-bit strict compares with...
topperc via All-commits
all-commits at lists.llvm.org
Thu Feb 13 13:27:08 PST 2020
Branch: refs/heads/master
Home: https://github.com/llvm/llvm-project
Commit: c2e8a421ac52c5a17962a99db472be1d0bfdc296
https://github.com/llvm/llvm-project/commit/c2e8a421ac52c5a17962a99db472be1d0bfdc296
Author: Craig Topper <craig.topper at intel.com>
Date: 2020-02-13 (Thu, 13 Feb 2020)
Changed paths:
M llvm/lib/Target/X86/X86ISelLowering.cpp
M llvm/lib/Target/X86/X86InstrAVX512.td
M llvm/test/CodeGen/X86/vec-strict-cmp-128.ll
M llvm/test/CodeGen/X86/vec-strict-cmp-256.ll
Log Message:
-----------
[X86] Don't widen 128/256-bit strict compares with vXi1 result to 512-bits on KNL.
If we widen the compare we might trigger a spurious exception from
the garbage data.
We have two choices here. Explicitly force the upper bits to zero.
Or use a legacy VEX vcmpps/pd instruction and convert the XMM/YMM
result to mask register.
I've chosen to go with the second option. I'm not sure which is
really best. In some cases we could get rid of the zeroing since
the producing instruction probably already zeroed it. But we lose
the ability to fold a load. So which is best is dependent on
surrounding code.
Differential Revision: https://reviews.llvm.org/D74522
More information about the All-commits
mailing list