[PATCH] D41794: [X86] Improve AVX1 shuffle lowering for v8f32 shuffles where the low half comes from V1 and the high half comes from V2 and the halves do the same operation
Konstantin Belochapka via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Jan 10 18:54:55 PST 2018
kbelochapka added a comment.
Hi Craig, I placed simple test on the Bugzilla, your fix seems to be working fine.
The only thing, there are couple or more of shuffle masks which looks like they can be a subject of the similar optimization, but the generated code for them remained not optimized.
// IN0: |0|1| | |4|5| | |
// IN1: |8|9| | | | |E|F|
MASK011: 0,1,4,5,8,9,14,15
// IN0: |0|1| | | | |6|7|
// IN1: |8|9| | |C|D| | |
MASK012: 0,1,6,7,8,9,12,13
https://reviews.llvm.org/D41794
More information about the llvm-commits
mailing list