[PATCH] D41794: [X86] Improve AVX1 shuffle lowering for v8f32 shuffles where the low half comes from V1 and the high half comes from V2 and the halves do the same operation

Konstantin Belochapka via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Jan 10 18:54:55 PST 2018


kbelochapka added a comment.

Hi Craig, I placed simple test on the Bugzilla, your fix seems to be working fine.
The only thing, there are couple or more of shuffle masks which looks like they can be a subject of the similar optimization, but the generated code for them remained not optimized. 
// IN0:  |0|1| | |4|5| | |
// IN1:  |8|9| | | | |E|F|
MASK011:	0,1,4,5,8,9,14,15	
// IN0:  |0|1| | | | |6|7|
// IN1:  |8|9| | |C|D| | |
MASK012:	0,1,6,7,8,9,12,13


https://reviews.llvm.org/D41794





More information about the llvm-commits mailing list