[PATCH] D48712: [X86] Lowering integer truncation intrinsics to native IR
Simon Pilgrim via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Fri Jun 29 10:31:13 PDT 2018
RKSimon added a comment.
Please can you create a llvm side parallel patch that updates the relevant fast-isel tests
================
Comment at: clang/lib/Headers/avx512vlbwintrin.h:1501
+ (__v8qi){0, 0, 0, 0, 0, 0, 0, 0}, 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,
+ 12, 13, 14, 15);
}
----------------
mike.dvoretsky wrote:
> RKSimon wrote:
> > Are we happy with using illegal types like this? What about flipping the shuffle and convert?
> >
> > ```
> > return (__m128i)__builtin_convertvector(
> > __builtin_shufflevector((__v8hi)__A,
> > (__v8hi){0, 0, 0, 0, 0, 0, 0, 0},
> > 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15), __v16qi);
> > ```
> This would bring its own issues, since in the cvtepi64_epi8 cases the inner shuffle would produce vectors of 16 64-bit values. There would be no extra typedef, but in the back-end these would be split in type legalization, making it harder to fold them into VPMOV instructions.
Yeah, neither solution is particularly clean. Please keep it as is.
https://reviews.llvm.org/D48712
More information about the cfe-commits
mailing list