[Openmp-commits] [clang] [libcxx] [compiler-rt] [libc] [flang] [llvm] [clang-tools-extra] [libcxxabi] [openmp] [mlir] [AArch64] Add custom lowering for load <3 x i8>. (PR #78632)

Florian Hahn via Openmp-commits openmp-commits at lists.llvm.org
Thu Jan 25 09:09:22 PST 2024


fhahn wrote:

> > Thanks, this is indeed more compact. I tried to massage the SelectionDAG nodes to generate it ([7cc78c5](https://github.com/llvm/llvm-project/commit/7cc78c52f481161d7195ac4c7f9ec05b1cd1f442)) but it appears there are some cases where this results in slightly more code. I can check where those differences are coming from.
> 
> It looks like the INSERT_VECTOR_ELT is getting "optimized" into a BUILD_VECTOR, or something like that, instead of doing a shuffle like it does with your original sequence.


@efriedma-quic ok I managed to track down where the issue is. The only workaround I could come up with is extending `ReconstructShuffle`  to support the case where one element is a load via `shuffleWithSingleLoad`. WDYT?

https://github.com/llvm/llvm-project/pull/78632


More information about the Openmp-commits mailing list