[PATCH] D93229: [VectorCombine] allow peeking through GEPs when creating a vector load

Sanjay Patel via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Dec 14 11:08:13 PST 2020


spatel added a comment.

In D93229#2452695 <https://reviews.llvm.org/D93229#2452695>, @lebedev.ri wrote:

> I'm having trouble coming up with an example because there appears to be a preexisting soundness problems, example: (CC @nlopes @aqjune)
>
>   define <8 x i16> @t(i8* align 128 dereferenceable(128) %base) {
>     %ptr = getelementptr inbounds i8, i8* %base, i64 1
>     %p = bitcast i8* %ptr to <8 x i16>*
>   
>     %gep = getelementptr inbounds <8 x i16>, <8 x i16>* %p, i64 0, i64 1
>     %s = load i16, i16* %gep, align 1
>     %r = insertelement <8 x i16> undef, i16 %s, i64 0
>     ret <8 x i16> %r
>   }
>
>   /builddirs/llvm-project/build-Clang11-unknown$ /builddirs/llvm-project/build-Clang11-unknown/bin/opt -load /repositories/alive2/build-Clang-release/tv/tv.so -tv -vector-combine -mtriple=x86_64-- -mattr=avx2 -tv -o /dev/null --tv-smt-to=60000 /tmp/D93229.ll 
>   
>   ----------------------------------------
>   define <8 x i16> @t(* dereferenceable(128) align(128) %base) {
>   %0:
>     %ptr = gep inbounds * dereferenceable(128) align(128) %base, 1 x i64 1
>     %p = bitcast * %ptr to *
>     %gep = gep inbounds * %p, 16 x i64 0, 2 x i64 1
>     %s = load i16, * %gep, align 1
>     %r = insertelement <8 x i16> undef, i16 %s, i64 0
>     ret <8 x i16> %r
>   }
>   =>
>   define <8 x i16> @t(* dereferenceable(128) align(128) %base) {
>   %0:
>     %ptr = gep inbounds * dereferenceable(128) align(128) %base, 1 x i64 1
>     %p = bitcast * %ptr to *
>     %gep = gep inbounds * %p, 16 x i64 0, 2 x i64 1
>     %1 = bitcast * %gep to *
>     %r = load <8 x i16>, * %1, align 1
>     ret <8 x i16> %r
>   }
>   Transformation doesn't verify!
>   ERROR: Target is more poisonous than source
>   
>   Example:
>   * dereferenceable(128) align(128) %base = pointer(non-local, block_id=1, offset=1664)
>   
>   Source:
>   * %ptr = pointer(non-local, block_id=1, offset=1665)
>   * %p = pointer(non-local, block_id=1, offset=1665)
>   * %gep = pointer(non-local, block_id=1, offset=1667)
>   i16 %s = poison
>   <8 x i16> %r = < poison, any, any, any, any, any, any, any >
>   
>   SOURCE MEMORY STATE
>   ===================
>   NON-LOCAL BLOCKS:
>   Block 0 >       size: 0 align: 1        alloc type: 0
>   Block 1 >       size: 2048      align: 128      alloc type: 0
>   
>   Target:
>   * %ptr = pointer(non-local, block_id=1, offset=1665)
>   * %p = pointer(non-local, block_id=1, offset=1665)
>   * %gep = pointer(non-local, block_id=1, offset=1667)
>   * %1 = pointer(non-local, block_id=1, offset=1667)
>   <8 x i16> %r = < poison, poison, poison, poison, poison, poison, poison, poison >
>   Source value: < poison, any, any, any, any, any, any, any >
>   Target value: < poison, poison, poison, poison, poison, poison, poison, poison >
>   
>   Alive2: Transform doesn't verify!

IIUC, this is a question of allowing poison (from the unused loaded memory elements) to propagate?
So we have to freeze or explicitly make those elements undef again?
https://alive2.llvm.org/ce/z/LKqBVW


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D93229/new/

https://reviews.llvm.org/D93229



More information about the llvm-commits mailing list