[PATCH] D106053: [CodeGen] Remove pending AssertZext AssertSext in promoting FP_TO_INT

Xiang Zhang via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Jul 15 19:24:45 PDT 2021


xiangzhangllvm added a comment.

> If the loaded value is not in the range [0.0, 1.0] so that %i375 is in the range [0.0, 255.0], then I don't know what value the fptoui is supposed to produce.

Or we can look at it this way.
In the edge calculation we only need to load 4 elements of the v8f32, but for performance reasons we usually load all 8 elements. So some of the loaded elements have no meaning (and may overflow).
The shuffle does not select these elements, but it may zero them for other uses (much like this small reproducer).
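To make the scenario concrete, here is a small Python model of it. This is only a sketch and not LLVM code: the lane values, the helper names `fptoui_promoted` and `fits_in_i8`, and the choice of lanes 0..3 as the meaningful ones are all assumptions for illustration. In LLVM IR an out-of-range fptoui yields poison, so the model shows just one possible outcome, where the conversion is done in a promoted 32-bit type and the upper bits turn out non-zero.

```python
# Hypothetical model, not LLVM code: each list entry stands for one vector lane.

def fptoui_promoted(x: float) -> int:
    """Model an fptoui done in a promoted 32-bit type (assumes 0 <= x < 2**32)."""
    return int(x)

def fits_in_i8(v: int) -> bool:
    """True if all bits above the low 8 are zero, which is what AssertZext i8 would claim."""
    return v == (v & 0xFF)

# 8 lanes are loaded for speed; only lanes 0..3 hold meaningful data in [0.0, 1.0].
loaded = [0.0, 0.25, 0.5, 1.0, 1.5, 2.0, 100.0, 1000.0]
scaled = [x * 255.0 for x in loaded]
converted = [fptoui_promoted(x) for x in scaled]

# Shuffle-like select: keep lanes 0..3 and zero the remaining lanes.
selected = [v if lane < 4 else 0 for lane, v in enumerate(converted)]
```

In this model the meaningful lanes fit in 8 bits, while the unused lanes do not, so an assertion that every promoted lane is zero-extended from i8 would not hold for them even though the shuffle never selects their values.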


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D106053/new/

https://reviews.llvm.org/D106053


