[PATCH] D145634: [X86] Support llvm.{min,max}imum.f{16,32,64}
Serguei Katkov via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Apr 24 20:57:52 PDT 2023
skatkov added a comment.
In D145634#4294264 <https://reviews.llvm.org/D145634#4294264>, @e-kud wrote:
> In D145634#4291019 <https://reviews.llvm.org/D145634#4291019>, @skatkov wrote:
>
>> I have a side question (not delaying landing this one)
>>
>> It looks like this change lowers the vectorized form of the intrinsic in a non-optimal way.
>> So I wonder whether there are plans to improve it as a follow-up?
>
> Yes, for sure. I've tried to find a way to implement a vectorized version for SSE, but nothing has come to mind. We need at least SSE2 for `PCMPEQ{W,D}` because all FP comparison instructions treat `-0.0` and `0.0` as equal. It seems that pentium3 will suffer from suboptimal fmaximum/fminimum...
Why can't we do something like this: https://godbolt.org/z/Yxfn3jTj1 ?
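To make the signed-zero issue concrete, here is a minimal scalar sketch in C of the semantics under discussion. It is not the content of the godbolt link and not the lowering from the patch. It only illustrates that an FP compare cannot tell `-0.0` from `0.0`, so the tie has to be resolved on the bit pattern. The names `float_bits` and `fmaximum_ref` are hypothetical and introduced just for this illustration.

```c
#include <math.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: raw IEEE-754 bit pattern of a float. */
static uint32_t float_bits(float x) {
    uint32_t u;
    memcpy(&u, &x, sizeof u);
    return u;
}

/* Scalar reference for llvm.maximum.f32-style semantics:
   NaN propagates, and +0.0 is treated as greater than -0.0. */
static float fmaximum_ref(float a, float b) {
    if (isnan(a)) return a;
    if (isnan(b)) return b;
    if (a == b) {
        /* An FP compare reports -0.0 == 0.0, so resolve the tie on the
           bits: AND keeps the sign bit only if both inputs are negative.
           For equal non-zero values the bit patterns are identical. */
        uint32_t u = float_bits(a) & float_bits(b);
        float r;
        memcpy(&r, &u, sizeof r);
        return r;
    }
    return a > b ? a : b;
}
```

For the minimum, the same idea would presumably use a bitwise OR in the tie case, so that `-0.0` wins whenever either input is negative zero.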
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D145634/new/
https://reviews.llvm.org/D145634