[PATCH] D145299: [X86] Generate better code for std::bit_ceil

Sanjay Patel via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Sun Mar 5 11:26:55 PST 2023


spatel added a comment.

In D145299#4169799 <https://reviews.llvm.org/D145299#4169799>, @goldstein.w.n wrote:

> In D145299#4169706 <https://reviews.llvm.org/D145299#4169706>, @spatel wrote:
>
>> In D145299#4169702 <https://reviews.llvm.org/D145299#4169702>, @RKSimon wrote:
>>
>>> What is preventing is from performing this in InstCombine? I don't think this pattern will emerge in SelectionDAG
>>
>> I haven't found a way to avoid a poison shift in IR without doing a cmp+select or umax yet. I think we're relying on the x86-specific behavior of masking the shift amount to make that part of the logic disappear in this patch.
>
> The IR is:
>
>   %2 = add i32 %0, -1
>   %3 = tail call i32 @llvm.ctlz.i32(i32 %2, i1 false), !range !5
>   %4 = sub nuw nsw i32 32, %3
>   %5 = shl nuw i32 1, %4
>   %6 = icmp ugt i32 %0, 1
>   %7 = select i1 %6, i32 %5, i32 1
>   ret i32 %7
>
> The poison shift is if `%3` is zero?

Yes - if we shift by the bitwidth, that's poison in IR.

IIUC in this example, we don't have to care about any input "ugt 0x8000_0000" ( https://en.cppreference.com/w/cpp/numeric/bit_ceil ), so we'd need the front-end to provide that info somehow. But 0x8000_0000 is still a valid input, so does that knowledge actually help?


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D145299/new/

https://reviews.llvm.org/D145299



More information about the llvm-commits mailing list