[PATCH] D112268: [LegalizeTypes][RISCV][PowerPC] Expand CTLZ/CTTZ/CTPOP instead of promoting if they'll be expanded later.
Thomas Preud'homme via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Nov 17 15:27:20 PST 2021
thopre added a comment.
In D112268#3138471 <https://reviews.llvm.org/D112268#3138471>, @craig.topper wrote:
> In D112268#3137708 <https://reviews.llvm.org/D112268#3137708>, @thopre wrote:
>
>> Hi @craig.topper ,
>>
>> This caused a regression for us for i16 cttz with is_undef_zero true because cttz is expanded to a sequence using ctpop which loses the fact that any output is ok for the 0 case.
>>
>> t17: i16 = sub t9, Constant:i16<1>
>> t19: i16 = xor t9, Constant:i16<-1>
>> t20: i16 = and t19, t17
>> t21: i16 = ctpop t20
>>
>> So when this gets promoted, a mask is inserted to not have the result of ctpop change when `t9` is 0.
>>
>> t15: i32 = extract_vector_elt t2, Constant:i32<0>
>> t25: i32 = xor t15, Constant:i32<-1>
>> t23: i32 = sub t15, Constant:i32<1>
>> t26: i32 = and t25, t23
>> t28: i32 = and t26, Constant:i32<65535>
>> t29: i32 = ctpop t28
>>
>> Prior to this patch, we'd get:
>>
>> t19: i32 = sub t15, Constant:i32<1>
>> t21: i32 = xor t15, Constant:i32<-1>
>> t22: i32 = and t21, t19
>> t23: i32 = ctpop t22
>>
>> I thought about inserting some llvm.assume saying the value is non zero when expanding but those work at IR level. Any pointer on how to solve this? Best regards.
>
> I take it i32 ctpop is legal for your target or the i16 ctpop would have gotten expanded too?
Correct.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D112268/new/
https://reviews.llvm.org/D112268
More information about the llvm-commits
mailing list