[PATCH] D149281: Must unroll epilogue loops after vectorization on AMDGPU target
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Apr 26 14:55:40 PDT 2023
rampitec added a comment.
In D149281#4299934 <https://reviews.llvm.org/D149281#4299934>, @fhahn wrote:
> In D149281#4299890 <https://reviews.llvm.org/D149281#4299890>, @rampitec wrote:
>
>> Add a test?
>
> That would be helpful. It would be good to understand why runtime unrolling is needed here. Does the interleave heuristic not kick in?
I actually meant a simpler formal test to check that unrolling works on AMDGPU. I am not sure @alex-t will be able to produce a reasonable test showing the performance problem. The loop in question, which triggered the regression, is quite complex to begin with.
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D149281/new/
https://reviews.llvm.org/D149281