[PATCH] D96517: [AMDGPU] Optimize SGPR to scratch spilling
Sebastian Neubauer via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Feb 22 07:59:24 PST 2021
sebastian-ne added a comment.
In D96517#2578884 <https://reviews.llvm.org/D96517#2578884>, @hliao wrote:
> why exec mask = 0 case is a valid one, won't we already branch away once exec mask goes to zero?
That is the question it comes down to. If it is guaranteed that exec is never 0, i.e. at least one bit is always set, I’m in favor of your patch.
To have some numbers, I saw some functions spilling 30 SGPRs to scratch, so it can be more than just a one or two.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D96517/new/
https://reviews.llvm.org/D96517
More information about the llvm-commits
mailing list