[PATCH] D99507: [amdgpu] Add a pass to avoid jump into blocks with 0 exec mask.
Ruiling, Song via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Mar 29 18:16:08 PDT 2021
ruiling added a comment.
> For such blocks where the mask is restored from a reloaded mask, a zero exec mask results in undefined behavior, as the SGPR reload uses v_readfirstlane
Can we mark the SGPRs that hold the masks as unspillable during LowerControlFlow to fix your problem? Once SGPR and VGPR allocation are split, this problem would disappear.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D99507/new/
https://reviews.llvm.org/D99507