[PATCH] D99507: [amdgpu] Add a pass to avoid jump into blocks with 0 exec mask.

Ruiling, Song via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Mar 29 18:16:08 PDT 2021


ruiling added a comment.

> For such blocks, where the exec mask is restored from a reloaded mask, a zero exec mask results in undefined behavior, because the SGPR reload uses v_readfirstlane

Can we mark the SGPRs holding the masks as unspillable during LowerControlFlow to fix your problem? Once SGPR and VGPR allocation are split, this problem should disappear.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D99507/new/

https://reviews.llvm.org/D99507


