[PATCH] D135269: [AMDGPU] Disable bool range metadata to workaround backend issue
Yaxun Liu via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Wed Oct 5 15:40:02 PDT 2022
yaxunl added a comment.
In D135269#3837394 <https://reviews.llvm.org/D135269#3837394>, @tra wrote:
> Is there more info about the issue? What does AMDGPU currently emit for the test case?
>
> AFAICT from running it on CE (https://godbolt.org/z/ccq3vnbrM) llvm optimizes it to essentially `*y = *x` and generates a 1-byte load+store for both NVPTX and AMDGPU.
The issue happens to more complicated test cases which I cannot reduce right now.
Basically 8018d6be3459780e81a5da128a9915eb27909902 <https://reviews.llvm.org/rG8018d6be3459780e81a5da128a9915eb27909902> caused regressions in some PyTorch tests. Investigation shows the propagation of range metadata for bool type triggered some optimizations which caused some bool values to be loaded incorrectly. I will continue investigating the issue. However, I need a workaround for now.
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D135269/new/
https://reviews.llvm.org/D135269
More information about the cfe-commits
mailing list