[PATCH] D135269: [AMDGPU] Disable bool range metadata to workaround backend issue

Yaxun Liu via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Wed Oct 5 15:40:02 PDT 2022


yaxunl added a comment.

In D135269#3837394 <https://reviews.llvm.org/D135269#3837394>, @tra wrote:

> Is there more info about the issue? What does AMDGPU currently emit for the test case?
>
> AFAICT from running it on CE (https://godbolt.org/z/ccq3vnbrM) llvm optimizes it to essentially `*y = *x` and generates a 1-byte load+store for both NVPTX and AMDGPU.

The issue happens to more complicated test cases which I cannot reduce right now.

Basically 8018d6be3459780e81a5da128a9915eb27909902 <https://reviews.llvm.org/rG8018d6be3459780e81a5da128a9915eb27909902> caused regressions in some PyTorch tests. Investigation shows the propagation of range metadata for bool type triggered some optimizations which caused some bool values to be loaded incorrectly. I will continue investigating the issue. However, I need a workaround for now.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D135269/new/

https://reviews.llvm.org/D135269



More information about the cfe-commits mailing list