[llvm] [AMDGPU] Correctly insert s_nops for implicit read of SDWA (PR #100276)
Jeffrey Byrnes via llvm-commits
llvm-commits at lists.llvm.org
Thu Jul 25 14:21:18 PDT 2024
jrbyrnes wrote:
Adjust condition for "consumer" of forwarded dest. A VOP3A instruction can have partial write to the same dest using op_sel -- in this case, there will be implicit read on initial dest for parity check for ECC. The correctness issue is not isolated then to selecting the high bits via op_sel (as was previously implemented). This only affects ECC and not the actual bits in the dest reg.
https://github.com/llvm/llvm-project/pull/100276
More information about the llvm-commits
mailing list