[PATCH] D131560: AMDGPU: Improve atomicrmw fadd selection
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Sep 21 15:56:15 PDT 2022
rampitec added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/SIISelLowering.cpp:12786
-  if ((AS == AMDGPUAS::GLOBAL_ADDRESS || AS == AMDGPUAS::FLAT_ADDRESS) &&
-      Subtarget->hasAtomicFaddNoRtnInsts()) {
-    if (Subtarget->hasGFX940Insts())
-      return AtomicExpansionKind::None;
-
+  if (AS == AMDGPUAS::GLOBAL_ADDRESS || AS == AMDGPUAS::FLAT_ADDRESS) {
     // The amdgpu-unsafe-fp-atomics attribute enables generation of unsafe
----------------
The logic below seems correct to me, but why drop the `hasAtomicFaddNoRtnInsts` check? It is a fast way to skip the whole block on targets that do not have these instructions at all.
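To illustrate the early-exit pattern in question, here is a minimal standalone sketch. It is not the code from the patch: `MockSubtarget`, `AddrSpace`, `ExpansionKind`, `selectDetailed`, and `shouldExpandFAdd` are hypothetical stand-ins, and the fallback to a compare-exchange expansion is an assumption made only for the example.

```cpp
// Hypothetical stand-ins for illustration; these are NOT the real LLVM types.
enum class AddrSpace { Global, Flat, Local };
enum class ExpansionKind { None, CmpXChg };

struct MockSubtarget {
  bool HasAtomicFaddNoRtn; // models the result of hasAtomicFaddNoRtnInsts()
  bool hasAtomicFaddNoRtnInsts() const { return HasAtomicFaddNoRtn; }
};

// Placeholder for the detailed per-type and per-attribute checks that live
// inside the guarded block. Assumption: unsafe FP atomics allow native
// selection, otherwise the operation is expanded.
ExpansionKind selectDetailed(const MockSubtarget &, bool UnsafeFPAtomics) {
  return UnsafeFPAtomics ? ExpansionKind::None : ExpansionKind::CmpXChg;
}

ExpansionKind shouldExpandFAdd(const MockSubtarget &ST, AddrSpace AS,
                               bool UnsafeFPAtomics) {
  // Cheap guard: targets without the instructions never enter the block,
  // so none of the detailed checks run for them.
  if ((AS == AddrSpace::Global || AS == AddrSpace::Flat) &&
      ST.hasAtomicFaddNoRtnInsts())
    return selectDetailed(ST, UnsafeFPAtomics);

  // Assumed fallback for this sketch: expand to a compare-exchange loop.
  return ExpansionKind::CmpXChg;
}
```

With the guard in the outer condition, a target lacking the instructions takes the fallback path immediately, regardless of what the inner checks would have decided.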
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D131560/new/
https://reviews.llvm.org/D131560