[PATCH] D71726: Let clang atomic builtins fetch add/sub support floating point types
Matt Arsenault via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Fri May 15 13:37:29 PDT 2020
arsenm added a comment.
In D71726#1801346 <https://reviews.llvm.org/D71726#1801346>, @__simt__ wrote:
> In D71726#1792852 <https://reviews.llvm.org/D71726#1792852>, @yaxunl wrote:
>
> > In D71726#1791904 <https://reviews.llvm.org/D71726#1791904>, @jfb wrote:
> >
> > > This generally seems fine. Does it work on most backends? I want to make sure it doesn't fail in backends :)
> >
> >
> > For x86_64, amdgcn, aarch64, armv7, mips64, it is translated to cmpxchg by AtomicExpandPass and backends did codegen successfully.
> >
> > For hexagon, riscv32, it is translated to call of `__atomic_fetch_add_4` for fadd float. This is concerning. Probably we need to add `__atomic_fetch_{add|sub}_{f16|f32|f64}` ?
>
>
> For systems that have load-link/store-conditional architectures, like ARM / PPC / base RISC-V without extension, I would imagine that using a cmpxchg loop is much worse than simply doing the floating-point add/sub in the middle of the atomic mini-transaction. I'm sure that we want back-ends to be capable of implementing this better than what this pass is doing, even when they don't have "native" fp atomics.
>
> You listed amdgcn... what does this do on nvptx?
Targets can implement shouldExpandAtomicRMWInIR for the desired behavior, which NVPTX currently does not implement. Looking at AtomicExpandPass, it looks like either cmpxchg or LLSC expansions should work for the FP atomics already
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D71726/new/
https://reviews.llvm.org/D71726
More information about the cfe-commits
mailing list