[llvm] [NVPTX] Add support for atomic add for f16 type (PR #84295)

Adrian Kuegel via llvm-commits llvm-commits at lists.llvm.org
Wed Mar 13 08:07:14 PDT 2024


akuegel wrote:

I now have a reproducer for the failure that caused the revert. If I adjust the test slightly and change the line that defines %r1 to:

```
%r1 = atomicrmw fadd ptr %dp0, half 1.0 seq_cst, align 2
```
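For context, a standalone version of that line might look like the sketch below. The function name `test` is inferred from the `test_param_0` parameter in the generated PTX; the target triple and the `ret` are assumptions added to make the snippet self-contained, not copied from the actual test file.

```
; Minimal sketch of a reproducer (assumed wrapper around the changed line).
target triple = "nvptx64-nvidia-cuda"

define half @test(ptr %dp0) {
  ; The constant operand (half 1.0 is 0x3C00 in IEEE half precision)
  ; is what ends up as the immediate in the generated atom instruction.
  %r1 = atomicrmw fadd ptr %dp0, half 1.0 seq_cst, align 2
  ret half %r1
}
```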
The codegen for this becomes:

```
ld.param.u64 %rd1, [test_param_0];
atom.add.noftz.f16 %rs1, [%rd1], 0x3C00;
```

And this fails verification with this error:

```
Arguments mismatch for instruction 'atom'
```

I don't know PTX well enough to tell what is wrong with the generated `atom` instruction. I will try to figure it out.

https://github.com/llvm/llvm-project/pull/84295

