[PATCH] D156301: [WIP] Support FAdd/FSub global atomics in AMDGPUAtomicOptimizer.

Pravin Jagtap via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Jul 31 02:32:48 PDT 2023


pravinjagtap added inline comments.


================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUAtomicOptimizer.cpp:329
+    // TODO: Support for double type
+    if (!isScanStrategyIterative() || I.getType()->isDoubleTy()) {
+      return;
----------------
arsenm wrote:
> Doesn't consider half
> 
> Should also handle <2 x half>, but atomicrmw doesn't support vectors now (you need the intrinsics for those)
> Doesn't consider half

Appears that `_Float16` is not supported for atomics in HIP: https://cuda.godbolt.org/z/Gf7so4Y9K


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D156301/new/

https://reviews.llvm.org/D156301



More information about the llvm-commits mailing list