[PATCH] D37985: [AMDGPU] add LDS f32 intrinsics

Wed Dec 20 09:19:29 PST 2017

arsenm added inline comments.

================
Comment at: include/llvm/IR/IntrinsicsAMDGPU.td:306
+
+def int_amdgcn_atomic_add : AMDGPUAtomicF32Intrin;
+def int_amdgcn_atomic_min : AMDGPUAtomicF32Intrin;
----------------
These should probably be named fadd.. etc to match the IR operations

================
Comment at: lib/Target/AMDGPU/SIISelLowering.cpp:574-576
     Info.vol = !Vol || !Vol->isZero();
     Info.readMem = true;
     Info.writeMem = true;
----------------
This will need to be rebased since I just changed these last week

================
Comment at: lib/Target/AMDGPU/SIISelLowering.cpp:6549-6552
+  case AMDGPUISD::ATOMIC_DEC:
+  case AMDGPUISD::ATOMIC_LOAD_FADD:
+  case AMDGPUISD::ATOMIC_LOAD_FMIN:
+  case AMDGPUISD::ATOMIC_LOAD_FMAX:  // TODO: Target mem intrinsics.
----------------
Tests for these with this combine would be nice

================
Comment at: test/CodeGen/AMDGPU/lds_atomic_f32.ll:1
+; RUN: llc -march=amdgcn -mcpu=gfx900 -verify-machineinstrs < %s | FileCheck -enable-var-scope %s
+declare float @llvm.amdgcn.atomic.add.f32(float addrspace(3)* nocapture, float, i32, i32, i1)
----------------
Can you also add a pre-gfx9 run line, and check for the m0 initialization

https://reviews.llvm.org/D37985