[PATCH] D16747: AMDGPU: Fix emitting invalid workitem intrinsics for HSA

Matt Arsenault via llvm-commits llvm-commits at lists.llvm.org
Fri Jan 29 19:03:47 PST 2016


arsenm created this revision.
arsenm added a reviewer: tstellarAMD.
arsenm added a subscriber: llvm-commits.
Herald added a subscriber: arsenm.

The AMDGPUPromoteAlloca pass was emitting the read.local.size
calls, which with HSA was incorrectly selected to reading from
the offset mesa uses off of the kernarg pointer.
    
Error on intrinsics which aren't supported by HSA, and start
emitting the correct IR to read the workgroup size
out of the dispatch pointer.
    
Also initialize the pass so it can be tested with opt, and
start moving towards not depending on the subtarget as an
argument.
    
Start emitting errors for the intrinsics not handled with HSA.

http://reviews.llvm.org/D16747

Files:
  lib/Target/AMDGPU/AMDGPU.h
  lib/Target/AMDGPU/AMDGPUPromoteAlloca.cpp
  lib/Target/AMDGPU/AMDGPUTargetMachine.cpp
  lib/Target/AMDGPU/SIISelLowering.cpp
  test/CodeGen/AMDGPU/private-memory-r600.ll
  test/CodeGen/AMDGPU/private-memory.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D16747.46462.patch
Type: text/x-patch
Size: 29371 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20160130/4f412ce2/attachment.bin>


More information about the llvm-commits mailing list