[PATCH] D15608: AMDGPU/SI: Fix implementation of isSourceOfDivergence() for graphics shaders
Tom Stellard via llvm-commits
llvm-commits at lists.llvm.org
Thu Dec 17 07:02:48 PST 2015
tstellarAMD created this revision.
tstellarAMD added a reviewer: arsenm.
tstellarAMD added a subscriber: llvm-commits.
Herald added a subscriber: arsenm.
The analysis of shader inputs was completely wrong. We were passing the
wrong index to AttributeSet::hasAttribute() (it expects the argument
number plus one), and the logic for which inputs were in SGPRs was wrong too.
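
To illustrate the off-by-one, here is a minimal standalone sketch. It does not use the LLVM API: AttrSlots, Attr, and the helper functions are hypothetical stand-ins that only model the AttributeSet slot layout, where index 0 holds the return-value attributes and index N + 1 holds the attributes of argument N. The example argument list is the one from the kernel-args.ll test in this patch.

```cpp
#include <cstddef>
#include <set>
#include <vector>

// Hypothetical stand-ins for illustration only; these are not LLVM types.
enum class Attr { InReg, ByVal };

// Models the AttributeSet slot layout assumed by this patch: slot 0 holds
// the return-value attributes, slot N + 1 holds those of argument N.
struct AttrSlots {
  std::vector<std::set<Attr>> Slots;

  explicit AttrSlots(std::size_t NumArgs) : Slots(NumArgs + 1) {}

  // Record an attribute on argument ArgNo (zero-based).
  void addArgAttr(std::size_t ArgNo, Attr A) { Slots[ArgNo + 1].insert(A); }

  // Query by raw slot index, like AttributeSet::hasAttribute(Index, Kind).
  bool hasAttribute(std::size_t Index, Attr A) const {
    return Index < Slots.size() && Slots[Index].count(A) != 0;
  }
};

// Patched graphics-shader rule: an argument is in SGPRs only if it is
// marked inreg or byval; everything else is in VGPRs.
bool isArgPassedInSGPR(const AttrSlots &AS, std::size_t ArgNo) {
  return AS.hasAttribute(ArgNo + 1, Attr::InReg) ||
         AS.hasAttribute(ArgNo + 1, Attr::ByVal);
}

// Pre-patch attribute lookup only (the old type-based fallback is omitted):
// it queries slot ArgNo, which belongs to the previous argument.
bool isArgPassedInSGPROld(const AttrSlots &AS, std::size_t ArgNo) {
  return AS.hasAttribute(ArgNo, Attr::InReg);
}

// Attribute layout of @main in kernel-args.ll:
// %arg0 is byval, %arg1 and %arg2 are inreg, %arg3..%arg6 are unmarked.
AttrSlots makeKernelArgsExample() {
  AttrSlots AS(7);
  AS.addArgAttr(0, Attr::ByVal);
  AS.addArgAttr(1, Attr::InReg);
  AS.addArgAttr(2, Attr::InReg);
  return AS;
}
```

With this model, the patched lookup reports %arg0 through %arg2 as SGPR inputs and %arg3 through %arg6 as VGPR inputs, while the old lookup misses %arg1 and wrongly flags %arg3 because it reads the neighbouring slot.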
http://reviews.llvm.org/D15608
Files:
lib/Target/AMDGPU/AMDGPUTargetTransformInfo.cpp
test/Analysis/DivergenceAnalysis/AMDGPU/kernel-args.ll
test/Analysis/DivergenceAnalysis/AMDGPU/lit.local.cfg
Index: test/Analysis/DivergenceAnalysis/AMDGPU/lit.local.cfg
===================================================================
--- /dev/null
+++ test/Analysis/DivergenceAnalysis/AMDGPU/lit.local.cfg
@@ -0,0 +1,2 @@
+if not 'AMDGPU' in config.root.targets:
+ config.unsupported = True
Index: test/Analysis/DivergenceAnalysis/AMDGPU/kernel-args.ll
===================================================================
--- /dev/null
+++ test/Analysis/DivergenceAnalysis/AMDGPU/kernel-args.ll
@@ -0,0 +1,18 @@
+; RUN: opt %s -analyze -divergence | FileCheck %s
+
+target triple = "amdgcn--"
+
+; CHECK: DIVERGENT:
+; CHECK-NOT: %arg0
+; CHECK-NOT: %arg1
+; CHECK-NOT: %arg2
+; CHECK: <2 x i32> %arg3
+; CHECK: DIVERGENT: <3 x i32> %arg4
+; CHECK: DIVERGENT: float %arg5
+; CHECK: DIVERGENT: i32 %arg6
+
+define void @main([4 x <16 x i8>] addrspace(2)* byval %arg0, float inreg %arg1, i32 inreg %arg2, <2 x i32> %arg3, <3 x i32> %arg4, float %arg5, i32 %arg6) #0 {
+ ret void
+}
+
+attributes #0 = { "ShaderType"="0" }
Index: lib/Target/AMDGPU/AMDGPUTargetTransformInfo.cpp
===================================================================
--- lib/Target/AMDGPU/AMDGPUTargetTransformInfo.cpp
+++ lib/Target/AMDGPU/AMDGPUTargetTransformInfo.cpp
@@ -130,14 +130,13 @@
if (ShaderType == ShaderType::COMPUTE)
return true;
- // For non-compute shaders, the inreg attribute is used to mark inputs,
- // which pre-loaded into SGPRs.
- if (F->getAttributes().hasAttribute(A->getArgNo(), Attribute::InReg))
+ // For non-compute shaders, SGPR inputs are marked with either inreg or byval.
+ if (F->getAttributes().hasAttribute(A->getArgNo() + 1, Attribute::InReg) ||
+ F->getAttributes().hasAttribute(A->getArgNo() + 1, Attribute::ByVal))
return true;
- // For non-compute shaders, 32-bit values are pre-loaded into vgprs, all
- // other value types use SGPRS.
- return !A->getType()->isIntegerTy(32) && !A->getType()->isFloatTy();
+ // Everything else is in VGPRs.
+ return false;
}
///