[PATCH] D151153: [AMDGPU] Add attribute to AMDGPU ctor / dtor to indicate single threadedness
Joseph Huber via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed May 24 08:12:45 PDT 2023
jhuber6 added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUCtorDtorLowering.cpp:41
InitOrFiniKernel->setCallingConv(CallingConv::AMDGPU_KERNEL);
+ InitOrFiniKernel->addFnAttr("amdgpu-flat-work-group-size", "1,1");
if (IsCtor)
----------------
arsenm wrote:
> We should probably teach UniformityAnalysis about this special case for the work item IDs
Do you mean the special case where the number of threads is one? That would definitely simplify anything to do with convergence.
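To make the special case concrete: with "amdgpu-flat-work-group-size" set to "1,1", the work-group holds exactly one work item, so every work item ID is 0 and can be treated as uniform. The following is only a rough standalone sketch of that check, not the actual UniformityAnalysis change. The helper names (parseUnsigned, parseFlatWorkGroupSize, workItemIdsAreUniform) are hypothetical, and it works on the raw attribute string rather than on an llvm::Function, on the assumption that the value always has the "min,max" form used in the patch.

```cpp
#include <cctype>
#include <cstddef>
#include <optional>
#include <string>
#include <utility>

// Hypothetical helper: parse a short, non-empty run of decimal digits.
// Anything else (empty string, sign, whitespace, too many digits) is rejected.
static std::optional<unsigned> parseUnsigned(const std::string &S) {
  if (S.empty() || S.size() > 9)
    return std::nullopt;
  unsigned V = 0;
  for (char C : S) {
    if (!std::isdigit(static_cast<unsigned char>(C)))
      return std::nullopt;
    V = V * 10 + static_cast<unsigned>(C - '0');
  }
  return V;
}

// Hypothetical helper: split a "min,max" attribute value such as "1,1".
// Returns std::nullopt if the value is malformed or min > max.
std::optional<std::pair<unsigned, unsigned>>
parseFlatWorkGroupSize(const std::string &Value) {
  const std::size_t Comma = Value.find(',');
  if (Comma == std::string::npos)
    return std::nullopt;
  const auto Min = parseUnsigned(Value.substr(0, Comma));
  const auto Max = parseUnsigned(Value.substr(Comma + 1));
  if (!Min || !Max || *Min > *Max)
    return std::nullopt;
  return std::make_pair(*Min, *Max);
}

// Assumption: if the maximum flat work-group size is 1, there is a single
// work item, so its work item IDs are trivially uniform (all zero).
bool workItemIdsAreUniform(const std::string &AttrValue) {
  const auto Range = parseFlatWorkGroupSize(AttrValue);
  return Range && Range->second == 1;
}
```

For the ctor / dtor kernels in this patch, workItemIdsAreUniform("1,1") returns true, whereas a kernel with a range such as "1,256" would still have to treat its work item IDs as divergent.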
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D151153/new/
https://reviews.llvm.org/D151153