[llvm-bugs] [Bug 31321] New: Halide would like to set the .maxnreg directive per PTX kernel.
via llvm-bugs
llvm-bugs at lists.llvm.org
Thu Dec 8 13:37:12 PST 2016
https://llvm.org/bugs/show_bug.cgi?id=31321
Bug ID: 31321
Summary: Halide would like to set the .maxnreg directive per
PTX kernel.
Product: libraries
Version: trunk
Hardware: Other
OS: All
Status: NEW
Severity: enhancement
Priority: P
Component: Backend: PTX
Assignee: unassignedbugs at nondot.org
Reporter: zalman at google.com
CC: llvm-bugs at lists.llvm.org
Classification: Unclassified
The PTX backend has the ability to generate certain per kernel (.entry) PTX
directives via metadata annotations. The test for this is in
test/CodeGen/NVPTX/annotations.ll . According to this NVIDIA PTX document:
http://docs.nvidia.com/cuda/parallel-thread-execution/#performance-tuning-directives
the .maxnreg value can be set on a per entry basis as well. Halide would like
to exploit this to be able to provide a scheduling directive to control this
value. (See:
https://github.com/halide/Halide/pull/1667
where there is a hack to set the maximum number of registers on a per module
basis at load time.)
Plumbing this through involves adding support to
NVPTXAsmPrinter::emitKernelFunctionDirectives .
--
You are receiving this mail because:
You are on the CC list for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-bugs/attachments/20161208/044ab247/attachment.html>
More information about the llvm-bugs
mailing list