[llvm-dev] PTX generation from CUDA file for compute capability 1.0 (sm_10)

Justin Lebar via llvm-dev llvm-dev at lists.llvm.org
Thu Jun 2 00:34:09 PDT 2016


Hi, Ginu.

No earlier version of llvm supports sm_10.  It's not something I have
looked at deeply, but I expect adding support would be nontrivial, because
one would have to teach the nvptx backend which machine instructions are
and are not available in that architecture.

Regards,
-Justin
On Jun 2, 2016 12:23 AM, "ginu jacob via llvm-dev" <llvm-dev at lists.llvm.org>
wrote:

> Hello Bergström/Eric,
>
> Thanks for the reply. The G80(sm_10) architecture was ported on FPGA by a
> group of researchers (
> http://www.ecs.umass.edu/ece/tessier/andryc-fpt13.pdf). Our group have
> some further research interest on this work. I was working on modifying the
> Clang-LLVM for a couple of months and achieved the required changes. But
> Clang-LLVM is only allowing me to generate PTX for sm_20, sm_30 etc.While
> trying to generate PTX for sm_10, it gave
>
> *error: unknown target CPU 'sm_10'*
> *fatal error: cannot open file '/tmp/shared-395893.s': No such file or
> directory1 error generated.*
>
>
> The compilation command used is:
> clang -Xclang -I$LIBCLC/include/generic -I$LIBCLC/include/ptx
> -Dcl_clang_storage_class_specifiers -O3 CudaSource.cu -S -o PtxOutput.ptx
> --cuda-gpu-arch=sm_10
>
> Is there any chance that this error being generated from CUDA runtime
> alone since I am using CUDA 7.5 which does not support sm_10. If there is
> any chance that the error is isolated from LLVM and is only due to CUDA, i
> have some hope to use a lower CUDA version. Please let me know your
> suggestions.
>
> Thank you,
> Ginu
>
>
> On Thu, Jun 2, 2016 at 2:36 PM, C Bergström <cbergstrom at pathscale.com>
> wrote:
>
>> What happens if you hack change llvm to accept sm_10? Do you get an
>> error somewhere further down the pipeline?
>>
>> sm_10 is pretty old hardware - Why the strong dependency on this?
>>
>> On Thu, Jun 2, 2016 at 1:18 PM, ginu jacob via llvm-dev
>> <llvm-dev at lists.llvm.org> wrote:
>> > Hello,
>> >
>> > When generating the PTX output from CUDA file(.cu file), the minimum
>> target
>> > that is accepted by LLVM is sm_20. But I have a specific requirement to
>> > generate PTX output for compute capability 1.0 (sm_10). Is there any
>> > previous version of LLVM supporting this?
>> >
>> > Thank you,
>> > Ginu
>> >
>> > _______________________________________________
>> > LLVM Developers mailing list
>> > llvm-dev at lists.llvm.org
>> > http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>> >
>>
>
>
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org
> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20160602/38660292/attachment.html>


More information about the llvm-dev mailing list