[llvm-bugs] [Bug 43505] New: PTX code generation with cuda10: shfl without .sync is not supported
via llvm-bugs
llvm-bugs at lists.llvm.org
Mon Sep 30 06:13:09 PDT 2019
https://bugs.llvm.org/show_bug.cgi?id=43505
Bug ID: 43505
Summary: PTX code generation with cuda10: shfl without .sync is
not supported
Product: clang
Version: 9.0
Hardware: Other
OS: Linux
Status: NEW
Severity: normal
Priority: P
Component: CUDA
Assignee: unassignedclangbugs at nondot.org
Reporter: smithc11 at rpi.edu
CC: llvm-bugs at lists.llvm.org
Created attachment 22605
--> https://bugs.llvm.org/attachment.cgi?id=22605&action=edit
tarball with source code, compilation instructions, and temporary files
Overview:
Using Clang 9.0.0 for PTX code generation from Thrust source code results in
the following error with Cuda 10.1.243:
ptxas /var/tmp/testcase-min-af0932.s, line 4521; error : Instruction 'shfl'
without '.sync' is not supported on .target sm_70 and higher from PTX ISA
version 6.4
Steps to Reproduce:
Compile the 'attached testcase-min.cu' with the following command:
$ clang++ -O0 --cuda-gpu-arch=sm_70 testcase-min.cu
Actual Results:
The following error is output during the build:
ptxas /var/tmp/testcase-min-af0932.s, line 4521; error : Instruction 'shfl'
without '.sync' is not supported on .target sm_70 and higher from PTX ISA
version 6.4
...
ptxas fatal : Ptx assembly aborted due to errors
clang-9: error: ptxas command failed with exit code 255 (use -v to see
invocation)
Expected Results:
A working binary.
Build Date & Hardware:
System: LLNL Lassen system; 2x IBM Power9 host processors + 4x Nvidia V100
per node
$ clang++ --version
clang version 9.0.0 (/builddir/build/BUILD/ibm-llvm/tools/clang
63a7d47678dad8b206a08bdfa9380ebdb147e888) (/builddir/build/BUILD/ibm-llvm
d99a7ea8cd2b634d0dcb13c44d06c4bdd4436c4e)
Target: powerpc64le-unknown-linux-gnu
Thread model: posix
InstalledDir: /usr/tce/packages/clang/clang-upstream-2019.08.15/release/bin
$ nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2019 NVIDIA Corporation
Built on Sun_Jul_28_19:07:52_PDT_2019
Cuda compilation tools, release 10.1, V10.1.243
$ echo $CUDA_HOME
/usr/tce/packages/cuda/cuda-10.1.243
Additional Builds and Platforms:
I did not compile the problematic code with Clang on another Volta equipped
system.
Additional Information:
Temporary files generated with '-save-temps' are included in the attached
tarball.
--
You are receiving this mail because:
You are on the CC list for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-bugs/attachments/20190930/6e6daf16/attachment.html>
More information about the llvm-bugs
mailing list