[clang] [CUDA] Add support for sm101 and sm120 target architectures (PR #127187)

Artem Belevich via cfe-commits cfe-commits at lists.llvm.org
Tue Feb 18 11:26:04 PST 2025


================
@@ -300,6 +306,10 @@ void NVPTXTargetInfo::getTargetDefines(const LangOptions &Opts,
       Builder.defineMacro("__CUDA_ARCH_FEAT_SM90_ALL", "1");
     if (GPU == OffloadArch::SM_100a)
       Builder.defineMacro("__CUDA_ARCH_FEAT_SM100_ALL", "1");
+    if (GPU == OffloadArch::SM_101a)
+      Builder.defineMacro("__CUDA_ARCH_FEAT_SM101_ALL", "1");
+    if (GPU == OffloadArch::SM_120a)
+      Builder.defineMacro("__CUDA_ARCH_FEAT_SM120_ALL", "1");
----------------
Artem-B wrote:

Right. We need to trim the last digit. A slightly shorter way to do it would be `StringRef(CUDAArchCode).drop_back()`.
Actually, we can change the return type of the lambda to `StringRef`, which simplifies it to just `CUDAArchCode.drop_back()`.
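
To illustrate the trimming, here is a rough stand-alone sketch. It uses `std::string_view` as a stand-in for `llvm::StringRef` so that it compiles without LLVM headers. `dropBack` and `featureMacroName` are hypothetical names, not code from the patch, and the arch-code strings (e.g. `"1010"` for sm_101a) are an assumption about what the lambda returns.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <string_view>

// Stand-in for llvm::StringRef::drop_back(N = 1): returns a view of S
// without its last N characters. Like the LLVM method, it asserts that
// there are at least N characters to drop.
inline std::string_view dropBack(std::string_view S, std::size_t N = 1) {
  assert(S.size() >= N && "dropping more characters than exist");
  S.remove_suffix(N);
  return S;
}

// Hypothetical helper (not in the patch): derives the feature macro name
// from an arch code string by trimming its last digit.
// Assumption: the arch code looks like "1010" and the macro wants "101".
inline std::string featureMacroName(std::string_view CUDAArchCode) {
  std::string Name = "__CUDA_ARCH_FEAT_SM";
  Name += dropBack(CUDAArchCode);
  Name += "_ALL";
  return Name;
}
```

Under those assumptions, `featureMacroName("1010")` yields `__CUDA_ARCH_FEAT_SM101_ALL`, so one expression could cover the per-architecture `if` chain in the hunk above. With the lambda returning `StringRef`, the in-tree version would call `CUDAArchCode.drop_back()` directly.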



https://github.com/llvm/llvm-project/pull/127187

