[clang] [llvm] [CUDA] Add a pseudo GPU sm_next which allows overriding for SM/PTX version. (PR #100247)
Artem Belevich via llvm-commits
llvm-commits at lists.llvm.org
Wed Jul 24 10:48:17 PDT 2024
================
@@ -26,24 +27,38 @@ static cl::opt<bool>
NoF16Math("nvptx-no-f16-math", cl::Hidden,
cl::desc("NVPTX Specific: Disable generation of f16 math ops."),
cl::init(false));
+static cl::opt<unsigned>
+ NextSM("nvptx-next-sm", cl::Hidden,
+ cl::desc("NVPTX Specific: Override SM ID for sm_next."),
+ cl::init(90));
----------------
Artem-B wrote:
Supported values live in [llvm/lib/Target/NVPTX/NVPTX.td, ](https://github.com/llvm/llvm-project/blob/cdc193459d90fad83d2eafaccfe03368a9a8a160/llvm/lib/Target/NVPTX/NVPTX.td#L4)
I think setting it to zero, and erroring out if the exp[licit value is not specified when we're targeting `sm_next` would probably be the right thing to do here. Will fix.
https://github.com/llvm/llvm-project/pull/100247
More information about the llvm-commits
mailing list