[clang] [llvm] [CUDA] Add support for CUDA-12.3 and sm_90a (PR #74895)

Joseph Huber via cfe-commits cfe-commits at lists.llvm.org
Fri Dec 8 15:32:31 PST 2023


================
@@ -80,8 +85,10 @@ class NVPTXSubtarget : public NVPTXGenSubtargetInfo {
   bool allowFP16Math() const;
   bool hasMaskOperator() const { return PTXVersion >= 71; }
   bool hasNoReturn() const { return SmVersion >= 30 && PTXVersion >= 64; }
-  unsigned int getSmVersion() const { return SmVersion; }
+  unsigned int getSmVersion() const { return FullSmVersion / 10; }
+  unsigned int getFullSmVersion() const { return FullSmVersion; }
   std::string getTargetName() const { return TargetName; }
+  bool isSm90a() const { return getFullSmVersion() == 901; }
----------------
jhuber6 wrote:

Could we expose this more like `getSmVersion` and `getSmFeature`? Has CUDA even documented how they intend to further build on this?

https://github.com/llvm/llvm-project/pull/74895


More information about the cfe-commits mailing list