[all-commits] [llvm/llvm-project] 0cbcf1: [llvm][nvptx] Add sm_90a
Guray Ozen via All-commits
all-commits at lists.llvm.org
Mon Jul 24 06:12:09 PDT 2023
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: 0cbcf17cb8be4a2c6f1b55e36cba31070c674045
https://github.com/llvm/llvm-project/commit/0cbcf17cb8be4a2c6f1b55e36cba31070c674045
Author: Guray Ozen <guray.ozen at gmail.com>
Date: 2023-07-24 (Mon, 24 Jul 2023)
Changed paths:
M llvm/lib/Target/NVPTX/NVPTX.td
Log Message:
-----------
[llvm][nvptx] Add sm_90a
This change adds `sm_90a` as an NVPTX target. `sm_90a` is required to generate `wgmma` and `setmaxnreg` instructions.
Here is what the PTX documentation says about the "a" suffix:
Target architectures with suffix “a”, such as sm_90a, include architecture-accelerated features that are supported on the specified architecture only, hence such targets do not follow the onion layer model. Therefore, PTX code generated for such targets cannot be run on later generation devices. Architecture-accelerated features can only be used with targets that support these features.
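The compatibility rule quoted above can be sketched as a small check. This is only an illustrative model, not code from the commit or from any NVIDIA or LLVM tool: the helper names `parse_target` and `ptx_can_run` are hypothetical, and the rule is simplified to "plain `sm_XY` targets run on the same or a later architecture, while `a`-suffixed targets run only on exactly that architecture".

```python
import re


def parse_target(name):
    """Split a target name such as 'sm_90a' into (version, arch_specific).

    Hypothetical helper for illustration; only the plain and 'a'-suffixed
    forms mentioned in the commit message are handled.
    """
    m = re.fullmatch(r"sm_(\d+)(a?)", name)
    if m is None:
        raise ValueError(f"unrecognized target: {name!r}")
    return int(m.group(1)), m.group(2) == "a"


def ptx_can_run(target, device_sm):
    """Return True if PTX built for `target` may run on a device of
    compute capability `device_sm` (e.g. 90), under the simplified model.
    """
    version, arch_specific = parse_target(target)
    if arch_specific:
        # 'a' targets leave the onion layer model: exact match only.
        return device_sm == version
    # Plain targets follow the onion layer model: same or later device.
    return device_sm >= version
```

Under this model, `ptx_can_run("sm_90", 100)` is true while `ptx_can_run("sm_90a", 100)` is false, which is the trade-off the `a` suffix makes to expose instructions such as `wgmma`.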
Reviewed By: tra
Differential Revision: https://reviews.llvm.org/D155851