[all-commits] [llvm/llvm-project] a5925e: [MLIR][NVVM] Fix register mapping in `wgmma.mma_as...
Guray Ozen via All-commits
all-commits at lists.llvm.org
Mon Aug 14 05:09:04 PDT 2023
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: a5925eee5a8cbdf0ee8d77a744c245405987e3b6
https://github.com/llvm/llvm-project/commit/a5925eee5a8cbdf0ee8d77a744c245405987e3b6
Author: Guray Ozen <guray.ozen at gmail.com>
Date: 2023-08-14 (Mon, 14 Aug 2023)
Changed paths:
M mlir/lib/Dialect/LLVMIR/IR/NVVMDialect.cpp
M mlir/test/Conversion/NVVMToLLVM/nvvm-to-llvm.mlir
Log Message:
-----------
[MLIR][NVVM] Fix register mapping in `wgmma.mma_async`
WgmmaMmaAsync Op generates `wgmma.mma_async` PTX instruction that uses the same registers as read and write with mapping. Therefore, the registers count needs to be increased 2 times for the following registers.
This works changes this:
```
llvm.inline_asm has_side_effects asm_dialect = att "{wgmma.mma_async... {$0, $1, $2, $3, $4}, $5, $6, p", "=f,=f,=f,=f,0,1,2,3,l,l"
```
Into this one below. The only different is the number of registers ($8 and $9) that comes after read/write.
```
llvm.inline_asm has_side_effects asm_dialect = att "{wgmma.mma_async... {$0, $1, $2, $3, $4}, $8, $9, p", "=f,=f,=f,=f,0,1,2,3,l,l"
```
Reviewed By: qcolombet
Differential Revision: https://reviews.llvm.org/D157843
More information about the All-commits
mailing list