[all-commits] [llvm/llvm-project] c82f45: [mlir][nvgpu] Simplify TMA IR generation (#87153)
Guray Ozen via All-commits
all-commits at lists.llvm.org
Thu Apr 18 00:58:47 PDT 2024
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: c82f45f9deb98aa383b0be37f8edc32f9e48c4ec
https://github.com/llvm/llvm-project/commit/c82f45f9deb98aa383b0be37f8edc32f9e48c4ec
Author: Guray Ozen <guray.ozen at gmail.com>
Date: 2024-04-18 (Thu, 18 Apr 2024)
Changed paths:
M mlir/test/Integration/GPU/CUDA/sm90/python/tools/matmulBuilder.py
Log Message:
-----------
[mlir][nvgpu] Simplify TMA IR generation (#87153)
This PR add `TmaDescriptorBuilder`
- class simplifies TMA generation.
- Makes the code ready to support various Tma configurations
- removes strings and use the enums from `mlir.nvgpu.ENUMs`.
- Example "swizzle = swizzle_128b, l2promo=none, oob=zero,
interleave=none" to enums in `mlir.nvgpu` dialects.
- Enums have string equivalent that are used during the IR writing and
generation (see `TmaDescriptorBuilder::tensormap_descriptor_ty`).
- Improves readability and abstracts out TMA descriptor builders in
reusable component.
---------
Co-authored-by: Manish Gupta <manigupta at google.com>
To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications
More information about the All-commits
mailing list