[all-commits] [llvm/llvm-project] c82f45: [mlir][nvgpu] Simplify TMA IR generation (#87153)

Guray Ozen via All-commits all-commits at lists.llvm.org
Thu Apr 18 00:58:47 PDT 2024


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: c82f45f9deb98aa383b0be37f8edc32f9e48c4ec
      https://github.com/llvm/llvm-project/commit/c82f45f9deb98aa383b0be37f8edc32f9e48c4ec
  Author: Guray Ozen <guray.ozen at gmail.com>
  Date:   2024-04-18 (Thu, 18 Apr 2024)

  Changed paths:
    M mlir/test/Integration/GPU/CUDA/sm90/python/tools/matmulBuilder.py

  Log Message:
  -----------
  [mlir][nvgpu] Simplify TMA IR generation (#87153)

This PR add `TmaDescriptorBuilder` 
- class simplifies TMA generation.
- Makes the code ready to support various Tma configurations 
   - removes strings and use the enums from `mlir.nvgpu.ENUMs`.
- Example "swizzle = swizzle_128b, l2promo=none, oob=zero,
interleave=none" to enums in `mlir.nvgpu` dialects.
- Enums have string equivalent that are used during the IR writing and
generation (see `TmaDescriptorBuilder::tensormap_descriptor_ty`).
- Improves readability and abstracts out TMA descriptor builders in
reusable component.

---------

Co-authored-by: Manish Gupta <manigupta at google.com>



To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications


More information about the All-commits mailing list