[Mlir-commits] [mlir] [mlir][amdgpu] wrapper for gfx1250 async global load to lds intrinsics (PR #189279)
llvmlistbot at llvm.org
llvmlistbot at llvm.org
Sun Mar 29 12:37:59 PDT 2026
github-actions[bot] wrote:
<!--LLVM CODE FORMAT COMMENT: {clang-format}-->
:warning: C/C++ code formatter, clang-format found issues in your code. :warning:
<details>
<summary>
You can test this locally with the following command:
</summary>
``````````bash
git-clang-format --diff origin/main HEAD --extensions cpp -- mlir/lib/Conversion/AMDGPUToROCDL/AMDGPUToROCDL.cpp mlir/lib/Dialect/AMDGPU/IR/AMDGPUOps.cpp --diff_from_common_commit
``````````
:warning:
The reproduction instructions above might return results for more than one PR
in a stack if you are using a stacked PR workflow. You can limit the results by
changing `origin/main` to the base branch/commit you want to compare against.
:warning:
</details>
<details>
<summary>
View the diff from clang-format here.
</summary>
``````````diff
diff --git a/mlir/lib/Conversion/AMDGPUToROCDL/AMDGPUToROCDL.cpp b/mlir/lib/Conversion/AMDGPUToROCDL/AMDGPUToROCDL.cpp
index 5150ce0af..a6c928597 100644
--- a/mlir/lib/Conversion/AMDGPUToROCDL/AMDGPUToROCDL.cpp
+++ b/mlir/lib/Conversion/AMDGPUToROCDL/AMDGPUToROCDL.cpp
@@ -2102,12 +2102,12 @@ struct GlobalLoadAsyncToLDSOpLowering
cast<VectorType>(transferType).getElementTypeBitWidth()
: transferType.getIntOrFloatBitWidth();
- Value srcPtr = getStridedElementPtr(rewriter, loc, srcMemRefType,
- adaptor.getSrc(),
- adaptor.getSrcIndices());
- Value dstPtr = getStridedElementPtr(rewriter, loc, dstMemRefType,
- adaptor.getDst(),
- adaptor.getDstIndices());
+ Value srcPtr =
+ getStridedElementPtr(rewriter, loc, srcMemRefType, adaptor.getSrc(),
+ adaptor.getSrcIndices());
+ Value dstPtr =
+ getStridedElementPtr(rewriter, loc, dstMemRefType, adaptor.getDst(),
+ adaptor.getDstIndices());
auto offset = rewriter.getI32IntegerAttr(0);
auto aux = rewriter.getI32IntegerAttr(0);
@@ -4192,9 +4192,8 @@ void mlir::populateAMDGPUToROCDLConversionPatterns(LLVMTypeConverter &converter,
ScaledExtPackedMatrixOpLowering, ScaledExtPackedOpLowering,
PackedScaledTruncOpLowering, PackedTrunc2xFp8OpLowering,
PackedStochRoundFp8OpLowering, GatherToLDSOpLowering,
- GlobalLoadAsyncToLDSOpLowering,
- TransposeLoadOpLowering, AMDGPUPermlaneLowering,
- AMDGPUMakeDmaBaseLowering<MakeDmaBaseOp>,
+ GlobalLoadAsyncToLDSOpLowering, TransposeLoadOpLowering,
+ AMDGPUPermlaneLowering, AMDGPUMakeDmaBaseLowering<MakeDmaBaseOp>,
AMDGPUMakeDmaBaseLowering<MakeGatherDmaBaseOp>,
AMDGPULowerDescriptor<MakeDmaDescriptorOp>,
AMDGPULowerDescriptor<MakeGatherDmaDescriptorOp>,
``````````
</details>
https://github.com/llvm/llvm-project/pull/189279
More information about the Mlir-commits
mailing list