[Mlir-commits] [mlir] [AMDGPU] [MLIR] Add 96 and 128 bit GatherToLDS for gfx950 (PR #147496)
Daniel Hernandez-Juarez
llvmlistbot at llvm.org
Tue Jul 8 07:48:12 PDT 2025
================
@@ -1198,14 +1198,19 @@ struct GatherToLDSOpLowering : public ConvertOpToLLVMPattern<GatherToLDSOp> {
Type transferType = op.getTransferType();
size_t loadWidth = [&]() -> size_t {
if (auto transferVectorType = dyn_cast<VectorType>(transferType)) {
- return transferVectorType.getNumElements() *
- (transferVectorType.getElementTypeBitWidth() / 8);
+ return (transferVectorType.getNumElements() *
+ transferVectorType.getElementTypeBitWidth()) /
+ 8;
}
return transferType.getIntOrFloatBitWidth() / 8;
}();
- // Currently only 1, 2, and 4 byte loads are supported.
- if (loadWidth != 1 && loadWidth != 2 && loadWidth != 4)
+ // Currently only 1, 2, 4, 12 and 16 byte loads are supported.
+ if (loadWidth != 1 && loadWidth != 2 && loadWidth != 4 && loadWidth != 12 &&
----------------
dhernandez0 wrote:
done
https://github.com/llvm/llvm-project/pull/147496
More information about the Mlir-commits
mailing list