[Mlir-commits] [mlir] [ml_program] fix bufferizesToMemoryRead for ml_program.global_store (PR #177387)
llvmlistbot at llvm.org
llvmlistbot at llvm.org
Thu Jan 22 08:02:34 PST 2026
llvmbot wrote:
<!--LLVM PR SUMMARY COMMENT-->
@llvm/pr-subscribers-mlir
Author: Nathan Malimban (nmalimban)
<details>
<summary>Changes</summary>
This is a fix for the `BufferizableOpInterface` implementation for `ml_program.global_store`.
`bufferizesToMemoryRead` currently returns false for `GlobalStoreOpInterface`, but I believe it should return true as `ml_program.global_store` needs to read its input buffer to know what value to store to global.
This manifested in a bug where `one-shot-bufferize` would produce MLIR that copies uninitialized data to the global var instead of the intended value to be stored.
For the following MLIR:
```
module {
ml_program.global private mutable @"state_tensor"(dense<0.0> : tensor<4x75xf32>) : tensor<4x75xf32>
func.func @<!-- -->main() -> tensor<4x75xf32> {
%c0 = arith.constant 0 : index
%cst_val = arith.constant 1.0 : f32
%initial_state = ml_program.global_load @"state_tensor" : tensor<4x75xf32>
%val = tensor.extract %initial_state[%c0, %c0] : tensor<4x75xf32>
%next_val = arith.addf %val, %cst_val : f32
%updated_tensor = tensor.insert %next_val into %initial_state[%c0, %c0] : tensor<4x75xf32>
ml_program.global_store @"state_tensor" = %updated_tensor : tensor<4x75xf32>
return %updated_tensor : tensor<4x75xf32>
}
}
```
`one-shot-bufferize` produces this incorrect MLIR
```
module {
memref.global "private" @<!-- -->state_tensor : memref<4x75xf32> = dense<0.000000e+00>
func.func @<!-- -->main() -> tensor<4x75xf32> {
%c0 = arith.constant 0 : index
%cst = arith.constant 1.000000e+00 : f32
%0 = memref.get_global @<!-- -->state_tensor : memref<4x75xf32>
%1 = memref.load %0[%c0, %c0] : memref<4x75xf32>
%2 = arith.addf %1, %cst : f32
%alloc = memref.alloc() {alignment = 64 : i64} : memref<4x75xf32>
memref.copy %0, %alloc : memref<4x75xf32> to memref<4x75xf32>
memref.store %2, %alloc[%c0, %c0] : memref<4x75xf32>
%3 = bufferization.to_tensor %alloc : memref<4x75xf32> to tensor<4x75xf32>
%alloc_0 = memref.alloc() {alignment = 64 : i64} : memref<4x75xf32>
%4 = memref.get_global @<!-- -->state_tensor : memref<4x75xf32>
memref.copy %alloc_0, %4 : memref<4x75xf32> to memref<4x75xf32>
return %3 : tensor<4x75xf32>
}
}
```
Note that `memref.copy` at the end copies an uninitialized `alloc_0` to the global variable.
But after the change we see the following MLIR:
```
module {
memref.global "private" @<!-- -->state_tensor : memref<4x75xf32> = dense<0.000000e+00>
func.func @<!-- -->main() -> tensor<4x75xf32> {
%c0 = arith.constant 0 : index
%cst = arith.constant 1.000000e+00 : f32
%0 = memref.get_global @<!-- -->state_tensor : memref<4x75xf32>
%1 = memref.load %0[%c0, %c0] : memref<4x75xf32>
%2 = arith.addf %1, %cst : f32
%alloc = memref.alloc() {alignment = 64 : i64} : memref<4x75xf32>
memref.copy %0, %alloc : memref<4x75xf32> to memref<4x75xf32>
memref.store %2, %alloc[%c0, %c0] : memref<4x75xf32>
%3 = bufferization.to_tensor %alloc : memref<4x75xf32> to tensor<4x75xf32>
%alloc_0 = memref.alloc() {alignment = 64 : i64} : memref<4x75xf32>
memref.copy %alloc, %alloc_0 : memref<4x75xf32> to memref<4x75xf32>
%4 = memref.get_global @<!-- -->state_tensor : memref<4x75xf32>
memref.copy %alloc_0, %4 : memref<4x75xf32> to memref<4x75xf32>
return %3 : tensor<4x75xf32>
}
}
```
We now see that the relevant data is copied to `alloc_0` before it is stored in global.
---
Full diff: https://github.com/llvm/llvm-project/pull/177387.diff
2 Files Affected:
- (modified) mlir/lib/Dialect/MLProgram/Transforms/BufferizableOpInterfaceImpl.cpp (+1-1)
- (modified) mlir/test/Dialect/MLProgram/one-shot-bufferize.mlir (+31)
``````````diff
The server is unavailable at this time. Please wait a few minutes before you try again.
``````````
</details>
https://github.com/llvm/llvm-project/pull/177387
More information about the Mlir-commits
mailing list