[Mlir-commits] [mlir] Extending UniformQuantizedType with interface-based support for new storage types in Quant dialect (PR #152966)
Anurag Singh
llvmlistbot at llvm.org
Thu Aug 21 11:24:34 PDT 2025
anuragsingh-tt wrote:
> we'd no longer choose to embed the scale information as part of the tensor/datatype information
+1. Hardware will extract these separately.
> Define a non-uniform quant type say quant.nonuniform which is a tuple of quant.uniform i.e (quant.uniform, quant.uniform) whose 1st element signify the value and the 2nd element is scale.
What are the issues with instead treating scales and zero points (zps) as SSA operands on the consuming ops in the case of dynamic quantization?
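
To make the question concrete, here is a rough Python sketch (not MLIR, and not anything from the PR) of the dynamic case: the scale and zero point are computed at runtime and handed to the consumer as ordinary values next to the raw integer storage, instead of being baked into the type. The function names (`compute_qparams`, `quantize`, `dequantize`) and the int8 range are assumptions chosen for illustration only.

```python
# Illustrative only: scale/zero point travel as separate runtime values,
# analogous to SSA operands on a consuming op, not as type parameters.

def compute_qparams(values, qmin=-128, qmax=127):
    """Derive an affine (scale, zero_point) pair from the observed range."""
    lo = min(min(values), 0.0)  # keep 0.0 exactly representable
    hi = max(max(values), 0.0)
    if hi == lo:
        return 1.0, 0
    scale = (hi - lo) / (qmax - qmin)
    zero_point = round(qmin - lo / scale)
    return scale, max(qmin, min(qmax, zero_point))


def quantize(values, scale, zero_point, qmin=-128, qmax=127):
    """Producer: emits plain integers; the parameters stay outside the data."""
    return [max(qmin, min(qmax, round(v / scale) + zero_point)) for v in values]


def dequantize(storage, scale, zero_point):
    """Consumer: takes storage plus scale/zero point as explicit arguments."""
    return [(q - zero_point) * scale for q in storage]


if __name__ == "__main__":
    data = [0.0, 1.0, 128.0, 255.0]
    scale, zp = compute_qparams(data)
    storage = quantize(data, scale, zp)
    print(storage, scale, zp)
    print(dequantize(storage, scale, zp))
```

In this shape the storage is just an integer tensor, and a consumer that needs real values receives `scale` and `zp` alongside it.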
https://github.com/llvm/llvm-project/pull/152966