CHANGELOG.md · a0afd5e1e085ec755694031a670b2d9223279601 · Kleidi / KleidiAI · GitLab

Matmul Micro-kernel(1xN) F32/F16 <- (QSI8D32) LHS x (QAI4C32) RHS · a0afd5e1

Anitha Raj authored Jun 24, 2025 and

Anton Bondarenko committed Jun 24, 2025



* Matrix multiplication (1xN) micro-kernels to compute the matrix multiplication of dynamically quantized symmetric signed 8-bit integer with per-block quantization (QSI8D32) LHS matrix and quantized asymmetric 4-bit signed integer with per-block quantization (QAI4C32) RHS matrix and the accumulation of the result into a single-precision (F32) and half-precision (F16) output, optimized for FEAT_DotProd and packing parameter kr = 8.

Signed-off-by: Anitha Raj <anitha.raj@arm.com>

Approved-by: Anton Bondarenko <anton.bondarenko@arm.com>

a0afd5e1

To find the state of this project's repository at the time of any of these versions, check out the tags.