Pipelines · Matmul Micro-kernels F16<-(QAI8DX) LHS x (QSI4CX) RHS (315ed95c) · Commits · Kleidi / KleidiAI

Commit 315ed95c authored Apr 11, 2025 by Anitha Raj Committed by Viet-Hoa Do Apr 11, 2025

Matmul Micro-kernels F16<-(QAI8DX) LHS x (QSI4CX) RHS

Micro-kernels to compute the matrix multiplication of dynamically quantized asymmetric signed 8-bit integer with per-channel quantization (QAI8DX) LHS matrix and quantized symmetric 4-bit signed integer with per-channel quantization (QSI4CX) RHS matrix and the accumulation of the result into a half-precision (F16):

Matrix multiplication (MxN) Micro-kernels of QAI8DX LHS and QSI4CX RHS with F16 output, optimized for FEAT_I8MM and FEAT_DotProd.
Matrix multiplication (1xN) Micro-kernels of QAI8DX LHS and QSI4CX RHS with F16 output, optimized for FEAT_DotProd.

Signed-off-by: Anitha Raj <anitha.raj@arm.com>

Signed-off-by: Evie Wright <evie.wright@arm.com>

Approved-by: Viet-Hoa Do <viet-hoa.do@arm.com>

parent 8ddbdc0a

Pipeline #25476 passed with stages

in 6 minutes and 39 seconds