Matmul int4 micro-kernels for QSI8D32 (LHS) x QSI4C32 (RHS) -> F32 (cf999441) · Commits · Kleidi / KleidiAI

Commit cf999441 authored Aug 16, 2024 by Gian Marco Iodice Committed by Jakub Sujak Aug 16, 2024

Matmul int4 micro-kernels for QSI8D32 (LHS) x QSI4C32 (RHS) -> F32



* The LHS matrix is Quantized (Q) Symmetric (S) Signed 8-bit (I8) with per-block quantization (D32) quantization parameters
* The RHS matrix is Quantized (Q) Symmetric (S) Signed 4-bit (I4) with per-block quantization(C32) F16 scale factors,
* The destination is F32
* Implement micro-kernels to perform the matrix multiplication
* Implement a micro-kernel to pack the LHS and RHS matrices
* Added unit tests

Signed-off-by: Gian Marco <Iodice gianmarco.iodice@arm.com>

Signed-off-by: Anitha <Raj Anitha.Raj@arm.com>

Signed-off-by: Viet-Hoa Do <viet-hoa.do@arm.com>

Reviewed-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Reviewed-by: Felix Johnny Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
Reviewed-by: Jakub Sujak <jakub.sujak@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>

parent a386ec56

Pipeline #12863 passed with stages

in 5 minutes and 34 seconds

Hide whitespace changes

Inline Side-by-side

Please register or to comment