Optimize kai_rhs_pack_nxk_qsi4c32p_qsu4c32s1s0 using advanced SIMD (05ef512e) · Commits · Kleidi / KleidiAI

Commit 05ef512e authored Jun 05, 2025 by Evie Wright Committed by Anton Bondarenko Jun 05, 2025

Optimize kai_rhs_pack_nxk_qsi4c32p_qsu4c32s1s0 using advanced SIMD



Optimize the transposed RHS packing function for matmul_clamp_f32_qai8dxp_qsi4c32p using advanced SIMD, for kr / sr = 8

Signed-off-by: Evie Wright <evie.wright@arm.com>

Signed-off-by: Anitha Raj <anitha.raj@arm.com>

Reviewed-by: Anitha Raj <anitha.raj@arm.com>
Reviewed-by: Anton Bondarenko <anton.bondarenko@arm.com>
Reviewed-by: Evie Wright <evie.wright@arm.com>
Approved-by: Anton Bondarenko <anton.bondarenko@arm.com>

parent 09771a98

Pipeline #27044 passed with stages

in 9 minutes and 15 seconds

Hide whitespace changes

Inline Side-by-side

Please register or to comment