Skip to content

Improve packing performance for quantized Int4 per-block

Evie Wright requested to merge qsi4c32p-qsu4c32s1s0-bl8-rowsum into main

Improves performance of ‘kai_rhs_pack_nxk_qsi4c32pnrx8_qsu4c32s1s0_neon’ by vectorizing row summation

Signed-off-by: Evie Wright evie.wright@arm.com

Edited by Evie Wright

Merge request reports

Loading