Skip to content

Add SME F16 GEMV kernel targeting FEAT_SME

Jakub Sujak requested to merge jakub/sme1_gemv_f16 into main
  • Add SME F16 GEMV micro-kernel.

  • The GEMV micro-kernel uses instructions compatible with FEAT_SME.

  • The GEMV micro-kernel is designed to reuse the same RHS packing functions as the SME F16 GEMM.

This new GEMV micro-kernel is compatible with FEAT_SME but not FEAT_SME2 requirement. By using pairs of FMLALB and FMLALT instructions, we can reuse the existing RHS data format of the GEMM operation where kr=2 thus eliminating the need for a specialized packing function for the GEMV operation.

Signed-off-by: Jakub Sujak jakub.sujak@arm.com

Merge request reports

Loading