Add SME F16 GEMV kernel targeting FEAT_SME
-
Add SME F16 GEMV micro-kernel.
-
The GEMV micro-kernel uses instructions compatible with FEAT_SME.
-
The GEMV micro-kernel is designed to reuse the same RHS packing functions as the SME F16 GEMM.
This new GEMV micro-kernel is compatible with FEAT_SME but not FEAT_SME2 requirement. By using pairs of FMLALB
and FMLALT
instructions, we can reuse the existing RHS data format of the GEMM operation where kr=2
thus eliminating the need for a specialized packing function for the GEMV operation.
Signed-off-by: Jakub Sujak jakub.sujak@arm.com