Add SME1 F16 GEMM micro-kernel
Adds an F16 GEMM micro-kernel that uses the SME1 MOPA instruction with a 2VL x 2VL block size. This SME1 kernel is compatible with the existing SME F16 LHS and RHS packing functions.
Signed-off-by: Jakub Sujak <jakub.sujak@arm.com>