Skip to content

Add SME1 F16 GEMM micro-kernel

Jakub Sujak requested to merge jakub/sme1_f16_gemm into main

Adds F16 GEMM micro-kernel using SME1 MOPA instruction and 2VL x 2VL block size. This SME1 kernel is compatible with existing SME F16 LHS and RHS packing functions.

Signed-off-by: Jakub Sujak jakub.sujak@arm.com

Merge request reports

Loading