Add SME F16 GEMV kernel targeting FEAT_SME
* Add SME F16 GEMV micro-kernel. * The GEMV micro-kernel uses instructions compatible with FEAT_SME. * The GEMV micro-kernel is designed to reuse the same RHS packing functions as the SME F16 GEMM. This new GEMV micro-kernel is compatible with FEAT_SME but not FEAT_SME2 requirement. By using pairs of `FMLALB` and `FMLALT` instructions, we can reuse the existing RHS data format of the GEMM operation where `kr=2` thus eliminating the need for a specialized packing function for the GEMV operation. Signed-off-by:Jakub Sujak <jakub.sujak@arm.com> Approved-by:
Felix Johnny Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
parent
d071a3a4
Loading
Loading
Pipeline
#27929
passed
with stages
in
9 minutes and 38 seconds
Loading
Please register or sign in to comment