-
Jakub Sujak authored
* Add SME F16 GEMV micro-kernel. * The GEMV micro-kernel uses instructions compatible with FEAT_SME. * The GEMV micro-kernel is designed to reuse the same RHS packing functions as the SME F16 GEMM. This new GEMV micro-kernel is compatible with FEAT_SME but not FEAT_SME2 requirement. By using pairs of `FMLALB` and `FMLALT` instructions, we can reuse the existing RHS data format of the GEMM operation where `kr=2` thus eliminating the need for a specialized packing function for the GEMV operation. Signed-off-by:
Jakub Sujak <jakub.sujak@arm.com> Approved-by:
Felix Johnny Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
a1dacc36
To find the state of this project's repository at the time of any of these versions, check out the tags.
Loading