Add micro-kernel to compute FP16 GEMV (1b11b46f) · Commits · Kleidi / KleidiAI

Commit 1b11b46f authored Aug 06, 2024 by Jakub Sujak Committed by Felix Johnny Thomasmathibalan Aug 07, 2024

Add micro-kernel to compute FP16 GEMV



* Compute the general matrix-vector (GEMV) multiplication between an FP16 LHS and RHS and accumulate into FP16 output. The RHS packs FP16 weights and biases together.

* Optimized for Arm® Neon™ using MLA instructions.

* Add accompanying tests.

Signed-off-by: Jakub Sujak <jakub.sujak@arm.com>

Approved-by: Viet-Hoa Do <viet-hoa.do@arm.com>

parent 7d64c8ac

Hide whitespace changes

Inline Side-by-side

Please register or to comment