Skip to content
Commit cddf991a authored by Jakub Sujak's avatar Jakub Sujak
Browse files

Add micro-kernel to compute FP16 GEMV



* Compute the general matrix-vector (GEMV) multiplication between an FP16 LHS and RHS and accumulate into FP16 output. The RHS packs FP16 weights and biases together.

* Optimized for Arm® Neon™ using MLA instructions.

* Add accompanying tests.

Signed-off-by: Jakub Sujak's avatarJakub Sujak <jakub.sujak@arm.com>

Approved-by: Viet-Hoa Do's avatarViet-Hoa Do <viet-hoa.do@arm.com>
parent 9f746bd5
Loading
Loading
Loading
Pipeline #12659 passed with stages
in 7 minutes
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment