Commit 1a7a7700 authored by Nikhil Gupta, committed by Anton Bondarenko

Matmul Micro-kernels BF16 <- (QAI8DXP) LHS x (QSI4CXP) RHS



- Matrix multiplication (MxN) Micro-kernels of QAI8DXP LHS and QSI4CXP
  RHS with BF16 output, optimized for FEAT_I8MM.
- Matrix multiplication (1xN) Micro-kernels of QAI8DXP LHS and QSI4CXP
  RHS with BF16 output, optimized for FEAT_DotProd.

Signed-off-by: Nikhil Gupta <nikhil.gupta2@arm.com>
Signed-off-by: Evie Wright <evie.wright@arm.com>

Reviewed-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Reviewed-by: Anton Bondarenko <anton.bondarenko@arm.com>
Reviewed-by: Evie Wright <evie.wright@arm.com>
Reviewed-by: Emil Ohlsson <emil.ohlsson@arm.com>
Approved-by: Anton Bondarenko <anton.bondarenko@arm.com>
parent 7f93c5c5
Pipeline #27814 passed with stages in 9 minutes and 31 seconds