Matmul Micro-kernels BF16 <- (QAI8DXP) LHS x (QSI4CXP) RHS
- Matrix multiplication (MxN) Micro-kernels of QAI8DXP LHS and QSI4CXP RHS with BF16 output, optimized for FEAT_I8MM. - Matrix multiplication (1xN) Micro-kernels of QAI8DXP LHS and QSI4CXP RHS with BF16 output, optimized for FEAT_DotProd. Signed-off-by:Nikhil Gupta <nikhil.gupta2@arm.com> Signed-off-by:
Evie Wright <evie.wright@arm.com> Reviewed-by:
Viet-Hoa Do <viet-hoa.do@arm.com> Reviewed-by:
Anton Bondarenko <anton.bondarenko@arm.com> Reviewed-by:
Evie Wright <evie.wright@arm.com> Reviewed-by:
Emil Ohlsson <emil.ohlsson@arm.com> Approved-by:
Anton Bondarenko <anton.bondarenko@arm.com>
parent
7f93c5c5
Loading
Loading
Pipeline
#27814
passed
with stages
in
9 minutes and 31 seconds
Loading
Please register or sign in to comment