    Matmul Micro-kernels F16<-(QAI8DX) LHS x (QSI4CX) RHS · 315ed95c
    Anitha Raj authored and Viet-Hoa Do committed
    
    
    Micro-kernels that compute the matrix multiplication of a dynamically quantized asymmetric signed 8-bit integer LHS matrix with per-channel quantization (QAI8DX) and a quantized symmetric signed 4-bit integer RHS matrix with per-channel quantization (QSI4CX), accumulating the result into a half-precision floating-point (F16) output:
    
    Matrix multiplication (MxN) micro-kernels of QAI8DX LHS and QSI4CX RHS with F16 output, optimized for FEAT_I8MM and FEAT_DotProd.
    Matrix-vector multiplication (1xN) micro-kernels of QAI8DX LHS and QSI4CX RHS with F16 output, optimized for FEAT_DotProd.
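
    The arithmetic described above can be sketched as a scalar reference in plain Python. This is not the micro-kernel code from the commit: the function names are hypothetical, the LHS is assumed to be quantized per row with a scale and zero point computed at run time, the RHS is assumed to have one scale per output channel, and 4-bit packing and the SIMD data layout are ignored. It only illustrates integer accumulation followed by rescaling to F16.

```python
import struct


def to_f16(x):
    """Round a Python float to the nearest IEEE half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def quantize_lhs_row(row):
    """Dynamic asymmetric int8 quantization of one LHS row (assumed scheme)."""
    lo = min(min(row), 0.0)            # range must include zero
    hi = max(max(row), 0.0)
    scale = (hi - lo) / 255.0 or 1.0   # fall back to 1.0 for an all-zero row
    zero_point = max(-128, min(127, round(-128 - lo / scale)))
    q = [max(-128, min(127, round(v / scale) + zero_point)) for v in row]
    return q, scale, zero_point


def quantize_rhs_channel(col):
    """Symmetric int4 quantization of one RHS output channel (assumed scheme)."""
    amax = max(abs(v) for v in col)
    scale = amax / 7.0 or 1.0          # fall back to 1.0 for an all-zero channel
    q = [max(-8, min(7, round(v / scale))) for v in col]
    return q, scale


def matmul_ref(lhs_rows, rhs_channels):
    """Reference matmul: integer accumulation, then rescale and round to F16.

    lhs_rows:     M rows of K floats.
    rhs_channels: N output channels, each holding K floats.
    """
    quantized_rhs = [quantize_rhs_channel(c) for c in rhs_channels]
    out = []
    for row in lhs_rows:
        q_l, s_l, zp = quantize_lhs_row(row)
        out_row = []
        for q_r, s_r in quantized_rhs:
            # Integer dot product with the LHS zero point removed.
            acc = sum((a - zp) * b for a, b in zip(q_l, q_r))
            out_row.append(to_f16(acc * s_l * s_r))
        out.append(out_row)
    return out
```

    Calling `matmul_ref([[1.0, -1.0]], [[7.0, 0.0]])` returns a single value close to 7.0, with the difference coming from the 8-bit and 4-bit quantization steps and the final rounding to F16.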
    
    Signed-off-by: Anitha Raj <anitha.raj@arm.com>
    
    Signed-off-by: Evie Wright <evie.wright@arm.com>
    
    Approved-by: Viet-Hoa Do <viet-hoa.do@arm.com>