Micro-kernel | Abbreviation | Data type | Reference framework | Notes |
---|---|---|---|---|
Matrix-multiplication with LHS packed and RHS packed matrices | matmul_clamp_f32_qai8dxp_qsi4cxp |
LHS: qai8dxp RHS: qsi4cxp DST: f32 |
TensorFlow Lite |
The packing function for the RHS matrix is available in the `kai_rhs_pack_nxk_qsi4cxp_qsi4cxs1s0.c/.h` files. Since the RHS matrix often contains constant values, we recommend packing the RHS matrix only once and freeing the content of the original RHS matrix. |
Matrix-multiplication with RHS packed | matmul_clamp_f16_f16_f16p |
LHS: f16 RHS: f16p DST: f16 |
TensorFlow Lite |
The packing function for the RHS matrix is available in the `kai_rhs_pack_kxn_f16p16x1biasf16_f16_f16_neon.c/.h` files. Since the RHS matrix often contains constant values, we recommend packing the RHS matrix only once and freeing the content of the original RHS matrix. |
Matrix-multiplication with RHS packed | matmul_clamp_f32_f32_f32p |
DST: f32 LHS: f32 RHS: f32p |
TensorFlow Lite |
The packing function for the RHS matrix is listed in the header file of the GEMM micro kernel. |
Dynamic quantization and LHS matrix packing | kai_lhs_quant_pack_qai8dxp_f32 |
SRC: f32 DST: qai8cx |
TensorFlow Lite |
|