- Jul 04, 2024
-
-
Felix Johnny Thomasmathibalan authored
Signed-off-by: Felix Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
Jakub Sujak authored
Signed-off-by: Jakub Sujak <jakub.sujak@arm.com>
Approved-by: Felix Johnny Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
-
Signed-off-by: Jakub Sujak <jakub.sujak@arm.com>
Approved-by: Felix Johnny Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
-
Jakub Sujak authored
Signed-off-by: Jakub Sujak <jakub.sujak@arm.com>
Reviewed-by: Felix Johnny Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
Approved-by: Felix Johnny Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
-
Jakub Sujak authored
This example demonstrates the usage of the FP16 packing and matmul routines, which:
1. Pack the bias and the weights together into a single tensor.
2. Perform a matrix multiplication of the activations and the packed tensor.
All tensors use the half-precision floating-point (FP16) data type.
Signed-off-by: Jakub Sujak <jakub.sujak@arm.com>
Reviewed-by: Jakub Sujak <jakub.sujak@arm.com>
Approved-by: Felix Johnny Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
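The two steps in the commit message above (pack bias and weights, then multiply) can be sketched as follows. This is a minimal illustration, not the library's routines: the function names and the packed layout (one bias value followed by the K weights of each output column) are assumptions, and `float` stands in for FP16 to keep the sketch portable.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical packed layout, one record per output column:
 *   [bias[col], w[0][col], w[1][col], ..., w[k-1][col]]
 * `weights` is a k x n row-major matrix; `packed` holds n * (k + 1) values. */
static void example_pack_bias_weights(size_t k, size_t n, const float* weights,
                                      const float* bias, float* packed) {
    assert(weights != NULL && bias != NULL && packed != NULL);
    for (size_t col = 0; col < n; ++col) {
        float* dst = packed + col * (k + 1);
        dst[0] = bias[col];
        for (size_t row = 0; row < k; ++row) {
            dst[1 + row] = weights[row * n + col];
        }
    }
}

/* dst (m x n) = lhs (m x k) * weights + bias, reading only the packed tensor.
 * Each accumulator starts from the bias stored at the head of its record. */
static void example_matmul_packed(size_t m, size_t n, size_t k, const float* lhs,
                                  const float* packed, float* dst) {
    for (size_t row = 0; row < m; ++row) {
        for (size_t col = 0; col < n; ++col) {
            const float* src = packed + col * (k + 1);
            float acc = src[0];
            for (size_t i = 0; i < k; ++i) {
                acc += lhs[row * k + i] * src[1 + i];
            }
            dst[row * n + col] = acc;
        }
    }
}
```

Keeping the bias next to the weights means the matmul loop reads a single contiguous buffer per output column, which is the usual motivation for this kind of packing.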
-
- Jul 03, 2024
-
-
Anton Bondarenko authored
Signed-off-by: Anton Bondarenko <anton.bondarenko@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
- Jul 02, 2024
-
-
- Adjust the int4 matmul micro-kernels (qsi4cx) to skip the bias
Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Signed-off-by: Felix Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
Signed-off-by: Jakub Sujak <jakub.sujak@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
Signed-off-by: Jens Elofsson <jens.elofsson@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
Approved-by: Anton Bondarenko <anton.bondarenko@arm.com>
-
- Jul 01, 2024
-
-
Jens Elofsson authored
Add file and line information to the KAI_ERROR macro.
Signed-off-by: Jens Elofsson <jens.elofsson@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
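An error macro that reports file and line information is commonly built on the standard `__FILE__` and `__LINE__` predefined macros. The sketch below shows that general technique only; `EXAMPLE_ERROR` and `example_format_error` are hypothetical names, and the actual definition of `KAI_ERROR` may differ.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper: writes "<file>:<line>: <msg>" into buf. */
static int example_format_error(char* buf, size_t size, const char* file,
                                int line, const char* msg) {
    assert(buf != NULL && size > 0);
    return snprintf(buf, size, "%s:%d: %s", file, line, msg);
}

/* Hypothetical macro: because it is a macro, __FILE__ and __LINE__ expand
 * at the call site, so the report points at the caller, not at the helper. */
#define EXAMPLE_ERROR(msg)                                        \
    do {                                                          \
        char example_buf[256];                                    \
        example_format_error(example_buf, sizeof(example_buf),    \
                             __FILE__, __LINE__, (msg));          \
        fprintf(stderr, "%s\n", example_buf);                     \
        abort();                                                  \
    } while (0)
```

A call such as `EXAMPLE_ERROR("unsupported block size");` would print the location of that call and then abort.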
-
Anton Bondarenko authored
* Prepare and upload JUnit report for tests execution
* Simplify pre-commit job
* Replace dependencies with needs to unblock jobs
Signed-off-by: Anton Bondarenko <anton.bondarenko@arm.com>
Reviewed-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
Anton Bondarenko authored
The new hook automatically downloads the specified version of the Buildifier tool instead of relying on a local installation.
Signed-off-by: Anton Bondarenko <anton.bondarenko@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
- Jun 28, 2024
-
-
Jakub Sujak authored
* Add compiler version checks in CMake before adding feature-specific (e.g. FEAT_DOTPROD) micro-kernels.
* Enable compiling scalar routines using MSVC.
Signed-off-by: Felix Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
Signed-off-by: Jakub Sujak <jakub.sujak@arm.com>
Approved-by: Jens Elofsson <jens.elofsson@arm.com>
-
- Jun 21, 2024
-
-
Felix Johnny Thomasmathibalan authored
The _dst_lhs_rhs_ naming is used instead of hybrid for the test function name to follow the format used in the main repository.
Signed-off-by: Felix Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
- Jun 20, 2024
-
-
Viet-Hoa Do authored
* Add SME2 GeMM kernels in FP32.
  - Kernels: LHS packing, RHS packing, interleaved GeMM.
* Update the build script and the test framework.
* Fix issues in the API.
Signed-off-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Reviewed-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Reviewed-by: Jakub Sujak <jakub.sujak@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
Anton Bondarenko authored
Use Bazelisk to perform Bazel builds instead of relying directly on a specific Bazel version. At the moment the latest LTS is chosen, but this could later be pinned to a specific version if required.
Signed-off-by: Anton Bondarenko <anton.bondarenko@arm.com>
Reviewed-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
Viet-Hoa Do authored
* Add more checks in the test to verify all utility functions provided by GeMM kernels.
Signed-off-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
Jens Elofsson authored
"Hello World"-style example of the microbenchmark tool that pipecleans the cmake-build. Signed-off-by:
Jens Elofsson <jens.elofsson@arm.com>
Reviewed-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Reviewed-by: Felix Johnny Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
Approved-by: Anton Bondarenko <anton.bondarenko@arm.com>
-
Anton Bondarenko authored
Signed-off-by: Anton Bondarenko <anton.bondarenko@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
Viet-Hoa Do authored
* Reorder the arguments of the function that gets the RHS offset.
* Regenerate the assembly kernel.
Signed-off-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Reviewed-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
Viet-Hoa Do authored
Signed-off-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Approved-by: Anton Bondarenko <anton.bondarenko@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
- Jun 19, 2024
-
-
Viet-Hoa Do authored
* Run full Linux in FVP to test features without silicon.
* Collect coverage report from multiple runs.
Signed-off-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Reviewed-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
- Jun 18, 2024
-
-
Viet-Hoa Do authored
* Update the base image of the Docker image to Ubuntu 24.04, the latest LTS version of Ubuntu.
Signed-off-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Approved-by: Felix Johnny Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
-
Viet-Hoa Do authored
Signed-off-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
- Jun 13, 2024
-
-
Viet-Hoa Do authored
Signed-off-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
Viet-Hoa Do authored
* Add code coverage build and test.
* Add coverage report and visualization.
Signed-off-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Reviewed-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
- Jun 07, 2024
-
-
Felix Johnny Thomasmathibalan authored
Signed-off-by: Felix Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
Felix Johnny Thomasmathibalan authored
stddef.h is included for size_t.
Signed-off-by: Felix Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
- Add the free memory section
- Fix the input arguments for the matmul micro-kernel
Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Approved-by: Felix Johnny Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
-
- Jun 05, 2024
-
-
Viet-Hoa Do authored
* Explicitly build NEON-only files with the base architecture.
* Remove redundant compiler flags.
Signed-off-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Reviewed-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Reviewed-by: Jakub Sujak <jakub.sujak@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
- Move lhs_stride position in the list of input arguments
- Add dst_stride_col in the list of input arguments
- Add the interface header file
Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Approved-by: Viet-Hoa Do <viet-hoa.do@arm.com>
-
- Jun 03, 2024
-
-
- Add a guide explaining how to invoke the int4 matmul micro-kernel
Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Reviewed-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Approved-by: Viet-Hoa Do <viet-hoa.do@arm.com>
-
Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Approved-by: Viet-Hoa Do <viet-hoa.do@arm.com>
-
- May 31, 2024
-
-
Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Approved-by: Viet-Hoa Do <viet-hoa.do@arm.com>
-
- May 30, 2024
-
-
Viet-Hoa Do authored
* The library is moved into the kai directory (previously src).
* All microkernels are in the kai/ukernels directory.
* Packing kernels for matmul will be in the kai/ukernels/matmul/pack directory.
Signed-off-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Approved-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
-
Viet-Hoa Do authored
* Add GeMM hybrid kernels.
  - Block size: 6x16.
  - Data type: FP16.
* Update test framework and add tests for new kernels.
Signed-off-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Reviewed-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Reviewed-by: Felix Johnny Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
Approved-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
-
- May 29, 2024
-
-
- Make the RHS packing function access only in-bound elements when N is not a multiple of nr
- Make the LHS packing function access only in-bound elements when K is not a multiple of kr * sr
Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Reviewed-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Approved-by: Viet-Hoa Do <viet-hoa.do@arm.com>
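One common way to keep a blocked packing routine in bounds when N is not a multiple of the block width is to guard every source read and fill the tail of the last block with zeros. The sketch below illustrates that idea for the RHS case only; the function name and the packed layout are assumptions rather than the library's actual format, and `float` is used for simplicity.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical layout: the k x n row-major matrix `rhs` is split into
 * ceil(n / nr) column blocks; each block stores k rows of nr values.
 * `packed` must hold ceil(n / nr) * k * nr values. */
static void example_pack_rhs_blocks(size_t k, size_t n, size_t nr,
                                    const float* rhs, float* packed) {
    assert(nr > 0 && rhs != NULL && packed != NULL);
    const size_t num_blocks = (n + nr - 1) / nr; /* round up */
    for (size_t block = 0; block < num_blocks; ++block) {
        for (size_t row = 0; row < k; ++row) {
            for (size_t j = 0; j < nr; ++j) {
                const size_t col = block * nr + j;
                /* Guarded read: columns past n are never dereferenced;
                 * the padding slot is filled with zero instead. */
                packed[(block * k + row) * nr + j] =
                    (col < n) ? rhs[row * n + col] : 0.0f;
            }
        }
    }
}
```

Zero padding leaves the matmul result for the valid columns unchanged, since the padded columns only contribute to outputs that are discarded.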
-
- May 23, 2024
-
-
- The LHS matrix is quantized (Q) Asymmetric (A) 8-bit (8) with per-row (DX) quantization parameters
- The RHS matrix is quantized (Q) Symmetric (S) 4-bit (4) with per-channel (cx) quantization parameters
- The destination is F32
- Implement matmul int4 micro-kernels with intrinsics by using the dotprod and i8mm extensions
- Implement a micro-kernel to pack the RHS matrix
- Implement two micro-kernels to dynamically quantize and pack the LHS matrix
- Add README.md
- No tests are added in this PR. Tests will be added in a separate PR
Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Reviewed-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Reviewed-by: Jakub Sujak <jakub.sujak@arm.com>
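The two quantization schemes named above can be illustrated with a scalar sketch. It is not the micro-kernel code from this commit: the function names, the rounding helper, and the choice to keep each 4-bit value in its own `int8_t` (instead of packing values into bytes) are all assumptions made for readability.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Round to nearest, ties away from zero, without depending on libm. */
static int32_t example_round(float x) {
    return (int32_t)(x >= 0.0f ? x + 0.5f : x - 0.5f);
}

static int32_t example_clamp(int32_t v, int32_t lo, int32_t hi) {
    return v < lo ? lo : (v > hi ? hi : v);
}

/* Asymmetric 8-bit with one (scale, zero_point) pair per row:
 *   real_value ~= scale * (q - zero_point), q in [-128, 127]. */
static void example_quantize_row_asym8(const float* src, size_t len, int8_t* dst,
                                       float* scale, int32_t* zero_point) {
    assert(len > 0);
    float lo = 0.0f; /* the range always includes 0 so that 0 is exact */
    float hi = 0.0f;
    for (size_t i = 0; i < len; ++i) {
        if (src[i] < lo) lo = src[i];
        if (src[i] > hi) hi = src[i];
    }
    *scale = (hi > lo) ? (hi - lo) / 255.0f : 1.0f;
    *zero_point = example_clamp(example_round(-128.0f - lo / *scale), -128, 127);
    for (size_t i = 0; i < len; ++i) {
        const int32_t q = example_round(src[i] / *scale) + *zero_point;
        dst[i] = (int8_t)example_clamp(q, -128, 127);
    }
}

/* Symmetric 4-bit with one scale per output channel and no zero point:
 *   real_value ~= scale * q, q in [-8, 7]. */
static void example_quantize_channel_sym4(const float* src, size_t len,
                                          int8_t* dst, float* scale) {
    assert(len > 0);
    float max_abs = 0.0f;
    for (size_t i = 0; i < len; ++i) {
        const float a = src[i] < 0.0f ? -src[i] : src[i];
        if (a > max_abs) max_abs = a;
    }
    *scale = (max_abs > 0.0f) ? max_abs / 7.0f : 1.0f;
    for (size_t i = 0; i < len; ++i) {
        dst[i] = (int8_t)example_clamp(example_round(src[i] / *scale), -8, 7);
    }
}
```

The asymmetric form spends its 256 levels on the actual range of each row, which suits activations quantized at run time, while the symmetric form needs no zero point and so keeps the weight side of the dot product cheaper.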
-
Viet-Hoa Do authored
* Also remove trailing whitespace.
Signed-off-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Approved-by: Felix Johnny Thomasmathibalan <felixjohnny.thomasmathibalan@arm.com>
-
- May 10, 2024
-
-
Viet-Hoa Do authored
Signed-off-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Approved-by: Jakub Sujak <jakub.sujak@arm.com>
-
- May 03, 2024
-
-
Jakub Sujak authored
Signed-off-by: Jakub Sujak <jakub.sujak@arm.com>
Approved-by: Viet-Hoa Do <viet-hoa.do@arm.com>
-