Add GEMM F32 using SME1 MOPA with block size 2VLx2VL
* Add GEMM F32 kernel using SME1 MOPA with block size 2VLx2VL. * Add tests for the newly added kernel. * Add CI job to run the kernel on FVP with SME1 and without SME2. Signed-off-by:Viet-Hoa Do <viet-hoa.do@arm.com> Reviewed-by:
Viet-Hoa Do <viet-hoa.do@arm.com> Reviewed-by:
Anton Bondarenko <anton.bondarenko@arm.com> Reviewed-by:
Emil Ohlsson <emil.ohlsson@arm.com> Approved-by:
Anton Bondarenko <anton.bondarenko@arm.com>