Optimize scalar RHS packing function NxK F32 <- QAI8DXP x QSU4C32
- Optimize the generic RHS packing NxK. The performance improvement is around ~1.5x Signed-off-by:Gian Marco Iodice <gianmarco.iodice@arm.com> Reviewed-by:
Anton Bondarenko <anton.bondarenko@arm.com> Approved-by:
Anton Bondarenko <anton.bondarenko@arm.com>
parent
504b1754
Loading
Loading
Pipeline
#17573
passed
with stages
in
3 minutes and 22 seconds
Loading
Please register or sign in to comment