Skip to content
Commit 392f2b4f authored by Gian Marco Iodice's avatar Gian Marco Iodice
Browse files

Refactor RHS packing function for F32 <- QAI8DXP x QSU4C32



- Rename the packing function to include the the bf16 scale factor
- Optimize the scalar variant. The new implementation is ~1.5x faster
than the previous one

Signed-off-by: Gian Marco Iodice's avatarGian Marco Iodice <gianmarco.iodice@arm.com>
parent 180098df
Loading
Loading
Loading
Pipeline #17282 failed with stages
in 57 seconds
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment