Implement NEON 4-buffer SNOW3G UEA2 by assembly
Following changes have be done in this patch. 1. implement 4-buffer snow3g uea2 by assembley code. Compared with 4-buffer NEON intrinsic implementation, the throughput improvement on different micro-archetecutre is listed below. Neoverse V1 : x1.13 @ 1024 buffer-sz We use DPDK command to calculate performance improvement. sudo ./build/app/dpdk-test-crypto-perf -c 0x3 --no-huge --vdev crypto_snow3g -- --devtype crypto_snow3g --ptest throughput --pool-sz 16384 --total-ops 1000000 --burst-sz 8 --optype cipher-only --cipher-algo snow3g-uea2 --cipher-iv-sz 16 --auth-op generate --cipher-key-sz 16 --buffer-sz 64,128,256,512,1024 --cipher-op encrypt Change-Id: I9146fb17fdc8ef717301fa2c51df413c16a14399
Loading
Please register or sign in to comment