swscale: aarch64: Add a NEON implementation of interleaveBytes

Multimedia / FFmpeg - Martin Storsjö [martin.st] - 15 May 2020 20:38 EDT

This allows speeding up format conversions from yuv420 to nv12.

Cortex                             A53      A72      A73
interleave_bytes_c:            86077.5  51433.0  66972.0
interleave_bytes_neon:         19701.7  23019.2  15859.2
interleave_bytes_aligned_c:    86603.0  52017.2  67484.2
interleave_bytes_aligned_neon:  9061.0   7623.0   6309.0
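
For readers unfamiliar with the operation, the following is a minimal scalar sketch of what a byte-interleaving routine does when packing the separate U and V planes of yuv420 into the single UV plane of nv12. The function name interleave_bytes_ref and its parameter names are hypothetical and chosen for illustration; this is not the code from the commit, only a plain C statement of the per-byte behaviour that a NEON version would vectorize.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical scalar reference (not the FFmpeg source): interleave two
 * byte planes, e.g. U and V, into one plane, e.g. the UV plane of nv12.
 * Each output row holds 2 * width bytes: src1[0], src2[0], src1[1], ... */
static void interleave_bytes_ref(const uint8_t *src1, const uint8_t *src2,
                                 uint8_t *dst, int width, int height,
                                 ptrdiff_t src1_stride, ptrdiff_t src2_stride,
                                 ptrdiff_t dst_stride)
{
    assert(width >= 0 && height >= 0);

    for (int y = 0; y < height; y++) {
        for (int x = 0; x < width; x++) {
            dst[2 * x]     = src1[x]; /* even output bytes come from src1 */
            dst[2 * x + 1] = src2[x]; /* odd output bytes come from src2 */
        }
        /* Strides are in bytes and may exceed the visible row width. */
        src1 += src1_stride;
        src2 += src2_stride;
        dst  += dst_stride;
    }
}
```

In this sketch the inner loop moves one byte pair per iteration; a SIMD implementation would handle many pairs per instruction, which is where a speedup of the kind measured above would be expected to come from.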

e0604d508e swscale: aarch64: Add a NEON implementation of interleaveBytes
libswscale/aarch64/Makefile | 4 +-
libswscale/aarch64/rgb2rgb.c | 41 ++++++++++++++++++++
libswscale/aarch64/rgb2rgb_neon.S | 79 +++++++++++++++++++++++++++++++++++++++
libswscale/rgb2rgb.c | 2 +
libswscale/rgb2rgb.h | 1 +
5 files changed, 126 insertions(+), 1 deletion(-)
