aarch64: Optimized implementation of memcmp

System Internals / glibc - Xuelei Zhang [huawei.com] - 19 December 2019 19:31 UTC

The loop body is expanded from a 16-byte comparison to a 64-byte comparison, and the ldp instructions are switched from post-index addressing mode to base-plus-offset addressing mode. As a result, comparisons are around 18% faster overall for sizes above 128 bytes.
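
The loop shape described above can be sketched in C. This is only an illustration of the idea, not the code in sysdeps/aarch64/memcmp.S: the names memcmp_sketch, load64 and cmp_bytes are hypothetical, and the real routine is written in assembly using ldp, which loads two 8-byte registers per instruction. With post-index addressing each load also writes back an updated base register; with base-plus-offset addressing the base stays fixed during the iteration and the pointers are advanced once per 64-byte block, which is what the inner loop below imitates.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: unaligned 8-byte load from base + off.
   Stands in for one half of an ldp with base-plus-offset addressing. */
static inline uint64_t load64(const unsigned char *base, size_t off)
{
    uint64_t v;
    memcpy(&v, base + off, sizeof v);
    return v;
}

/* Hypothetical helper: byte-wise compare, used for the tail and to
   locate the first differing byte inside a mismatching block. */
static int cmp_bytes(const unsigned char *a, const unsigned char *b, size_t n)
{
    for (size_t i = 0; i < n; i++)
        if (a[i] != b[i])
            return a[i] < b[i] ? -1 : 1;
    return 0;
}

/* Sketch of a memcmp whose main loop handles 64 bytes per iteration. */
int memcmp_sketch(const void *s1, const void *s2, size_t n)
{
    const unsigned char *a = s1;
    const unsigned char *b = s2;

    while (n >= 64) {
        uint64_t diff = 0;

        /* Eight 8-byte loads per buffer at fixed offsets from a and b;
           the base pointers are not modified inside the block. */
        for (size_t off = 0; off < 64; off += 8)
            diff |= load64(a, off) ^ load64(b, off);

        if (diff != 0)
            return cmp_bytes(a, b, 64);

        /* Single pointer update per 64-byte block. */
        a += 64;
        b += 64;
        n -= 64;
    }

    /* Fewer than 64 bytes remain. */
    return cmp_bytes(a, b, n);
}
```

The sketch returns -1, 0 or 1, which satisfies the memcmp contract of a negative, zero or positive result. It makes no attempt to reproduce the alignment handling or the early exits of the assembly version.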

Checked on aarch64-linux-gnu.

233efd433d aarch64: Optimized implementation of memcmp
sysdeps/aarch64/memcmp.S | 132 ++++++++++++++++++++++++++++-------------------
1 file changed, 79 insertions(+), 53 deletions(-)

Upstream: sourceware.org

