Vector cosf for x86_64.

System Internals / glibc - Andrew Senkevich [intel.com] - 9 June 2015 10:29 UTC

Here is implementation of vectorized cosf containing SSE, AVX, AVX2 and AVX512 versions according to Vector ABI .

- sysdeps/x86_64/fpu/Makefile (libmvec-support): Added new files.
- sysdeps/x86_64/fpu/Versions: New versions added.
- sysdeps/x86_64/fpu/svml_s_cosf4_core.S: New file.
- sysdeps/x86_64/fpu/multiarch/svml_s_cosf4_core.S: New file.
- sysdeps/x86_64/fpu/multiarch/svml_s_cosf4_core_sse4.S: New file.
- sysdeps/x86_64/fpu/svml_s_cosf8_core_avx.S: New file.
- sysdeps/x86_64/fpu/svml_s_cosf8_core.S: New file.
- sysdeps/x86_64/fpu/multiarch/svml_s_cosf8_core.S: New file.
- sysdeps/x86_64/fpu/multiarch/svml_s_cosf8_core_avx2.S: New file.
- sysdeps/x86_64/fpu/svml_s_cosf16_core.S: New file.
- sysdeps/x86_64/fpu/multiarch/svml_s_cosf16_core.S: New file.
- sysdeps/x86_64/fpu/multiarch/svml_s_cosf16_core_avx512.S: New file.
- sysdeps/x86_64/fpu/svml_s_wrapper_impl.h: New file.
- sysdeps/x86_64/fpu/svml_s_cosf_data.S: New file.
- sysdeps/x86_64/fpu/svml_s_cosf_data.h: New file.
- sysdeps/x86_64/fpu/multiarch/Makefile (libmvec-sysdep_routines): Added build of SSE, AVX2 and AVX512 IFUNC versions.
- sysdeps/unix/sysv/linux/x86_64/libmvec.abilist: New versions added.
- sysdeps/x86/fpu/bits/math-vector.h: Added SIMD declaration for cosf.
- NEWS: Mention addition of x86_64 vector cosf.

04f496d Vector cosf for x86_64.
ChangeLog | 21 +
NEWS | 2 +-
sysdeps/unix/sysv/linux/x86_64/libmvec.abilist | 4 +
sysdeps/x86/fpu/bits/math-vector.h | 2 +
sysdeps/x86_64/fpu/Makefile | 4 +-
sysdeps/x86_64/fpu/Versions | 1 +
sysdeps/x86_64/fpu/multiarch/Makefile | 3 +-
sysdeps/x86_64/fpu/multiarch/svml_s_cosf16_core.S | 39 +
.../fpu/multiarch/svml_s_cosf16_core_avx512.S | 460 ++++++++
sysdeps/x86_64/fpu/multiarch/svml_s_cosf4_core.S | 38 +
.../x86_64/fpu/multiarch/svml_s_cosf4_core_sse4.S | 227 ++++
sysdeps/x86_64/fpu/multiarch/svml_s_cosf8_core.S | 38 +
.../x86_64/fpu/multiarch/svml_s_cosf8_core_avx2.S | 215 ++++
sysdeps/x86_64/fpu/svml_s_cosf16_core.S | 25 +
sysdeps/x86_64/fpu/svml_s_cosf4_core.S | 29 +
sysdeps/x86_64/fpu/svml_s_cosf8_core.S | 29 +
sysdeps/x86_64/fpu/svml_s_cosf8_core_avx.S | 25 +
sysdeps/x86_64/fpu/svml_s_cosf_data.S | 1130 ++++++++++++++++++++
sysdeps/x86_64/fpu/svml_s_cosf_data.h | 58 +
sysdeps/x86_64/fpu/svml_s_wrapper_impl.h | 111 ++
20 files changed, 2458 insertions(+), 3 deletions(-)

Upstream: sourceware.org


  • Share