I benchmarked the result by measuring the number of gperftools samples that hit anywhere in the AAC decoder (starting from aac_decode_frame()) or specifically in vector_fmul_window_c() / ff_vector_fmul_window_vfp() for the same sample AAC stream:
                    Before          After
                    Mean    StdDev  Mean    StdDev  Confidence  Change
Audio decode        1598.2  47.4    1529.2  25.4    100.0%      +4.5%
vector_fmul_window  244.0   22.1    188.9   22.3    100.0%      +29.2%
5edad2c armv6: Accelerate vector_fmul_window
libavutil/arm/float_dsp_init_vfp.c | 7 +-
libavutil/arm/float_dsp_vfp.S | 204 ++++++++++++++++++++++++++++++++++++
2 files changed, 210 insertions(+), 1 deletion(-)
Upstream: git.libav.org