4.5x faster than C float version with autovectorization 10 x faster than C int version 25 x faster than C float version without autovectorization
222e6da x86/vf_blend: Add SSE2 optimization for divide
libavfilter/x86/vf_blend.asm | 30 ++++++++++++++++++++++++++++++
libavfilter/x86/vf_blend_init.c | 2 ++
2 files changed, 32 insertions(+)