Optimize NSQ_del_dec() for AVX2
The optimization is bit-exact with C function. This optimization speeds up SILK encoder (floating point) as following: AMD Zen: Complexity 0-5 : 0% Complexity 6-7 : 3 - 7% Complexity 8-10: 8 - 15% Intel Skylake: Complexity 0-5 : 0% Complexity 6-7 : 14 - 18% Complexity 8-10: 17 - 22% Adapted by Jean-Marc Valin
Showing
- Makefile.am 2 additions, 0 deletionsMakefile.am
- silk/x86/NSQ_del_dec_avx2.c 1072 additions, 0 deletionssilk/x86/NSQ_del_dec_avx2.c
- silk/x86/main_sse.h 27 additions, 1 deletionsilk/x86/main_sse.h
- silk/x86/x86_silk_map.c 1 addition, 1 deletionsilk/x86/x86_silk_map.c
- silk_sources.mk 3 additions, 0 deletionssilk_sources.mk
silk/x86/NSQ_del_dec_avx2.c
0 → 100644
This diff is collapsed.
Please register or sign in to comment