Skip to content

Draft: Optimize NSQ_del_dec() for AVX2

Victor Ding requested to merge 0dvictor/opus:NSQ_del_dec into master

The optimization is bit-exact with C function.

This optimization speeds up SILK encoder (floating point) as following:

AMD Zen:
Complexity 0-5 :      0%
Complexity 6-7 : 3 -  7%
Complexity 8-10: 8 - 15%

Intel Skylake:
Complexity 0-5 :       0%
Complexity 6-7 : 14 - 18%
Complexity 8-10: 17 - 22%
Edited by Victor Ding

Merge request reports