Skip to content
Snippets Groups Projects
  • Victor Ding's avatar
    735c4070
    Optimize NSQ_del_dec() for AVX2 · 735c4070
    Victor Ding authored and Jean-Marc Valin's avatar Jean-Marc Valin committed
    The optimization is bit-exact with C function.
    
    This optimization speeds up SILK encoder (floating point) as following:
    
    AMD Zen:
    Complexity 0-5 :      0%
    Complexity 6-7 : 3 -  7%
    Complexity 8-10: 8 - 15%
    
    Intel Skylake:
    Complexity 0-5 :       0%
    Complexity 6-7 : 14 - 18%
    Complexity 8-10: 17 - 22%
    
    Adapted by Jean-Marc Valin
    Verified
    735c4070
    History
    Optimize NSQ_del_dec() for AVX2
    Victor Ding authored and Jean-Marc Valin's avatar Jean-Marc Valin committed
    The optimization is bit-exact with C function.
    
    This optimization speeds up SILK encoder (floating point) as following:
    
    AMD Zen:
    Complexity 0-5 :      0%
    Complexity 6-7 : 3 -  7%
    Complexity 8-10: 8 - 15%
    
    Intel Skylake:
    Complexity 0-5 :       0%
    Complexity 6-7 : 14 - 18%
    Complexity 8-10: 17 - 22%
    
    Adapted by Jean-Marc Valin