  • Optimize inv 16x16 DCT with 10 non-zero coeffs - P2 · af31b27a
    Jingning Han authored
    This commit further optimizes the SSE2 operations in the second 1-D
    pass of the inverse 16x16 DCT for blocks with fewer than 10 non-zero
    coefficients. The average runtime of this module drops from 779
    cycles to 725 cycles.
    
    Change-Id: Iac31b123640d9b1e8f906e770702936b71f0ba7f