1. 20 Jun, 2014 1 commit
  2. 08 May, 2014 1 commit
    • Jingning Han's avatar
      Change eob threshold for partial inverse 8x8 2D-DCT to 12 · 41a350a8
      Jingning Han authored
      The scanning order has the first 12 coefficients of the 8x8 2D-DCT
      sitting in the top left 4x4 block. Hence the partial inverse 8x8
      2D-DCT allows to handle cases with eob below 12.
      
      The overall runtime of the inverse 8x8 2D-DCT unit is reduced from
      166 cycles (using SSE2) to 150 cycles (using SSSE3).
      
      Change-Id: I4514f9748042809ac84df4c14382c00f313f1cd2
      41a350a8
  3. 27 Feb, 2014 1 commit
  4. 15 Feb, 2014 1 commit
  5. 13 Feb, 2014 2 commits
  6. 12 Feb, 2014 1 commit
  7. 05 Feb, 2014 2 commits
  8. 01 Feb, 2014 1 commit
  9. 28 Jan, 2014 1 commit
  10. 27 Jan, 2014 1 commit
  11. 24 Jan, 2014 2 commits
  12. 22 Jan, 2014 1 commit
  13. 15 Jan, 2014 1 commit
  14. 08 Jan, 2014 1 commit
  15. 18 Dec, 2013 1 commit
    • Jim Bankoski's avatar
      rename loop filter functions · b720ba16
      Jim Bankoski authored
      This renames all the loop filter functions so that they no
      longer refer to mb
      
      Change-Id: I8a58a8c7fd253d835cb619bde13913e896ece90b
      b720ba16
  16. 26 Nov, 2013 1 commit
  17. 22 Nov, 2013 1 commit
    • Yunqing Wang's avatar
      Do vertical loopfiltering in parallel · ed36720b
      Yunqing Wang authored
      This patch followed "Add filter_selectively_vert_row2 to enable
      parallel loopfiltering" commit, and added x86 SSE2 optimization
      to do 16-pixel filtering in parallel. For other optimizations
      (neon and dspr2), current 16-pixel functions were done by calling
      8-pixel functions twice, and real 16-pixel functions could be added
      later.
      
      Decoder speedup:
      tulip clip:     2% speed gain;
      old_town_cross: 1.2% speed gain;
      bus:            2% speed gain.
      
      Change-Id: I4818a0c72f84b34f5fe678e496cf4a10238574b7
      ed36720b
  18. 21 Nov, 2013 2 commits
  19. 16 Nov, 2013 1 commit
    • Yunqing Wang's avatar
      Do horizontal loopfiltering in parallel · 64f728ca
      Yunqing Wang authored
      This patch followed "Rewrite filter_selectively_horiz for parallel
      loopfiltering" commit, and added x86 SSE2 optimization to do
      16-pixel filtering in parallel. Also, corrected the declaration
      of aligned arrays. For 8-pixel-in-parallel case, improved the
      calculation of the masks and filters. Updated the threshold loading
      since the thresholds were already duplicated. Updated neon C functions
      to call neon loopfilters twice.
      
      Using tulip clip, tests showed it gave a ~1.5% decoder speed gain.
      
      Change-Id: Id02638626ac27a4b0e0b09d71792a24c0499bd35
      64f728ca
  20. 12 Nov, 2013 1 commit
    • Johann's avatar
      Use lowercase 'b' to branch · e72d49a9
      Johann authored
      iOS doesn't recognize B:
      bad instruction `B idct32_pass_loop'
      
      Change-Id: I3cf6aede4639f1d9efa97f7962fa287ba6feaaef
      e72d49a9
  21. 11 Nov, 2013 1 commit
  22. 05 Nov, 2013 1 commit
  23. 12 Oct, 2013 1 commit
  24. 11 Oct, 2013 1 commit
  25. 10 Oct, 2013 1 commit
  26. 07 Oct, 2013 1 commit
    • Dmitry Kovalev's avatar
      Giving consistent names to IDCT 16x16 functions. · b096c5a3
      Dmitry Kovalev authored
      Renames:
        vp9_short_idct16x16_add    -> vp9_idct16x16_256_add
        vp9_short_idct16x16_10_add -> vp9_idct16x16_10_add
        vp9_short_idct16x16_1_add  -> vp9_idct16x16_1_add
        vp9_idct_add_16x16         -> vp9_idct16x16_add
      
      Change-Id: Ief8a3904de78deab0f4ede944c4d0339c228cfc3
      b096c5a3
  27. 06 Oct, 2013 1 commit
    • Dmitry Kovalev's avatar
      Giving consistent names to IDCT 8x8 functions. · c6ad70d5
      Dmitry Kovalev authored
      Renames:
        vp9_short_idct8x8_add    -> vp9_idct8x8_64_add
        vp9_short_idct8x8_1_add  -> vp9_idct8x8_1_add
        vp9_short_idct8x8_10_add -> vp9_idct8x8_10_add
        vp9_idct_add_8x8         -> vp9_idct8x8_add
      
      Change-Id: Ifb8d3a45b4c0397aa805b30463f3d14581bf72c1
      c6ad70d5
  28. 04 Oct, 2013 1 commit
    • Dmitry Kovalev's avatar
      Giving consistent names to IDCT/IWHT functions. · 3a060257
      Dmitry Kovalev authored
      The idea is to have the following names for each transform size:
      
      vp9_idct4x4_add
        vp9_idct4x4_1_add
        vp9_idct4x4_10_add
        vp9_idct4x4_16_add
      
      vp9_idct8x8_add
        vp9_idct8x8_1_add
        vp9_idct8x8_10_add
        vp9_idct8x8_64_add
      
      etc for 16x16, 32x32
      
      The actual list of renames in this patch:
      
      vp9_idct_add_lossless     -> vp9_iwht4x4_add
      vp9_short_iwalsh4x4_add   -> vp9_iwht4x4_16_add
      vp9_short_iwalsh4x4_1_add -> vp9_iwht4x4_1_add
      
      vp9_idct_add            -> vp9_idct4x4_add
      vp9_short_idct4x4_add   -> vp9_idct4x4_16_add
      vp9_short_idct4x4_1_add -> vp9_idct4x4_1_add
      
      Change-Id: I6f43f7437c68dd30cdd05d72e213765578ed30b1
      3a060257
  29. 27 Sep, 2013 2 commits
  30. 26 Sep, 2013 2 commits
  31. 20 Sep, 2013 1 commit
    • Johann's avatar
      Use lowercase instruction in assembly · a6a00fc6
      Johann authored
      The iOS compiler does not recognize BLE:
      bad instruction `BLE idct32_transpose_pair_loop'
      
      Change-Id: I7426694c66bc31caf939a2d5000968da1222c15b
      a6a00fc6
  32. 16 Sep, 2013 1 commit
  33. 12 Sep, 2013 1 commit
  34. 11 Sep, 2013 1 commit