1. 21 Nov, 2013 2 commits
    • Yunqing Wang's avatar
      Add filter_selectively_vert_row2 to enable parallel loopfiltering · b5e6d6cc
      Yunqing Wang authored
      Added filter_selectively_vert_row2 to be ready for parallel
      loopfiltering in vertical direction. This change did 2-row
      filtering at a time. If 2 vertically adjacent 8x8 blocks do same
      type of filtering, we can do 16-pixel filtering in parallel.
      
      Next, we need to provide 16-pixel loopfiltering functions in c
      and optimized versions for codec speedup.
      
      Change-Id: Idf97bbdd70566e55bd30e1fd25cb8544e33291be
      b5e6d6cc
    • Frank Galligan's avatar
      Add 16 wide neon horz loopfilter. · 98de1513
      Frank Galligan authored
      Add support to do 16 pixel horizontal filtering in Neon.
      Nexus devices saw about 0.5% decode speed increase.
      
      Change-Id: I2993f6c2d49f31fa74976879eeaa289fd3f4e15d
      98de1513
  2. 20 Nov, 2013 8 commits
  3. 19 Nov, 2013 13 commits
  4. 18 Nov, 2013 3 commits
    • Yaowu Xu's avatar
      Fixed a bug in commit a4a5a210 · e63aaf9a
      Yaowu Xu authored
      Commit a4a5a210 enabled lossless coding, but the commit incorrectly
      disabled the usage of skip in encoder even when skip should be used.
      This commit make sure that skip is enabled even in lossless mode.
      
      Change-Id: I276954f952c6ac68f17a316ebc72f09001228a08
      e63aaf9a
    • Yaowu Xu's avatar
      Move vp9_extend.{h,c} from common to encoder · 1c61e196
      Yaowu Xu authored
      Since they used in encoder only. This commit also re-order includes
      for the files that include vp9_extend.h
      
      Change-Id: I929fc113f2135d3198cd1fc6a17434e5a2f8a459
      1c61e196
    • Jingning Han's avatar
      Constrain encoder motion search range · bbe68fbd
      Jingning Han authored
      Explicitly constrain the upper limit of motion search range (in the
      unit of full pixel) to be [-1023, +1023]. It is intended to control
      the effective motion search range for 4K sequences.
      
      Change-Id: I645539c70885eec0f155781f439d97d333336e88
      bbe68fbd
  5. 17 Nov, 2013 1 commit
  6. 16 Nov, 2013 1 commit
    • Yunqing Wang's avatar
      Do horizontal loopfiltering in parallel · 64f728ca
      Yunqing Wang authored
      This patch followed "Rewrite filter_selectively_horiz for parallel
      loopfiltering" commit, and added x86 SSE2 optimization to do
      16-pixel filtering in parallel. Also, corrected the declaration
      of aligned arrays. For 8-pixel-in-parallel case, improved the
      calculation of the masks and filters. Updated the threshold loading
      since the thresholds were already duplicated. Updated neon C functions
      to call neon loopfilters twice.
      
      Using tulip clip, tests showed it gave a ~1.5% decoder speed gain.
      
      Change-Id: Id02638626ac27a4b0e0b09d71792a24c0499bd35
      64f728ca
  7. 15 Nov, 2013 7 commits
  8. 14 Nov, 2013 5 commits