1. 16 Sep, 2014 2 commits
  2. 15 Sep, 2014 5 commits
  3. 12 Sep, 2014 3 commits
    • Jingning Han's avatar
      Refactor reference frame control in sub8x8 block RD search · adb20849
      Jingning Han authored
      This commit unifies the reference frame control in the rate-
      distortion optimization search loop of sub8x8 block size to remove
      the control dependency on mode search order.
      
      Change-Id: I3a174099f71a7cc176ede9fd60e2374243ae9232
      adb20849
    • Deb Mukherjee's avatar
      Adds high bitdepth transform functions and tests · 10783d4f
      Deb Mukherjee authored
      Adds various high bitdepth transform functions and tests.
      Much of the changes are related to using typedefs tran_low_t
      and tran_high_t for the final transform cofficients and intermediate
      stages of the transform computation respectively rather than fixed
      types int16_t/int. When vp9_highbitdepth configure flag is off,
      these map tp int16_t/int32_t, but when the flag is on, they map
      to int32_t/int64_t to make space for needed extra precision.
      
      Change-Id: I3c56de79e15b904d6f655b62ffae170729befdd8
      10783d4f
    • Jingning Han's avatar
      Format fixes in vp9_rd_pick_inter_mode_sb · 74ddde01
      Jingning Han authored
      Change-Id: Ie45687405dcaa34ba465dce2aa14f76017d3a794
      74ddde01
  4. 11 Sep, 2014 8 commits
  5. 09 Sep, 2014 4 commits
    • Jingning Han's avatar
      Fix a bug in vp9_rd_pick_inter_mode_sb · 68d79146
      Jingning Han authored
      This commit fixes a bug related to skipping intra mode checking, by
      using a separate variable to store the best prediction error from
      inter mode. It avoids unintentionally overwriting intra mode
      rate-distortion cost, and hence affecting other speed features.
      
      Change-Id: I99e12993339c84c8b4f597996b372012e5858fae
      68d79146
    • Jingning Han's avatar
      Remove redundant ref frame pointer assignment · 9a9e2aef
      Jingning Han authored
      Assigning selected reference frame pointer is done in the
      encode_superblock function. No need to do this at the end of
      rate-distortion optimization search.
      
      Change-Id: I33fcede0fd304b4a4c4deef2d126d79546a9c070
      9a9e2aef
    • Jingning Han's avatar
      Remove dependency of intra mode search skip check on mode order · 33593d1f
      Jingning Han authored
      This commit refactors the vp9_rd_pick_inter_mode_sb function to
      remove the intra mode early termination dependency on the mode
      search order.
      
      Change-Id: If6ac49aa7c530c7b9a5bd31b0ab84db83e192bec
      33593d1f
    • Jingning Han's avatar
      Replace best_mode_index table retrieve with fetching best_mbmode · d96228a0
      Jingning Han authored
      This commit allows the encoder to find current best prediction mode
      state using best_mbmode, instead of fetching from the static mode
      search table via best_mode_index.
      
      Change-Id: Ibefeab83aed33a49c2be03e83f09153856ca4271
      d96228a0
  6. 05 Sep, 2014 1 commit
    • Yunqing Wang's avatar
      Correct the mode decisions in special cases · 1dd9a639
      Yunqing Wang authored
      The rate costs calculated for inter modes are not precise in some
      cases, which causes NEWMV is chosen instead of NEARESTMV, NEARMV,
      and ZEROMV. This patch added checks for these cases, and corrected
      the mode decisions.
      
      Borg tests at speed 3 showed:
      1. stdhd set: 0.102% PSNR gain and 0.088% SSIM gain.
      2. derf set:  0.147% PSNR gain and 0.132% SSIM gain.
      No speed change.
      
      Change-Id: I35d17684b89ad4734fb610942d707899146426db
      1dd9a639
  7. 04 Sep, 2014 1 commit
    • Jingning Han's avatar
      Enable adaptive motion search for ARF coding · d435148f
      Jingning Han authored
      This commit turns on adaptive motion search for ARF coding, in
      addition to other normal inter frame coding. It improves the
      average compression efficiency:
      
      stdhd 0.1%
      derf  0.04%
      
      For the test sequences, the speed 3 runtime is reduced:
      
      pedestrian 1080p 2000 kbps, 149932 ms -> 144580 ms, (3.3% speed-up)
      bus CIF 1000 kbps, 8050 ms -> 7895 ms, (1.9%)
      highway CIF 100 bkps, 45033 ms -> 44078 ms, (2.2%)
      
      Change-Id: I5228565b609f99e8ae04f6140a2bf2b64a831d21
      d435148f
  8. 03 Sep, 2014 3 commits
    • Jingning Han's avatar
      Speed up compound inter prediction mode check · d62d804e
      Jingning Han authored
      This commit allows the encoder to store outcomes of single reference
      frame modes and compares them to decide if the inter prediction
      filter, forward transform, and quantization can be skipped.
      
      The compression performance of speed 3 is down
      derf  -0.364%
      stdhd -0.198%
      
      For test sequences, the speed 3 runtime is reduced
      highway CIF 100 kbps, 51976 ms -> 45033 ms, 13% speed-up
      stockholm 720p 1000 kbps, 71826 ms -> 67838 ms, 5.5% speed-up
      pedestrian 1080p 2000 kbps, 154924 ms -> 150702 ms, 2.6% speed-up
      
      Change-Id: I5aa26f918d2b4b5197a2c0afa2779319f1c88e44
      d62d804e
    • Yaowu Xu's avatar
      Merge two similar functions into one · e759d957
      Yaowu Xu authored
      intra_super_block_yrd() and inter_super_block_yrd() are largely same,
      this commit merges them into one to reduce code duplication.
      
      Change-Id: I64d7042a5b099345627cf55663010c185b25ec37
      e759d957
    • Jingning Han's avatar
      Skip comp inter mode test in RD loop with same frame bias signs · 801fef26
      Jingning Han authored
      This commit allows the encoder to skip check on compound inter
      modes in the rate-distortion optimization loop, if the reference
      frame bias signs are the same.
      
      Change-Id: Ib753e6bb11cbdd338aee69dbe2b649671f75a6b0
      801fef26
  9. 02 Sep, 2014 1 commit
    • Jingning Han's avatar
      Skip comp inter mode tests for arf coding · 33176fef
      Jingning Han authored
      This commit skips the compound inter mode prediction check in the
      rate-distortion optimization loop for ARF coding. It reduces the
      runtime for certain test clips at speed 3, at no compression
      performance change:
      
      bus CIF 1000 kbps, 8260 ms -> 8090 ms, 1.8% speed-up
      stockholm 720p 1000 kbps, 74453 ms -> 71826 ms, 2.9% speed-up
      
      No visible speed-up for pedestrian area 1080p at 2000 kbps.
      
      Change-Id: Ic68aa56837159b726563b784e2e3729e846465ad
      33176fef
  10. 29 Aug, 2014 3 commits
    • Jingning Han's avatar
      Fix int64_t to unsigned int conversion warnings · 6ddf1e15
      Jingning Han authored
      Use unsigned int type to store the sse in the pixel domain. The
      precision is sufficient to handle sse of block size up to 64x64.
      The transform domain version however needs int64_t, since there is
      a transfer gain applied in the forward transformation that might
      cause unsigned int overflow.
      
      Change-Id: Ifef97c38597e426262290f35341fbb093cf0a079
      6ddf1e15
    • Jingning Han's avatar
      Skip intra mode tests depending on inter residuals · 4282955e
      Jingning Han authored
      This commit allows encoder to skip intra coding mode test, when
      the known inter residual is less than the source variance. It
      reduces the runtime of speed 3 for test clips:
      bus cif 1000 kbps: 8587 ms -> 8260 ms, 3.8% speed-up
      pedestrian 1080p 2000 kbps: 161381 ms -> 155241 ms, 3.7% speed-up.
      
      The compression performance is down by
      derf   -0.36%
      stdhd  -0.25%
      
      Change-Id: I75ce1e035b4da2153cb1ac14111d1a07c05a735d
      4282955e
    • Jingning Han's avatar
      Extend block level sse to support multiple txfm blocks · 02e6ecdc
      Jingning Han authored
      This commit extends the sse and forward transform computation flag
      to support the case 64x64 blocks where there are 4 32x32 2D-DCT
      blocks.
      
      Change-Id: I86a3e805dfaa0f3abd812f590520c71aa0e40473
      02e6ecdc
  11. 28 Aug, 2014 2 commits
    • Yunqing Wang's avatar
      Early termination in encoding partition search · 4d2c3769
      Yunqing Wang authored
      In the partition search, the encoder checks all possible
      partitionings in the superblock's partition search tree.
      This patch proposed a set of criteria for partition search
      early termination, which effectively decided whether or
      not to terminate the search in current branch based on the
      "skippable" result of the quantized transform coefficients.
      The "skippable" information was gathered during the
      partition mode search, and no overhead calculations were
      introduced.
      
      This patch gives significant encoding speed gains without
      sacrificing the quality.
      
      Borg test results:
      1. At speed 1,
         stdhd set: psnr: +0.074%, ssim: +0.093%;
         derf set:  psnr: -0.024%, ssim: +0.011%;
      2. At speed 2,
         stdhd set: psnr: +0.033%, ssim: +0.100%;
         derf set:  psnr: -0.062%, ssim: +0.003%;
      3. At speed 3,
         stdhd set: psnr: +0.060%, ssim: +0.190%;
         derf set:  psnr: -0.064%, ssim: -0.002%;
      4. At speed 4,
         stdhd set: psnr: +0.070%, ssim: +0.143%;
         derf set:  psnr: -0.104%, ssim: +0.039%;
      
      The speedup ranges from several percent to 60+%.
                       speed1    speed2    speed3    speed4
      (1080p, 100f):
      old_town_cross:  48.2%     23.9%     20.8%     16.5%
      park_joy:        11.4%     17.8%     29.4%     18.2%
      pedestrian_area: 10.7%      4.0%      4.2%      2.4%
      (720p, 200f):
      mobcal:          68.1%     36.3%     34.4%     17.7%
      parkrun:         15.8%     24.2%     37.1%     16.8%
      shields:         45.1%     32.8%     30.1%      9.6%
      (cif, 300f)
      bus:              3.7%     10.4%     14.0%      7.9%
      deadline:        13.6%     14.8%     12.6%     10.9%
      mobile:           5.3%     11.5%     14.7%     10.7%
      
      Change-Id: I246c38fb952ad762ce5e365711235b605f470a66
      4d2c3769
    • Deb Mukherjee's avatar
      Updates vp9_pattern search to return integer sads · 04b100b2
      Deb Mukherjee authored
      Updates the vp9_pattern_search function to return integer one-away
      neighbors' sad values, for subsequent use in speeding up the
      sub-pel search. Also, removes code for the do_refine option
      which is not being used currently.
      Updates the integer and subpel functions to pass in a 5-element
      sad list for output or input.
      
      A new pruned sub-pel search algorithm is implemented that uses
      the sad returned from the integer pel search. But it is not
      deployed yet.
      
      Change-Id: Ifa9f5ad024b5b660570366d2bd900343e1891520
      04b100b2
  12. 27 Aug, 2014 4 commits
  13. 26 Aug, 2014 1 commit
    • Yaowu Xu's avatar
      add a new interp filter search strategy. · 1144fee3
      Yaowu Xu authored
      This commit addes a new strategy to reduce the search for optimal
      interpolation filter type. The encoder counts and store how many each
      filter type is selected and used for each of the reference frames.
      A filter type that is rarely used for all three reference frames is
      masked out to avoid computation.
      
      The impact on compression is neglectible:
      -0.02% on derf
      +0.02% on stdhd
      
      Encoding time is seen to reduce by 2~3%.
      
      Change-Id: Ibafa92291b51185de40da513716222db4b230383
      1144fee3
  14. 25 Aug, 2014 1 commit
  15. 21 Aug, 2014 1 commit