1. 11 Feb, 2015 3 commits
  2. 06 Feb, 2015 1 commit
    • Jingning Han's avatar
      Re-arrange inter mode search order in RTC coding flow · b2762a88
      Jingning Han authored
      This commit makes the ZEROMV mode first in the search order to
      ensure that the zero mv is always checked in the RTC coding mode.
      It improves the average speed -6 compression performance by 0.3%
      in both PSNR and SSIM at no visible speed change.
      Change-Id: I465a7e59f4e20cd84fee3f02ced6f98036945949
  3. 04 Feb, 2015 2 commits
    • Jingning Han's avatar
      Save an extra call for setup_pred_plane function · 4ccfc7d5
      Jingning Han authored
      Reuse the yv12_mb array to fetch the buffer pointers/strides
      corresponding to the current reference frame.
      Change-Id: I5276b7494158b2cccef15213be2dc189e9036851
    • Jingning Han's avatar
      Account for chroma component costs in RTC mode decision · 0c6d3a03
      Jingning Han authored
      This commit allows the encoder to account for additional chroma
      plane costs in the mode decision process, if the current block
      potentially contains significant color change. It improves the
      visual quality at very low bit-rates.
      The compression performance of dark720p is improved by 12.39% in
      speed 6. For jimred at 150 kbps, the PSNR of V component (red)
      increased by 0.2 dB, at the expense of about 5% increase in
      encoding time. Note that for sequences where the chroma components
      are fairly consistent, the encoding time increase is negligible.
      On average the rtc set compression performance is improved by
      1.172% in PSNR and 1.920% in SSIM.
      Change-Id: Ia55b24ef23a25304f7ec9958fbf07fd6e658505c
  4. 31 Jan, 2015 1 commit
    • hkuang's avatar
      Try again to merge branch 'frame-parallel' into master branch. · be6aeada
      hkuang authored
      In frame parallel decode, libvpx decoder decodes several frames on all
      cpus in parallel fashion. If not being flushed, it will only return frame
      when all the cpus are busy. If getting flushed, it will return all the
      frames in the decoder. Compare with current serial decode mode in which
      libvpx decoder is idle between decode calls, libvpx decoder is busy
      between decode calls.
      Current frame parallel decode will only speed up the decoding for frame
      parallel encoded videos. For non frame parallel encoded videos, frame
      parallel decode is slower than serial decode due to lack of loopfilter
      worker thread.
      There are still some known issues that need to be addressed. For example:
      decode frame parallel videos with segmentation enabled is not right sometimes.
      * frame-parallel:
        Add error handling for frame parallel decode and unit test for that.
        Fix a bug in frame parallel decode and add a unit test for that.
        Add two test vectors to test frame parallel decode.
        Add key frame seeking to webmdec and webm_video_source.
        Implement frame parallel decode for VP9.
        Increase the thread test range to cover 5, 6, 7, 8 threads.
        Fix a bug in adding frame parallel unit test.
        Add VP9 frame-parallel unit test.
        Manually pick "Make the api behavior conform to api spec." from master branch.
        Move vp9_dec_build_inter_predictors_* to decoder folder.
        Add segmentation map array for current and last frame segmentation.
        Include the right header for VP9 worker thread.
        Move vp9_thread.* to common.
        ctrl_get_reference does not need user_priv.
        Seperate the frame buffers from VP9 encoder/decoder structure.
        Revert "Revert "Revert "Revert 3 patches from Hangyu to get Chrome to build:"""
      This reverts commit a18da976.
      Change-Id: I361442ffec1586d036ea2e0ee97ce4f077585f02
  5. 23 Jan, 2015 1 commit
  6. 07 Jan, 2015 3 commits
  7. 02 Jan, 2015 1 commit
  8. 30 Dec, 2014 1 commit
  9. 25 Dec, 2014 1 commit
    • Jingning Han's avatar
      Enable sub8x8 inter block search for RTC coding mode · dad89d5c
      Jingning Han authored
      This commit enables sub8x8 inter block coding for RTC mode. The
      use of sub8x8 blocks can be turned on by allowing
      choose_partitioning function to select 4x4/4x8/8x4 block sizes.
      Change-Id: Ifbf1fb3888fe4c094fc85158ac3aa89867d8494a
  10. 23 Dec, 2014 1 commit
    • Jingning Han's avatar
      Set ref frame scaling factor in RTC inter mode decision · eb1795f6
      Jingning Han authored
      Properly set the corresponding scaling factor of the reference
      frame in the non-RD mode decision process. This allows the mode
      search process to account for the scaled reference frame when
      selecting coding mode.
      Change-Id: I9d41bff6931c98e5a82b413e37ac5e6e14b93b23
  11. 18 Dec, 2014 2 commits
    • Jingning Han's avatar
      Add a guard on intra mode skip control for RTC mode · 6ec0ef66
      Jingning Han authored
      This commit adds a guard condition to the intra mode test skip
      control in RTC coding mode. If all inter modes are skipped, force
      the encoder to check intra mode. It avoids situations where the
      encoder processes without properly assigning required mode
      Change-Id: Ibb349fee997d6584ce901d08b06e8df3ca9c01b1
    • Jingning Han's avatar
      Remove ARF mode entries from THR_MODES array in non-RD mode · dd0602e0
      Jingning Han authored
      The alternate reference frame is disabled in non-RD mode. No need
      to keep the related entries in the THR_MODES array.
      Change-Id: I53386f4bb1c6284f582801f27246c5edf55bc24b
  12. 17 Dec, 2014 2 commits
    • Jingning Han's avatar
      Rework mode search threshold update for RTC coding mode · 455514a6
      Jingning Han authored
      In RTC coding mode, the alternate reference frame modes and compound
      inter prediction modes are disabled. This commit reworks the
      related mode search threshold update process to skip interacting
      with these coding modes. It provides about 1.5% speed-up for speed
      -6 on average.
      16551 b/f, 40.451 dB, 6261 ms -> 16550 b/f, 40.459 dB, 6190 ms
      33316 b/f, 38.795 dB, 6335 ms -> 33310 b/f, 38.798 dB, 6237 ms
      33265 b/f, 41.055 dB, 7176 ms -> 33267 b/f, 41.064 dB, 7084 ms
      33329 b/f, 39.729 dB, 11235 ms -> 33331 b/f, 39.733 dB, 10731 ms
      Change-Id: If2a4090a371cd28f579be219c013b972d7d9b97f
    • Jingning Han's avatar
      Properly store the tx_size of selected intra mode · 56a8bc54
      Jingning Han authored
      Use a temporary variable to store the transform size associated
      with the best intra mode and restore the mode_info if the overall
      best mode is intra mode.
      Change-Id: I2606e0061ad32f91b095462902b1eb734b128eea
  13. 16 Dec, 2014 1 commit
    • Jingning Han's avatar
      Fix intra mode update process in vp9_pick_inter_mode · df3e3ab6
      Jingning Han authored
      When multiple intra modes are tested, the previous mode info
      update process may overwrite the selected best intra mode and make
      the final selection use an inter mode. This commit fixes this
      issue by moving the mode_info reset outside the intra mode search
      Change-Id: I15ed4288a6b3cb0832104a5e6d5d9a25cd1a5b2b
  14. 15 Dec, 2014 3 commits
    • Jingning Han's avatar
      Initialize best_tx_size with invalid value · c2c7596f
      Jingning Han authored
      If vp9_pick_inter_mode works properly, it should at least check
      one coding mode and hence get best_tx_size assigned a valid value.
      There is no need to initialize best_tx_size with a legitimate
      value before starting the mode search.
      Change-Id: Ic0496cd89672ea9c2c512a9bd1da952190af9cba
    • Jingning Han's avatar
      Use right shift to replace division in vp9_pick_inter_mode · 83e2c62a
      Jingning Han authored
      Make the variable reduction_fac log2 based and explicitly use
      right shift when computing intra_cost_penalty.
      Change-Id: I208f1fb879a02debb3b3fc64f9fd06260dcf1c86
    • Jingning Han's avatar
      Simplify rate-distortion modeling function · eefe8692
      Jingning Han authored
      Use left shift to replace one multiplication. The computation
      outcome remains identical.
      Change-Id: I1e1737af0a245de0d2a2bde10f0c171477199fc1
  15. 10 Dec, 2014 1 commit
    • Jingning Han's avatar
      Use use_prev_frame_mvs flag for ref mv search branch · 0cac834b
      Jingning Han authored
      Replace error_resilient flag with use_prev_frame_mvs in
      vp9_pick_inter_mode reference motion vector search selection.
      This effectively turns off the simplified ref mv search in the
      settings of frame resizing, even if error-resilient mode is off.
      Change-Id: I7fed814ee7bc0cb419a03b846e0fc2de46ba7686
  16. 06 Dec, 2014 1 commit
    • Jingning Han's avatar
      Remove redundant rdcost reset · 17bedc54
      Jingning Han authored
      The initial reset of this_rdc in vp9_pick_inter_mode is not needed,
      since it will be re-assign when used.
      Change-Id: Ic0e12d741cbab292fc214c1eabb48b129af7839b
  17. 05 Dec, 2014 2 commits
  18. 04 Dec, 2014 1 commit
  19. 25 Nov, 2014 3 commits
    • Jingning Han's avatar
      Cosmetic change in vp9_pick_inter_mode · a04ed984
      Jingning Han authored
      Change-Id: Ic072585ebffdb36982ed7b8b9f875ca6c1c656c4
    • Jingning Han's avatar
      Adaptively adjust mode test kick-off thresholds in RTC coding · 92a7cfc8
      Jingning Han authored
      This commit allows the encoder to increase the mode test kick-off
      thresholds if the previous best mode renders all zero quantized
      coefficients, thereby saving motion search runs when possible.
      The compression performance of speed -5 and -6 is down by -0.446%
      and 0.591%, respectively. The runtime of speed -6 is improved by
      10% for many test clips.
      vidyo1, 1000 kbps
      16578 b/f, 40.316 dB, 7873 ms -> 16575 b/f, 40.262 dB, 7126 ms
      nik720p, 1000 kbps
      33311 b/f, 38.651 dB, 7263 ms -> 33304 b/f, 38.629 dB, 6865 ms
      dark720p, 1000 kbps
      33331 b/f, 39.718 dB, 13596 ms -> 33324 b/f, 39.651 dB, 12000 ms
      mmoving, 1000 kbps
      33263 b/f, 40.983 dB, 7566 ms -> 33259 b/f, 40.978 dB, 7531 ms
      Change-Id: I7591617ff113e91125ec32c9b853e257fbc41d90
    • Yunqing Wang's avatar
      vp9_ethread: modify VP9_COMP structure · edbd61e1
      Yunqing Wang authored
      This patch modified struct VP9_COMP. Created a struct ThreadData
      to include data that need to be copied for each thread. In
      multiple thread case, one thread processes one tile. all threads
      share one copy of VP9_COMP,
      (refer to VP9_COMP *cpi in the code)
      but each thread has its own copy of ThreadData,
      (refer to ThreadData *td in the code).
      Therefore, within the scope of encode_tiles(), both cpi and td
      need to be passed as function parameters.
      In single thread case, the FRAME_COUNTS pointer in ThreadData
      points to "counts" in VP9_COMMON.
      Change-Id: Ib37908b2d8e2c0f4f9c18f38017df5ce60e8b13e
  20. 24 Nov, 2014 2 commits
    • Jingning Han's avatar
      Remove redundant intra mode penalty from vp9_pick_inter_mode · 25be81e2
      Jingning Han authored
      The intra mode penalty is covered by intra_cost_penalty. This
      commit removes the other intra cost threshold, provided that the
      constant 50 is negligible in normal rate-distortion cost.
      Change-Id: I9b8b7483c43b9a41741622e7057def1f7d51bb72
    • Jingning Han's avatar
      Key frame non-RD mode decision process · 2fbdfd2c
      Jingning Han authored
      This commit makes a non-RD coding mode decision process for key
      frame coding. It can be optionally turned on in speed -6 and above.
      Change-Id: I0847258b392877a0210b4768bef88ebc9ad009b5
  21. 21 Nov, 2014 2 commits
    • Jingning Han's avatar
      Rework forward txfm/quantization skip system in RTC coding mode · 7428cebe
      Jingning Han authored
      This commit allows more aggressive decision to skip forward
      transform and quantization for luma component in RTC coding mode.
      The chroma components remains going through the normal coding
      routine, since they are not included in the non-RD mode search
      It reduces the runtime cost by 2% - 10%. In speed -6,
      vidyo1 1000 kbps
      16576 b/f, 40.281 dB, 8402 ms -> 16576 b/f, 40.323 dB, 7764 ms
      nik720p 1000 kbps
      33337 b/f, 38.622 dB, 7473 ms -> 33299 b/f, 38.660 dB, 7314 ms
      dark720p 1000 kbps
      33330 b/f, 39.785 dB, 13505 ms -> 33325 b/f, 39.714 dB, 13105 ms
      The compression performance of speed -6 is improved by 0.44% in
      PSNR and 1.31% in SSIM.
      Change-Id: Iae9e3738de6255babea734e5897f29118bebc6d7
    • Alex Converse's avatar
      Allow DC/H/V/TM on screen content. · bc1b3d84
      Alex Converse authored
      6.3% better compression
      less than 1% compression time increase
      Change-Id: Ie83c059436e54c09de9e7c87e06e0a6d40dc38fe
  22. 17 Nov, 2014 1 commit
  23. 11 Nov, 2014 1 commit
    • Jingning Han's avatar
      Use reconstructed pixels for intra prediction · e717d22b
      Jingning Han authored
      This commit makes the speed -6 and above use the reconstructed
      boundary pixels for precise intra prediction. This allows more
      intra prediction modes to be tested in the non-RD coding process.
      Enabling horizontal and vertical intra prediction modes can
      improve the speed -6 compression performance for rtc set
      by 0.331%.
      Change-Id: I3a99f9d12c6af54de2bdbf28c76eab8e0905f744
  24. 07 Nov, 2014 1 commit
  25. 04 Nov, 2014 1 commit
    • Jingning Han's avatar
      Skip ref frame mode search conditioned on predicted mv residuals · 1434f769
      Jingning Han authored
      This commit makes the RTC coding mode to conditionally skip the
      reference frame mode search, when the predicted motion vector of
      the current reference frame gives more than two times sum of
      absolute difference compared to that of other reference frames.
      It reduces the runtim by 1% - 4% for speed -5 and -6. The average
      compression performance is improved by about 0.1% in both settings.
      It is of particular benefit to light change scenarios. The
      compression performance of test clip mmmovingvga.y4m is improved by
      6.39% and 15.69% at high bit rates for speed -5 and -6, respectively.
      Speed -5
      vidyo1 16555 b/f, 40.818 dB, 12422 ms ->
             16552 b/f, 40.804 dB, 12100 ms
      nik    33211 b/f, 39.138 dB, 11341 ms ->
             33228 b/f, 39.139 dB, 11023 ms
      mmmoving 33263 b/f, 40.935 dB, 13508 ms ->
               33256 b/f, 41.068 dB, 12861 ms
      Speed -6
      vidyo1 16541 b/f, 40.227 dB, 8437 ms ->
             16540 b/f, 40.220 dB, 8216 ms
      nik    33272 b/f, 38.399 dB, 7610 ms ->
             33267 b/f, 38.414 dB, 7490 ms
      mmmoving 33255 b/f, 40.555 dB, 7523 ms ->
               33257 b/f, 40.975 dB, 7493 ms
      Change-Id: Id2aef76ef74a3cba5e9a82a83b792144948c6a91
  26. 02 Nov, 2014 1 commit