1. 12 Jun, 2014 1 commit
    • Jingning Han's avatar
      Fast computation path for forward transform and quantization · ccba289f
      Jingning Han authored
      This commit enables a fast path computational flow for forward
      transformation. It checks the sse and variance of prediction
      residuals and decides if the quantized coefficients are all
      zero, dc only, or more. It then selects the corresponding coding
      path in the forward transformation and quantization stage.
      
      It is currently enabled in rtc coding mode. Will do it for rd
      coding mode next.
      
      In speed -6, the runtime for pedestrian_area 1080p at 1000 kbps
      goes down from 14234 ms to 13704 ms, i.e., about 4% speed-up.
      Overall coding performance for rtc set is changed by -0.18%.
      
      Change-Id: I0452da1786d59bc8bcbe0a35fdae9f623d1d44e1
      ccba289f
  2. 09 Jun, 2014 1 commit
    • Yunqing Wang's avatar
      Use small transform size in non-rd real-time mode · b04d7668
      Yunqing Wang authored
      In non-rd real-time mode, choosing smaller transform size in
      encoding gives better video quality and good speed gain than
      choosing larger transform size. This patch set tx size search
      method to ALLOW_8X8, which is better than using 4x4 or other
      larger sizes.
      
      Borg tests on rtc set at speed 6 showed significant gain on quality.
      PSNR gain: 11.034% and SSIM gain: 15.466%.
      
      The speed gain is 5% - 12% for <720p clips, and 2% - 7% for
      720p clips.
      
      Change-Id: If4dc74ed2df359346b059f47fb73b4a0193ec548
      b04d7668
  3. 06 Jun, 2014 2 commits
  4. 05 Jun, 2014 2 commits
  5. 03 Jun, 2014 1 commit
  6. 02 Jun, 2014 1 commit
  7. 01 Jun, 2014 1 commit
  8. 29 May, 2014 2 commits
  9. 27 May, 2014 1 commit
  10. 23 May, 2014 1 commit
  11. 22 May, 2014 1 commit
    • Alex Converse's avatar
      Use offset mode info when filling pc tree. · 80e5326c
      Alex Converse authored
      Use the appropriate subblock offset mode info rather than the parent
      block base, when filling mbmi in the pc tree in nonrd_use_partition.
      This mimics what is done in the vertical case and what is done for
      both cases in nonrd_pick_partition.
      
      This change has little practical effect at the moment since in speed 5
      rt horizontal and vertical partitions are currently only used unpaired
      at edges of the picture.
      
      Change-Id: I4632f66ca84086dac56c7d36b45ddbe38a06f42a
      80e5326c
  12. 21 May, 2014 2 commits
    • Yaowu Xu's avatar
      Enable various thresholds of motion detection · 3bda7ec1
      Yaowu Xu authored
      This commit changed to enable the encoder to adjust motion dection
      speed threshold based on picture size. In addition, cpu-used 1 now
      does a partition search every other frame instead of every third
      frame for low resolution inputs.
      
      The change has no quality/speed impact for 720p and above. Test
      showed the change increase encoding time by between 3% to 6% for
      cpu-used 2 encodiong of 360p sequences. It also has a compression
      gain about .3%.
      
      For cpu-used 2, the change resolved some very disturbing visual
      artifacts in certain sequences when large block partitionings and
      transforms are used as a result of copying the partition from a
      previous frame.
      
      Change-Id: Ic7fd22508cdb811d4ca935655adbf20109286cfa
      3bda7ec1
    • Dmitry Kovalev's avatar
      Moving itxm_add pointer from MACROBLOCKD to MACROBLOCK. · 35a83677
      Dmitry Kovalev authored
      The final goal is eventually to get rid of both itxm_add and fwd_txm4x4.
      This patch does it in the decoder.
      
      Change-Id: Ibb3db57efbcbb1ac387c6742538a9fcf2c6f24a5
      35a83677
  13. 19 May, 2014 1 commit
  14. 16 May, 2014 2 commits
  15. 15 May, 2014 2 commits
  16. 14 May, 2014 1 commit
    • Deb Mukherjee's avatar
      Remove Wextra warnings from vp9_sad.c · 7ab9a958
      Deb Mukherjee authored
      As a side-effect, the max_sad check is removed from the
      C-implementation of VP8, for consistency with VP9, and to
      ensure that the SAD tests common to VP8/VP9 pass.
      That will make the VP8 C implementation of sad a little slower
      but given that is rarely used in practice, the impact will be
      minimal.
      
      Change-Id: I7f43089fdea047fbf1862e40c21e4715c30f07ca
      7ab9a958
  17. 12 May, 2014 1 commit
    • Yaowu Xu's avatar
      Cleanup unused parameters · d22faee9
      Yaowu Xu authored
      Remove unused function parameters from:
      vp9_encodemb.c
      vp9_encodeframe.c
      
      Change-Id: I07725e5099cf98486e70c9c70babb0cd785c58a9
      d22faee9
  18. 10 May, 2014 1 commit
    • Jingning Han's avatar
      Apply constrained partition search range to non-RD mode decision · 734c5ffa
      Jingning Han authored
      This commit enables a chessboard pattern for partition search. All
      the black blocks run regular partition search ranging from 8x8 to
      32x32. The rest white blocks take the nearby blocks' information
      to adaptively decide the effective search range.
      
      The compression performance for rtc set at speed -5 is down by 1.5%.
      For pedestrian 1080p at speed -5, the runtime goes from 41594 ms to
      39697 ms, i.e., about 5% faster.
      
      Change-Id: Ia4b96e237abfaada487c743bca08fe1afd298685
      734c5ffa
  19. 06 May, 2014 1 commit
  20. 01 May, 2014 1 commit
  21. 29 Apr, 2014 1 commit
  22. 23 Apr, 2014 1 commit
  23. 21 Apr, 2014 1 commit
    • Jingning Han's avatar
      Enable background detection for adaptive quantizer control · 80a4f559
      Jingning Han authored
      This commit enables a background detection approach for adaptive
      quantizer control. It combines the cyclic refresh pattern and the
      background information to determine the segment id for adaptive
      quantizer selection, prior to the non-RD mode decision process.
      It hence allows proper quantization information update for a more
      precise rate-distortion modeling in the non-RD mode decision.
      
      The compression performance of speed -5 for rtc set is improved
      by 2.5%, at no speed change.
      
      Change-Id: Ic3713e8ed9185b403b5b1679d19dabd57506d452
      80a4f559
  24. 17 Apr, 2014 4 commits
    • Jingning Han's avatar
      Fix broken encoding process at sub8x8 block size · a7c69a71
      Jingning Han authored
      Use the correct buffer to update the coding mode decision for
      sub8x8 blocks.
      
      Change-Id: I091ef27d2047eeb8b73ceb7c2c7c45b38ba8c6d5
      a7c69a71
    • Jingning Han's avatar
      Skip fetching original frame pointers for UV in background check · 42c1f6db
      Jingning Han authored
      The background detection only tracks luma component. This commits
      removes the frame buffer pointer retrieval for chroma components.
      
      Change-Id: I098bd2950f5e5829ed5dc2b48568167248da7fad
      42c1f6db
    • Jim Bankoski's avatar
      add a context tree structure to encoder · e890c257
      Jim Bankoski authored
      This patch sets up a quad_tree structure (pc_tree) for holding all of
      pick_mode_context data we use at any square block size during encoding
      or picking modes.  That includes contexts for 2 horizontal and 2 vertical
      splits, one none, and pointers to 4 sub pc_tree nodes corresponding
      to split.  It also includes a pointer to the current chosen partitioning.
      
      This replaces code that held an index for every level in the pick
      modes array including:  sb_index, mb_index,
      b_index, ab_index.
      
      These were used as stateful indexes that pointed to the current pick mode
      contexts you had at each level stored in the following arrays
      
      array ab4x4_context[][][],
      sb8x4_context[][][], sb4x8_context[][][], sb8x8_context[][][],
      sb8x16_context[][][], sb16x8_context[][][], mb_context[][], sb32x16[][],
      sb16x32[],  sb32_context[], sb32x64_context[], sb64x32_context[],
      sb64_context
      
      and the partitioning that had been stored in the following:
      b_partitioning, mb_partitioning, sb_partitioning, and sb64_partitioning.
      
      Prior to this patch before doing an encode you had to set the appropriate
      index for your block size ( switch statement),  update it ( up to 3
      lookups for the index array value) and then make your call into a recursive
      function at which point you'd have to call get_context which then
      had to do a switch statement based on the blocksize,  and then up to 3
      lookups based upon the block size to find the context to use.
      
      With the new code the context for the block size is passed around directly
      avoiding the extraneous switch statements and multi dimensional array
      look ups that were listed above.   At any level in the search all of the
      contexts are local to the pc_tree you are working on (in?).
      
      In addition in most places code that used to call sub functions and
      then check if the block size was 4x4 and index was > 0 and return
      now don't preferring instead to call the right none function on the inside.
      
      
      
      Change-Id: I06e39318269d9af2ce37961b3f95e181b57f5ed9
      e890c257
    • Jingning Han's avatar
      Remove redundant buffer initialization and mode_info assignments · b6ec6930
      Jingning Han authored
      There is no need to initialize source/dst frame buffers at frame
      level. These will be done at block coding stage. This commit hence
      removes the redundant operations.
      
      Change-Id: I11d9f2556058c6205c8e58ed53e31f78622c41b7
      b6ec6930
  25. 16 Apr, 2014 2 commits
    • Jingning Han's avatar
      Enable more precise background detection for partition decision · e88a2805
      Jingning Han authored
      This commit compares the current original frame to the previous
      original frame at 64x64 block level and decides if the entire
      block belongs to background area. If it is in the background area,
      skip non-RD partition search and copy the partition types of the
      collocated block in the previous frame.
      
      For vidyo1 in the rtc set, this makes the speed -5 coding speed
      about 8% faster. The overall compression performance is down by
      1.37% for rtc set.
      
      Change-Id: Iccf920562fcc88f21d377fb6a44c547c8689b7ea
      e88a2805
    • Paul Wilkins's avatar
      Remove old activity masking code. · e434d08f
      Paul Wilkins authored
      Delete code relating to the old VP8_TUNE_SSIM flag
      as this code does not currently work and is largely made
      redundant in VP9 by the various AQ modes.
      
      Change-Id: I71f28e1f680573d296422254489000678552b17b
      e434d08f
  26. 15 Apr, 2014 2 commits
  27. 14 Apr, 2014 2 commits
  28. 11 Apr, 2014 1 commit