1. 28 Nov, 2012 4 commits
    • Jim Bankoski's avatar
      fixed includes to be fully specified · c6787398
      Jim Bankoski authored
      Change-Id: Ia1cce221f8511561b9cbd8edb7726fbc286ff243
      c6787398
    • Jim Bankoski's avatar
      remove postproc invokes · 85cba19e
      Jim Bankoski authored
      and some miscellaneous invoke left overs
      
      Change-Id: I63191b1bfd3bea4ce30cceaeb686ec850570fc43
      85cba19e
    • Yaowu Xu's avatar
      removed redundant mode_context data structures · 12da793d
      Yaowu Xu authored
      This commit removed a couple of redundant data structures in frame
      coding contextsm, mode_context and mode_context_a, and changed to
      use vp9_mode_contexts only. The switch of the context for different
      frame type now relies on the switch of frame coding context between
      lfc and lfc_a. This commit also removed a number of memcpy among
      these redundant data structure.
      
      Change-Id: I42e8174bd60f466b0860afc44c1263896471b0f3
      12da793d
    • John Koleszar's avatar
      Clamp decoded feature data · a1f15814
      John Koleszar authored
      Not all segment feature data elements are full-range powers of two, so
      there are values that can be encoded that are invalid. Add a new function
      to clamp values to the maximum allowed.
      
      Change-Id: Ie47cb80ef2d54292e6b8db9f699c57214a915bc4
      a1f15814
  2. 27 Nov, 2012 1 commit
    • John Koleszar's avatar
      Add vp9_ prefix to all vp9 files · fcccbcbb
      John Koleszar authored
      Support for gyp which doesn't support multiple objects in the same
      static library having the same basename.
      
      Change-Id: Ib947eefbaf68f8b177a796d23f875ccdfa6bc9dc
      fcccbcbb
  3. 26 Nov, 2012 1 commit
    • Yunqing Wang's avatar
      Improve sad3x16 SSE2 function · e7cd8071
      Yunqing Wang authored
      Vp9_sad3x16_sse2() is heavily called in decoder, in which the
      unaligned reads consume lots of cpu cycles. When CONFIG_SUBPELREFMV
      is off, the unaligned offset is 1. In this situation,
      we can adjust the src_ptr to be 4-byte aligned, and then do the
      aligned reads. This reduced the reading time significantly. Tests
      on 1080p clip showed over 2% decoder performance gain with
      CONFIG_SUBPELREFM off.
      
      Change-Id: I953afe3ac5406107933ef49d0b695eafba9a6507
      e7cd8071
  4. 25 Nov, 2012 1 commit
  5. 21 Nov, 2012 1 commit
  6. 20 Nov, 2012 1 commit
  7. 18 Nov, 2012 1 commit
    • Jim Bankoski's avatar
      clean out some of the rtcd code. · f4871b6a
      Jim Bankoski authored
      This removes functions that are no longer needed and cleans up some warnings.
      
      Change-Id: I292a4c3694e9c1d68ce99cea390905b198434719
      f4871b6a
  8. 17 Nov, 2012 1 commit
  9. 16 Nov, 2012 5 commits
    • Yunqing Wang's avatar
      Add const before the dequant(dq) · 47d9d48f
      Yunqing Wang authored
      Modified code to use const before dq.
      
      Change-Id: I6fa59c2ed9743ded33ad08df70e15c2fe1ae7b99
      47d9d48f
    • Ronald S. Bultje's avatar
      Support 32x32 intra modes in non-keyframe superblocks. · 5b11052a
      Ronald S. Bultje authored
      Change-Id: Icf8ad313c543462e523bff89690e5daa8d49bcc0
      5b11052a
    • Paul Wilkins's avatar
      Further experimentation with the mode context · a57dbd95
      Paul Wilkins authored
      Experiments with a larger set of contexts and some
      clean up to replace magic numbers regarding the
      number of contexts.
      
      The starting values and rate of backwards adaption
      are still suspect and based on a small set of tests.
      Added forwards adjustment of probabilities.
      
      The net result of adding the new context and forward
      update is small compared to the old context from the
      legacy find_near function.  (down a little on derf but
      up by a similar amount for HD)
      
      HOWEVER.... with the new context and forward update
      the impact of disabling the reverse update (which may be
      necessary in some use cases to facilitate parallel decoding)
      is hugely reduced.
      
      For the old context without forward update, the impact of
      turning off reverse update (Experiment was with SB off) was
      Derf - 0.9, Yt -1.89, ythd -2.75 and sthd -8.35. The impact was
      mainly at low data rates.
      
      With the new context and forward update enabled the impact
      for all the test sets was no more than 0.5-1% (again most at
      the low end).
      
      Change-Id: Ic751b414c8ce7f7f3ebc6f19a741d774d2b4b556
      a57dbd95
    • Yaowu Xu's avatar
      changed mv candidate search for superblocks · 415e6bff
      Yaowu Xu authored
      added additional motion vectors at close neighborhood of a superblock
      to the list of candiate motion vectors, and removed a couple that are
      further away.
      
      The change helped std-hd set about .8% (all metrics) and smaller gain
      for derf set.
      
      Change-Id: Iaa69b98614db43420ed3fd4738d0ca5587b90045
      415e6bff
    • Deb Mukherjee's avatar
      Compound inter-intra experiment · 0c917fc9
      Deb Mukherjee authored
      A patch on compound inter-intra prediction.
      
      In compound inter-intra prediction, a new predictor for
      16x16 inter coded MBs are obtained by combining a single
      inter predictor with a 16x16 intra predictor, in a manner
      that the weight varies with distance from the top/left
      boundary. The current search strategy is to combine the best
      inter mode with the best intra mode obtained independently.
      
      Results so far:
      
      derf +0.31%
      yt +0.32%
      std-hd +0.35%
      hd +0.42%
      
      It is conceivable that the results would improve somewhat
      with a more thorough search strategy where all intra modes
      are searched given the best mv, or even a joint search for
      the best mv and the best intra mode.
      
      Change-Id: I7951f1ed0d6eb31ca32ac24d120f1585bcd8d79b
      0c917fc9
  10. 15 Nov, 2012 4 commits
    • John Koleszar's avatar
      Pack invisible frames without lengths · 64bcffc1
      John Koleszar authored
      Modify the decoder to return the ending position of the bool decoder and
      use that as the starting position for the next frame.
      
      The constant-space algorithm for parsing the appended frame lengths is
      O(n^2), which is a potential DoS concern if n is unbounded. Revisit
      the appended lengths for use as partition lengths when multipartition
      support is added.
      
      In addition, this allows decoding of raw streams outside of a container
      without additional framing information, though it's insufficient to
      be able to remux said stream into a container.
      
      Change-Id: I71e801a9c3e37abe559a56a597635b0cbae1934b
      64bcffc1
    • Yaowu Xu's avatar
      subpelrefmv for superblocks · 61416aed
      Yaowu Xu authored
      duplicate code clean-up and variable name corrections
      
      Change-Id: Ibc4703228e652ec425125de5e7bc038fa46595c5
      61416aed
    • John Koleszar's avatar
      support building vp8 and vp9 into a single lib · a9c7597a
      John Koleszar authored
      Change-Id: Ib8f8a66c9fd31e508cdc9caa662192f38433aa3d
      a9c7597a
    • John Koleszar's avatar
      detokenize: use SEG_LVL_EOB feature consistently · 6becad42
      John Koleszar authored
      Update decode_coefs() to break when c >= eob, since it's possible that
      c starts the loop from 1 and eob is 0. The loop won't terminate in that
      case.
      
      Add new get_eob() function to consistently clamp the eob based on the
      segment level EOB and the block size. It's possible to code a segment
      level EOB that's greater than the block size, and that leads to an
      out of bounds access.
      
      Change-Id: I859563b30414615cf1b30dcc2aef8a1de358c42d
      6becad42
  11. 14 Nov, 2012 3 commits
    • Ronald S. Bultje's avatar
      Don't use hybrid transform (ADST) for superblocks. · 1e3dd49f
      Ronald S. Bultje authored
      This is in line with other cases where we disable ADST if prediction
      size and transform size don't match. Before this patch, the RD loop
      will use ADST for superblocks, but frame encoding/decoding won't.
      
      Change-Id: I700368c632eb72b5e089c22ef25649d99d7697d0
      1e3dd49f
    • Paul Wilkins's avatar
      Segment mode coding bug. · b527c4db
      Paul Wilkins authored
      There are now more than 16 possible modes so 5
      bits required for segment mode feature.
      
      Note that it is likely that the mode feature and how it is
      coded will change but for now the 4 bits was a bug.
      
      Change-Id: I63348ae3a9cc31566a656c2dc78f09f5e1a9dcc9
      b527c4db
    • Paul Wilkins's avatar
      Silenced a few warnings. · 19a1ba1e
      Paul Wilkins authored
      Silenced a few VS compiler warnings.
      
      Change-Id: I658ea409c36c05cd11042675e2e42ccde0ef2420
      19a1ba1e
  12. 13 Nov, 2012 5 commits
  13. 12 Nov, 2012 2 commits
    • Paul Wilkins's avatar
      New inter mode context · 2669f42b
      Paul Wilkins authored
      This change is a fix / extension of the newbestrefmv
      experiment. As such it is presented without IFDEF.
      
      The change creates a new context for coding inter modes
      in vp9_find_mv_refs(). This replaces the context that
      was previously calculated in vp9_find_near_mvs().
      The new context is unoptimized and not necessarily
      any better at this stage (results pending), but eliminates
      the need for a legacy call to vp9_find_near_mvs().
      Based on numbers from Scott, this could help decode
      speed by several %.
      
      In a later patch I will add support for forward update of
      context (assuming this helps) and refine the context as
      necessary.
      
      Change-Id: I1cd991b82c8df86cc02237a34185e6d67510698a
      2669f42b
    • Paul Wilkins's avatar
      Restrict ref mv search range. · 6fb8953c
      Paul Wilkins authored
      Experiment to test speed trade off of reducing the
      extent of the ref mv search.
      
      Reducing the maximum number of tested candidates to 9 had
      minimal net effect on quality in any of the tests sets.
      
      Reduction to 7 has a small negative impact (worst was STD-HD
      at about -0.2%).
      
      This change is in response to the apparently high number of
      decode cycles reported in regard to mv-ref selection.
      
      Change-Id: I0e92e92e324337689358495a1ec9ccdeb23dc774
      6fb8953c
  14. 10 Nov, 2012 1 commit
    • Deb Mukherjee's avatar
      New b-intra mode where direction is contextual · d01357bb
      Deb Mukherjee authored
      Preliminary patch on a new 4x4 intra mode B_CONTEXT_PRED where the
      dominant direction from the context is used to encode. Various decoder
      changes are needed to support decoding of B_CONTEXT_PRED in conjunction
      with hybrid transforms since the scan order and tokenization depends on
      the actual direction of prediction obtained from the context. Currently
      the traditional directional modes are used in conjunction with the
      B_CONTEXT_PRED, which also seems to provide the best results.
      
      The gains are small - in the 0.1% range.
      
      Change-Id: I5a7ea80b5218f42a9c0dfb42d3f79a68c7f0cdc2
      d01357bb
  15. 08 Nov, 2012 1 commit
    • Yunqing Wang's avatar
      Optimize 16x16 dequant and idct · 6c17c9fa
      Yunqing Wang authored
      As suggested by Yaowu, simplified 16x16 dequant and idct. In decoder,
      after detoken step, we know the number of non-zero dct coefficients
      (eobs) in a macroblock. Idct calculation can be skipped or simplified
      based on eobs, which improves the decoder performance.
      
      Change-Id: I9ffa1cb134bcb5a7d64fcf90c81871a96d1b4018
      6c17c9fa
  16. 07 Nov, 2012 2 commits
    • Yaowu Xu's avatar
      merge full pixel refmv experiment · 0cedaa36
      Yaowu Xu authored
      Change-Id: Ib39ad47a7d188f3b45416937b7eeb28c3e79b74c
      0cedaa36
    • James Zern's avatar
      Fix variance (signed integer) overflow · 98473443
      James Zern authored
      In the variance calculations the difference is summed and later squared.
      When the sum exceeds sqrt(2^31) the value is treated as a negative when
      it is shifted which gives incorrect results.
      
      To fix this we force the multiplication to be unsigned.
      
      The alternative fix is to shift sum down by 4 before multiplying.
      However that will reduce precision.
      
      For 16x16 blocks the maximum sum is 65280 and sqrt(2^31) is 46340 (and
      change).
      
      This change is based on:
      16982342 Missed some variance casts
      fea3556e Fix variance overflow
      
      Change-Id: I2c61856cca9db54b9b81de83b4505ea81a050a0f
      98473443
  17. 06 Nov, 2012 6 commits
    • Yaowu Xu's avatar
      fixed function prototype · a879b4e6
      Yaowu Xu authored
      so they are consistent with actual definitions of the functions
      
      Change-Id: Ie4b4e81b3da3e288fc2edbbd2b393a5c54d2556b
      a879b4e6
    • Yaowu Xu's avatar
      group refmv experiment related functions · acadcec5
      Yaowu Xu authored
      Change-Id: Iedaa108ddb65f54d768424f9c47ad4d069b656fd
      acadcec5
    • James Zern's avatar
      fix test builds · 2e3e6857
      James Zern authored
      s/([vV][pP])8/$19/
      additionally dct.h was removed; declare the _c functions that are used
      in the tests. the TODO for conversion to parameterized tests still
      remains.
      
      Change-Id: I73db9425a57075bbb78a92693ba6b320578981cd
      2e3e6857
    • Yunqing Wang's avatar
      Convert 16x16 dct/idct to integer forms · 4626faf1
      Yunqing Wang authored
      Converted vp9_short_fdct16x16_c and vp9_short_idct16x16_c to
      integer versions.
      
      Change-Id: Ie3ec985a890ac0f4f4f5818e6f0122e00c8af69f
      4626faf1
    • Yaowu Xu's avatar
      silent a lot of MSVC compiler warnings · 8a336b0d
      Yaowu Xu authored
      there are still a couple type of warning left, which are related to
      double constants assigned to float type. As those would be addressed
      by the conversion of transforms into integer version. This commit
      has left those un-dealt with.
      
      Change-Id: I48fd9b489c0c27ad6b543f4177423419f929f2bb
      8a336b0d
    • Jim Bankoski's avatar
      remove invoke_search macro · 7849aa20
      Jim Bankoski authored
      Removed invoke search from encoder
      
      Change-Id: I3d809b795abe6df0e71366edfe94026aaede14fb
      7849aa20