1. 19 Oct, 2011 2 commits
  2. 18 Oct, 2011 1 commit
    • Scott LaVarnway's avatar
      Remove usage of predict buffer for decode · ed9c66f5
      Scott LaVarnway authored
      Instead of using the predict buffer, the decoder now writes
      the predictor into the recon buffer.  For blocks with eob=0,
      unnecessary idcts can be eliminated.  This gave a performance
      boost of ~1.8% for the HD clips used.
      
      Tero: Added needed changes to ARM side and scheduled some
            assembly code to prevent interlocks.
      
      Patch Set 6:  Merged (I1bcdca7a95aacc3a181b9faa6b10e3a71ee24df3)
      into this commit because of similarities in the idct
      functions.
      Patch Set 7: EC bug fix.
      
      Change-Id: Ie31d90b5d3522e1108163f2ac491e455e3f955e6
      ed9c66f5
  3. 17 Oct, 2011 1 commit
  4. 14 Oct, 2011 4 commits
  5. 11 Oct, 2011 3 commits
  6. 10 Oct, 2011 1 commit
  7. 04 Oct, 2011 4 commits
  8. 03 Oct, 2011 1 commit
  9. 30 Sep, 2011 7 commits
    • Johann's avatar
    • Johann's avatar
      Merge "combine loopfilter data access" · 48b19171
      Johann authored
      48b19171
    • Scott LaVarnway's avatar
      Improved tokenize · ab00d209
      Scott LaVarnway authored
      For a realtime HD encodings, up to 1.6% gains seen.
      
      
      
      Change-Id: If45028e23db95124da63f9d38ffe06e05596cc6e
      ab00d209
    • Johann's avatar
      combine loopfilter data access · 3556deac
      Johann authored
      The data processed by the loopfilter overlaps. At the block level, this
      results in some redundant transforms. Grouping the filtering allows for
      a single 16x16 transpose (and inversion) instead of three 16x8 transposes
      (and three more inversions).
      
      This implementation is x86_64 only. We retain the previous
      implementation for x86.
      
      Improvements are obviously material dependant, but it seems to be ~%1 in
      tests here.
      
      Change-Id: I467b7ec3655be98fb5f1a94b5d145e5e5a660007
      3556deac
    • Alpha Lam's avatar
      Call vp8_find_near_mvs lazily · 7bce513a
      Alpha Lam authored
      vp8_find_near_mvs() is being called on all possible reference frames
      but the data computed may be used if the loop exits early, which can
      be due to x->skip beign set to 1.
      
      Optimize this by call vp8_find_near_mvs() laziy only if it is going
      to be used and not computed yet.
      
      Change-Id: Iccdbd4c962a670c9f2c99b8aca8096042ca5dc98
      7bce513a
    • Paul Wilkins's avatar
      Merge "CQ and two pass rate control." · a572ac83
      Paul Wilkins authored
      a572ac83
    • Paul Wilkins's avatar
      CQ and two pass rate control. · b6e27d5f
      Paul Wilkins authored
      Changes to the selection of Q limits for two pass
      and two pass CQ mode.
      
      Allowance made for Mode and motion vector costs.
      Some refactoring of common code.
      
      For Derf and YT sets CQ mode average improvement
      circa 1% (SSIM and Global PSNR).
      
      Some increased tendency to undershoot even when
      user CQ not reached.
      
      Patch2: Removed some test code accidentally merged.
      
      Change-Id: Icf74d13af77437c08602571dc7a97e747cce5066
      b6e27d5f
  10. 29 Sep, 2011 3 commits
    • Aaron Watry's avatar
      Reduce computational complexity of generic C loop filter. · 69aa303d
      Aaron Watry authored
      Change-Id: I1e7f9ed3cd907844a495b9e0073bc140b87e5c06
      69aa303d
    • John Koleszar's avatar
      makefile: fix target 'all' · 22ea8592
      John Koleszar authored
      'all' is the conventional target for building everything in the
      makefile, but the child make was expecting all-$(target), for debugging
      reasons that I don't recall exactly. Restore the expected behavior.
      
      Change-Id: Ifbb03610b55be679ce7c5e210b7a69a156bb76b9
      22ea8592
    • Attila Nagy's avatar
      Multithreaded encoder, late sync loopfilter · 380d64ec
      Attila Nagy authored
      Sync with loopfilter thread just at the beginning of next frame encoding.
      This returns control to application faster and allows a better multicore scaling.
      When PSNR packets are generated the final filtered frame is needed imediatly
      so we cannot delay the sync.
      
      Change-Id: I288d97b5e331d41d6f5bb49d97986fa12ac6f066
      380d64ec
  11. 22 Sep, 2011 7 commits
  12. 20 Sep, 2011 6 commits