1. 29 Jun, 2016 1 commit
  2. 28 Jun, 2016 1 commit
  3. 27 Jun, 2016 1 commit
    • Yi Luo's avatar
      Fix bugs in convolution filter optimization · 8404253f
      Yi Luo authored
      - Fix the over-writing bug in horizontal filtering as width = 2.
      - Fix 10-tap vertical filtering which no longer reads one row of
        pixel above the block.
      - Fix 10-tap filter zero padding.
      - Encoder speed slow down ~4.0%, compared to,
        81ad9536 Convolution vertical filter SSSE3 optimization
      
      Change-Id: I9bb294a4529300081c29bf284e6bc6eb081cc536
      8404253f
  4. 24 Jun, 2016 2 commits
  5. 23 Jun, 2016 1 commit
    • Yi Luo's avatar
      Convolution vertical filter SSSE3 optimization · 81ad9536
      Yi Luo authored
      - Apply 8-pixel vertical filtering direction parallelism.
      - Add unit tests to verify bit exact.
      - Encoder speed improves ~29% (enable EXT_INTERP) on Xeon E5-2680.
      - Combinational cycle count of vp10_convolve() drops from 26.06%
        to 6.73%.
      
      Change-Id: Ic1ae48f8fb1909991577947a8c00d07832737e57
      81ad9536
  6. 22 Jun, 2016 1 commit
  7. 21 Jun, 2016 1 commit
  8. 20 Jun, 2016 1 commit
    • Yi Luo's avatar
      Convolution horizontal filter SSSE3 optimization · 229690a9
      Yi Luo authored
      - Apply signal direction/4-pixel vertical/8-pixel vertical
        parallelism.
      - Add unit test to verify the bit exact result.
      - Overall encoding time improves ~24% on Xeon E5-2680 CPU.
      
      Change-Id: I104dcbfd43451476fee1f94cd16ca5f965878e59
      229690a9
  9. 18 Jun, 2016 1 commit
  10. 17 Jun, 2016 1 commit
  11. 16 Jun, 2016 7 commits
  12. 15 Jun, 2016 1 commit
  13. 11 Jun, 2016 1 commit
  14. 10 Jun, 2016 3 commits
  15. 09 Jun, 2016 2 commits
  16. 07 Jun, 2016 1 commit
    • Alex Converse's avatar
      Turn ActiveMapTest back on. · 7e26f013
      Alex Converse authored
      If it's creating problems with some experiments, disable it under the
      actual conditions where it doesn't work and file a bug.
      
      Change-Id: Iab9f4bfe42ea926d49d371918da25f9a8938a20f
      7e26f013
  17. 06 Jun, 2016 3 commits
    • Angie Chiang's avatar
      Move #if out of TEST_P in vp10_fwd/inv_txfm2d_test.cc · f67196b2
      Angie Chiang authored
      Change-Id: I1d5b2408f27a1e277574c2238f1e49e884596309
      f67196b2
    • Geza Lore's avatar
      Optimize wedge partition selection. · efda2831
      Geza Lore authored
      We can optimize wedge partition selection by pre-computing the
      residuals of the 2 underlying predictors, and then blend these
      to compute the sse of the compound predictor, without actually
      having to compute and subtract the compound predictor.
      
      Similarly we can pre-compute a proxy array which we can use to
      cheaply check which mask sign would have lower sse.
      
      Details are in wedge_utils.c.
      
      Mathematically these are equivalence transformations, but due to the
      finite precision the encoder output will be perturbed, though on
      average this should make 0% difference.
      
      ext-inter gains about ~4.5% speedup.
      
      Change-Id: Ib2657c3209ae161b4090b58b4b6c392641bf2792
      efda2831
    • Geza Lore's avatar
      Fix decoder crash with supertx · 6c4306c2
      Geza Lore authored
      xd->plane[0].n4_h and xd->plane[0].n4_w are not set at that point
      when using supertx.
      
      While this fixes the immediate crash described in the referenced
      bug report, there are still issues in the ref-mv experiment that
      causes these tests to fail, so they are kept disabled.
      
      BUG=https://bugs.chromium.org/p/webm/issues/detail?id=1230
      
      Change-Id: Ibf8ef02847a903f8d10e6be28e16694db10c75af
      6c4306c2
  18. 03 Jun, 2016 2 commits
  19. 31 May, 2016 1 commit
  20. 27 May, 2016 1 commit
    • James Zern's avatar
      vp10_inv_txfm2d_test: fix memory leak · 5d237f09
      James Zern authored
      input_, ref_input_ and output_ were being allocated with new[] followed
      by vpx_memalign, remove the former
      
      Change-Id: Ia16d0f9b9317042a24445095ad3c284f4e7bb481
      5d237f09
  21. 25 May, 2016 1 commit
    • Yi Luo's avatar
      Integrate HBD inverse HT flip types sse4.1 optimization · bfe4c0ae
      Yi Luo authored
      - tx_size: 4x4, 8x8, 16x16.
      - tx_type: FLIPADST_DCT, DCT_FLIPADST, FLIPADST_FLIPADST,
        ADST_FLIPADST, FLIPADST_ADST.
      - Encoder speed improvement:
        park_joy_1080p_12: ~11%, crowd_run_1080p_12: ~7%.
      - Add unit test cases for bit-exact against C.
      
      Change-Id: Ia69d069031fa76c4625e845bfbfe7e6f6ed6e841
      bfe4c0ae
  22. 24 May, 2016 1 commit
    • Yi Luo's avatar
      HBD inverse HT 8x8 and 16x16 sse4.1 optimization · 28cdee44
      Yi Luo authored
      - Covers tx_type: DCT_DCT, DCT_ADST, ADST_DCT, ADST_ADST.
      - Encoding speed improves ~27% on crowd_run_1080p_12.
      - Merge 4x4, 8x8, 16x16 unit tests in one test file.
      
      Change-Id: I058ef5254d068a9523a826480c78ebbdd231824c
      28cdee44
  23. 23 May, 2016 1 commit
    • Geza Lore's avatar
      Add optimized vpx_blend_mask6 · a661bc87
      Geza Lore authored
      This is to replace vp10/common/reconinter.c:build_masked_compound.
      Functionality is equivalent, but the interface is slightly more
      generic.
      
      Total encoder speedup with ext-inter: ~7.5%
      
      Change-Id: Iee18b83ae324ffc9c7f7dc16d4b2b06adb4d4305
      a661bc87
  24. 20 May, 2016 1 commit
  25. 19 May, 2016 3 commits