1. 17 Feb, 2016 7 commits
  2. 16 Feb, 2016 2 commits
  3. 29 Jan, 2016 3 commits
    • Yaowu Xu's avatar
      Enable sse2 version of inverse wht for hbd build · 0aef1bc8
      Yaowu Xu authored
      Change-Id: If8f5efd701a11c8a7ad3078d10ec3cd0fe27667e
      0aef1bc8
    • Yaowu Xu's avatar
      SSSE3 idct8x8 functions for highbitdpeth build · b2297108
      Yaowu Xu authored
      This commit changes SSSE3 optimized idct8x8 functions to work with
      highbitdepth build.
      
      With this commit and the previous one that enabled SSSE3 idct32x32
      functions, tests showed virtually no difference on decoding speed for
      file fdJc1_IBKJA.248.webm for the build with -enable-vp9-highbitdpeth
      option and the build without the option.
      
      Change-Id: Ibe0634149ec70e8b921e6b30171664b8690a9c45
      b2297108
    • Yaowu Xu's avatar
      Enable hbd_build to use SSSE3optimized functions · aac1ef7f
      Yaowu Xu authored
      This commit changes the SSSE3 assembly functions for idct32x32 to
      support highbitdepth build.
      
      On test clip fdJc1_IBKJA.248.webm, this cuts the speed difference
      between hbd and lbd build from between 3-4% to 1-2%.
      
      Change-Id: Ic3390e0113bc1ca5bba8ec80d1795ad31b484fca
      aac1ef7f
  4. 13 Jan, 2016 1 commit
  5. 23 Dec, 2015 3 commits
  6. 19 Dec, 2015 2 commits
    • James Zern's avatar
      sad_sse2: fix sad4xN(_avg) on windows · 8b2ddbc7
      James Zern authored
      reduce the register count by 1 to avoid xmm6 and unnecessarily
      penalizing the other users of the base macro
      
      Change-Id: I59605c9a41a31c1b74f67ec06a40d1a7f92c4699
      8b2ddbc7
    • Jian Zhou's avatar
      Code clean of highbd_tm_predictor_4x4 · db113075
      Jian Zhou authored
      Replace MMX with SSE2, reduce mem access to left neighbor,
      loop unrolled.
      
      Change-Id: I941be915af809025f121ecc6c6443f73c9903e70
      db113075
  7. 18 Dec, 2015 4 commits
    • Jian Zhou's avatar
      Code clean of highbd_v_predictor_4x4 · c91dd55e
      Jian Zhou authored
      MMX replaced with SSE2, same performance.
      
      Change-Id: I2ab8f30a71e5fadbbc172fb385093dec1e11a696
      c91dd55e
    • Jian Zhou's avatar
      Code clean of highbd_dc_predictor_4x4 · 8366b414
      Jian Zhou authored
      MMX replaced with SSE2, same performance.
      
      Change-Id: Ic57855254e26757191933c948fac6aa047fadafc
      8366b414
    • Peter de Rivaz's avatar
      Fix for issue 1114 compile error · 7361ef73
      Peter de Rivaz authored
      In 32-bit build with --enable-shared, there is a lot of
      register pressure and register src_strideq is reused.
      The code needs to use the stack based version of src_stride,
      but this doesn't compile when used in an lea instruction.
      
      This patch also fixes a related segmentation fault caused by the
      implementation using src_strideq even though it has been
      reused.
      
      This patch also fixes the HBD subpel variance tests that fail
      when compiled without disable-optimizations.
      These failures were caused by local variables in the assembler
      routines colliding with the caller's stack frame.
      
      Change-Id: Ice9d4dafdcbdc6038ad5ee7c1c09a8f06deca362
      7361ef73
    • Jian Zhou's avatar
      Code clean of sad4xNx4D_sse · 789dbb31
      Jian Zhou authored
      Replace MMX with SSE2.
      
      Change-Id: I948ca1be6ed9b8e67f16555e226f1203726b7da6
      789dbb31
  8. 17 Dec, 2015 1 commit
    • Jian Zhou's avatar
      Code clean of sad4xN(_avg)_sse · b158d9a6
      Jian Zhou authored
      Replace MMX with SSE2, reduce psadbw ops which may help Silvermont.
      
      Change-Id: Ic7aec15245c9e5b2f3903dc7631f38e60be7c93d
      b158d9a6
  9. 14 Dec, 2015 1 commit
  10. 11 Dec, 2015 1 commit
    • Jian Zhou's avatar
      Code clean of tm_predictor_32x32 · 88120481
      Jian Zhou authored
      Reallocate the xmm register usage so that no ARCH_X86_64 required.
      Reduce memory access to the left neighbor by half.
      Speed up by single digit on big core machine.
      
      Change-Id: I392515ed8e8aeb02e6a717b3966b1ba13f5be990
      88120481
  11. 10 Dec, 2015 4 commits
  12. 08 Dec, 2015 1 commit
    • Jian Zhou's avatar
      Re-enable SSE2 based intra 4x4 prediction · aa5b517a
      Jian Zhou authored
      4x4 Intra predictor implemented with MMX is replaced with SSE2.
      Segfault in change 315561 when decoding vp8 is taken care of.
      
      Change-Id: I083a7cb4eb8982954c20865160f91ebec777ec76
      aa5b517a
  13. 07 Dec, 2015 1 commit
  14. 05 Dec, 2015 1 commit
  15. 04 Dec, 2015 4 commits
    • Jian Zhou's avatar
      Speed up h_predictor_16x16 · e86c7c86
      Jian Zhou authored
      Relocate the function from SSSE3 to SSE2, Unroll loop from 8 to 4,
      and reduce mem access to left.
      Speed up by >20% in ./test_intra_pred_speed.
      
      Change-Id: Ie48229c2e32404706b722442942c84983bda74cc
      e86c7c86
    • Jian Zhou's avatar
      Speed up h_predictor_8x8 · da3f08fa
      Jian Zhou authored
      Relocate the function from SSSE3 to SSE2, Unroll loop from 4 to 2,
      and reduce mem access to left.
      Speed up by >20% in ./test_intra_pred_speed.
      
      Change-Id: Ib9f1846819783b6e05e2a310c930eb844b2b4d2e
      da3f08fa
    • Jian Zhou's avatar
      MMX in intra 8x8 prediction replaced with SSE2 · aa2764ab
      Jian Zhou authored
      8x8 Intra predictor implemented with MMX is replaced with SSE2.
      
      Change-Id: I0c90e7c1e1e6942489ac2bfe58903b728aac7a52
      aa2764ab
    • Jian Zhou's avatar
      MMX in intra 4x4 prediction replaced with SSE2 · 89a1efa4
      Jian Zhou authored
      4x4 Intra predictor implemented with MMX is replaced with SSE2.
      
      Change-Id: Id57da2a7c38832d0356bc998790fc1989d39eafc
      89a1efa4
  16. 02 Dec, 2015 1 commit
  17. 01 Dec, 2015 1 commit
    • Jian Zhou's avatar
      Speed up tm_predictor_16x16 · c7fae5d8
      Jian Zhou authored
      Reduce mem access to left. Speed up by 10% in ./test_intra_pred_speed
      with the same instruction size.
      
      Change-Id: Ia33689d62476972cc82ebb06b50415aeccc95d15
      c7fae5d8
  18. 30 Nov, 2015 1 commit
    • Jian Zhou's avatar
      SSE2 speed up of h_predictor_4x4 · 9d29d762
      Jian Zhou authored
      Relocate h_predictor_4x4 from SSSE3 to SSE2 with XMM registers.
      Speed up by ~25% in ./test_intra_pred_speed.
      
      Change-Id: I64e14c13b482a471449be3559bfb0da45cf88d9d
      9d29d762
  19. 25 Nov, 2015 1 commit