1. 13 Feb, 2017 1 commit
  2. 18 Jan, 2017 1 commit
  3. 13 Jan, 2017 1 commit
      Add rounding option into av1_convolve · 674bffdc
      Angie Chiang authored
      Use a round flag in ConvolveParams to indicate if the destination buffer
      has the result rounded by FILTER_BITS or not.
      This CL is part of the goal of reducing interpolation rounding error in
      compound prediction mode.
      
      Change-Id: I49e522a89a67a771f5a6e7fbbc609e97923aecb6
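To make the rounding motivation concrete, here is a minimal sketch in C. It assumes a 2-tap filter whose taps sum to `1 << FILTER_BITS` and a stand-in struct `SketchConvolveParams` with only a `round` field; the struct layout and the `sketch_*` helpers are hypothetical and not taken from the commit. It compares rounding each prediction before averaging with keeping the raw sums and rounding once.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define FILTER_BITS 7 /* filter taps sum to 1 << FILTER_BITS */
#define ROUND_POWER_OF_TWO(value, n) (((value) + (1 << ((n) - 1))) >> (n))

/* Hypothetical stand-in for ConvolveParams; only the round flag is modelled. */
typedef struct {
  int round; /* 1: result is rounded by FILTER_BITS, 0: raw filter sum */
} SketchConvolveParams;

/* Apply a 2-tap filter to src[0] and src[1]. */
static int32_t sketch_filter2(const uint8_t *src, const int16_t *taps,
                              const SketchConvolveParams *params) {
  assert(src != NULL && taps != NULL && params != NULL);
  const int32_t sum = src[0] * taps[0] + src[1] * taps[1];
  return params->round ? ROUND_POWER_OF_TWO(sum, FILTER_BITS) : sum;
}

/* Compound prediction: round each reference first, then average. */
static int32_t sketch_compound_round_early(const uint8_t *a, const uint8_t *b,
                                           const int16_t *taps) {
  const SketchConvolveParams p = { 1 };
  const int32_t pa = sketch_filter2(a, taps, &p);
  const int32_t pb = sketch_filter2(b, taps, &p);
  return ROUND_POWER_OF_TWO(pa + pb, 1);
}

/* Compound prediction: keep raw sums, round once at the end. */
static int32_t sketch_compound_round_late(const uint8_t *a, const uint8_t *b,
                                          const int16_t *taps) {
  const SketchConvolveParams p = { 0 };
  const int32_t pa = sketch_filter2(a, taps, &p);
  const int32_t pb = sketch_filter2(b, taps, &p);
  return ROUND_POWER_OF_TWO(pa + pb, FILTER_BITS + 1);
}
```

With taps `{ 64, 64 }` and reference pixels `{ 10, 11 }` and `{ 21, 22 }`, the exact average is 16. Rounding early yields 17, while rounding once at the end yields 16.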
  4. 07 Jan, 2017 1 commit
  5. 01 Dec, 2016 1 commit
  6. 30 Nov, 2016 1 commit
  7. 17 Nov, 2016 1 commit
  8. 01 Nov, 2016 1 commit
  9. 09 Sep, 2016 1 commit
      s/INTERP_FILTER/InterpFilter/ · 7b9407a8
      James Zern authored
      This matches the style guidelines and stabilizes successive runs of
      clang-format across the tree. Remaining types should be addressed in
      successive commits.
      
      Change-Id: I6ad3f69cf0a22cb9a9b895b272195f891f71170f
  10. 02 Sep, 2016 1 commit
  11. 01 Sep, 2016 2 commits
  12. 30 Aug, 2016 1 commit
      Fix Visual Studio build warnings · ed07056f
      Yunqing Wang authored
      Fixed a list of VS warnings. Warning message:
      ..\test\vp10_convolve_test.cc(34): warning C4244: 'initializing' : conversion
      from 'ptrdiff_t' to 'int', possible loss of data
      
      Change-Id: I9a1d3978a79fbb7b1ac028c5713ac72b6ff99172
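Warning C4244 is raised when a wider type such as `ptrdiff_t` is narrowed to `int` implicitly. One common way to silence it is an explicit, range-checked cast. The sketch below is only an illustration of that pattern; the helper names are hypothetical and the commit's actual change at line 34 of the test file is not shown in this log.

```c
#include <assert.h>
#include <limits.h>
#include <stddef.h>

/* Narrow a pointer difference to int, checking the range first. */
static int sketch_ptrdiff_to_int(ptrdiff_t d) {
  assert(d >= INT_MIN && d <= INT_MAX);
  return (int)d; /* explicit cast: no implicit-conversion warning */
}

/* Offset of p from base, in bytes, as an int. */
static int sketch_offset(const unsigned char *base, const unsigned char *p) {
  return sketch_ptrdiff_to_int(p - base);
}
```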
  13. 12 Aug, 2016 1 commit
  14. 28 Jul, 2016 1 commit
  15. 23 Jun, 2016 1 commit
      Convolution vertical filter SSSE3 optimization · 81ad9536
      Yi Luo authored
      - Apply 8-pixel vertical filtering direction parallelism.
      - Add unit tests to verify bit-exact results.
      - Encoder speed improves ~29% (with EXT_INTERP enabled) on Xeon E5-2680.
      - Combinational cycle count of vp10_convolve() drops from 26.06%
        to 6.73%.
      
      Change-Id: Ic1ae48f8fb1909991577947a8c00d07832737e57
  16. 20 Jun, 2016 1 commit
      Convolution horizontal filter SSSE3 optimization · 229690a9
      Yi Luo authored
      - Apply signal direction/4-pixel vertical/8-pixel vertical
        parallelism.
      - Add unit test to verify the bit-exact result.
      - Overall encoding time improves ~24% on Xeon E5-2680 CPU.
      
      Change-Id: I104dcbfd43451476fee1f94cd16ca5f965878e59
  17. 09 May, 2016 1 commit
      Fix dual filter type for high bit-depth · 9de916eb
      Jingning Han authored
      This commit fixes the compiler error in high bit-depth inter
      predictor when dual filter type experiment is turned on.
      
      Change-Id: I404a76a246477f2fcffc38a3275007d5dfe229cd
  18. 07 May, 2016 1 commit
  19. 26 Feb, 2016 1 commit
  20. 25 Feb, 2016 1 commit
      convolve8 sse2 test · 8878fa4f
      Angie Chiang authored
      This experiment shows that when the frame size is 64x64, the speeds of
      vpx_highbd_convolve8_sse2 and vpx_convolve8_sse2 are similar.
      However, when the frame size becomes 1024x1024,
      vpx_highbd_convolve8_sse2 is roughly 40-50% slower than vpx_convolve8_sse2.
      We think the bottleneck is memory IO.
      
      VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_8_64
      VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_8_64 (17 ms)
      VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_16_64
      VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_16_64 (42 ms)
      VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_32_64
      VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_32_64 (139 ms)
      VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_64_64
      VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_64_64 (499 ms)
      
      VP10ConvolveTest.vpx_convolve8_sse2_speed_l_8_64
      VP10ConvolveTest.vpx_convolve8_sse2_speed_l_8_64 (16 ms)
      VP10ConvolveTest.vpx_convolve8_sse2_speed_l_16_64
      VP10ConvolveTest.vpx_convolve8_sse2_speed_l_16_64 (40 ms)
      VP10ConvolveTest.vpx_convolve8_sse2_speed_l_32_64
      VP10ConvolveTest.vpx_convolve8_sse2_speed_l_32_64 (130 ms)
      VP10ConvolveTest.vpx_convolve8_sse2_speed_l_64_64
      VP10ConvolveTest.vpx_convolve8_sse2_speed_l_64_64 (485 ms)
      
      VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_8_1024
      VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_8_1024 (32 ms)
      VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_16_1024
      VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_16_1024 (61 ms)
      VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_32_1024
      VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_32_1024 (196 ms)
      VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_64_1024
      VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_64_1024 (694 ms)
      
      VP10ConvolveTest.vpx_convolve8_sse2_speed_l_8_1024
      VP10ConvolveTest.vpx_convolve8_sse2_speed_l_8_1024 (21 ms)
      VP10ConvolveTest.vpx_convolve8_sse2_speed_l_16_1024
      VP10ConvolveTest.vpx_convolve8_sse2_speed_l_16_1024 (44 ms)
      VP10ConvolveTest.vpx_convolve8_sse2_speed_l_32_1024
      VP10ConvolveTest.vpx_convolve8_sse2_speed_l_32_1024 (138 ms)
      VP10ConvolveTest.vpx_convolve8_sse2_speed_l_64_1024
      VP10ConvolveTest.vpx_convolve8_sse2_speed_l_64_1024 (491 ms)
      
      Change-Id: I3131a031e0380e8eae748cfcccc6cbb961d05943
  21. 20 Feb, 2016 1 commit
      Fix 12 TAP convolution bug · 1e403064
      Angie Chiang authored
      Previously, we did 12-tap interpolation even when there was no sub-pixel
      offset. This could cause a bug because the decoder doesn't extend the
      border when there is no sub-pixel offset. In this situation, if we still
      do interpolation, we access a border extension that doesn't exist,
      which causes a memory error.
      
      Change-Id: I55b879722f0a10c5d13261bd9617a75c826a2418
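The idea behind this fix can be sketched in C as follows. This is an illustration under assumptions, not the commit's code: the function name, the one-dimensional layout, and the 7-bit filter precision are hypothetical. When the sub-pixel offset is zero the row is copied and only `src[0..w-1]` is read; otherwise the 12-tap filter reads 5 pixels to the left and 6 to the right of each position, so a border must exist.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

#define SKETCH_TAPS 12       /* filter length */
#define SKETCH_FILTER_BITS 7 /* assumed precision: taps sum to 128 */

static uint8_t sketch_clip_pixel(int32_t v) {
  return (uint8_t)(v < 0 ? 0 : (v > 255 ? 255 : v));
}

/* Predict one row of w pixels. subpel_x is the fractional offset (0..15). */
static void sketch_predict_row(const uint8_t *src, uint8_t *dst, int w,
                               const int16_t *taps, int subpel_x) {
  assert(subpel_x >= 0 && subpel_x < 16);
  if (subpel_x == 0) {
    /* Integer position: plain copy, no reads outside src[0..w-1]. */
    memcpy(dst, src, (size_t)w);
    return;
  }
  for (int x = 0; x < w; ++x) {
    int32_t sum = 0;
    /* Reads src[x - 5] .. src[x + 6]: needs an extended border. */
    for (int k = 0; k < SKETCH_TAPS; ++k)
      sum += src[x - (SKETCH_TAPS / 2 - 1) + k] * taps[k];
    sum = (sum + (1 << (SKETCH_FILTER_BITS - 1))) >> SKETCH_FILTER_BITS;
    dst[x] = sketch_clip_pixel(sum);
  }
}
```

A caller that has not extended the border can therefore safely pass `subpel_x == 0`, but must not take the filtering path.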
  22. 06 Feb, 2016 1 commit