1. 12 Jul, 2016 1 commit
      HBD convolution filtering (10/12 taps) SSE4.1 optimization · 8cacca73
      Yi Luo authored
      - For experiment EXT_INTERP under high bit depth.
      - Add unit test to verify bit-exactness.
      - Speed performance improvement:
        On Xeon E5-2680, park_joy_1080p_12.y4m, 50 frames, encoding time
        drops from 6682503 ms to 5390270 ms (about 19% less).
      
      Change-Id: Iea4debf5414f3accf1eb5672abeab56a0539ac77
  2. 27 Jun, 2016 1 commit
      Fix bugs in convolution filter optimization · 8404253f
      Yi Luo authored
      - Fix the overwriting bug in horizontal filtering when width = 2.
      - Fix 10-tap vertical filtering, which no longer reads one row of
        pixels above the block.
      - Fix 10-tap filter zero padding.
      - Encoder speed slows down by ~4.0% compared to
        81ad9536 Convolution vertical filter SSSE3 optimization
      
      Change-Id: I9bb294a4529300081c29bf284e6bc6eb081cc536
  3. 23 Jun, 2016 1 commit
      Convolution vertical filter SSSE3 optimization · 81ad9536
      Yi Luo authored
      - Apply 8-pixel vertical filtering direction parallelism.
      - Add unit tests to verify bit-exactness.
      - Encoder speed improves ~29% (with EXT_INTERP enabled) on Xeon E5-2680.
      - Combinational cycle count of vp10_convolve() drops from 26.06%
        to 6.73%.
      
      Change-Id: Ic1ae48f8fb1909991577947a8c00d07832737e57
  4. 20 Jun, 2016 1 commit
      Convolution horizontal filter SSSE3 optimization · 229690a9
      Yi Luo authored
      - Apply signal direction/4-pixel vertical/8-pixel vertical
        parallelism.
      - Add unit test to verify the bit-exact result.
      - Overall encoding time improves ~24% on Xeon E5-2680 CPU.
      
      Change-Id: I104dcbfd43451476fee1f94cd16ca5f965878e59
  5. 19 May, 2016 1 commit
      Properly handle the filter extension in highbd setting · d84a2e7d
      Jingning Han authored
      This commit makes the filter extension in highbd aware of the
      dual filter and ext-interp experiments to prevent enc/dec mismatch
      when both experiments are turned on.
      
      Change-Id: I11ac1f041bd5f73d61e839d6386d9c5d008da3f7
  6. 16 May, 2016 1 commit
      Properly handle 2D filter boundary extension · 14dd5538
      Jingning Han authored
      The amount of border extension needed in the first stage inter
      filtering is decided by the length of the second stage filter
      kernel.
      
      Change-Id: Icddbc58c02234d5df09ff0eeebcf166ffe689203
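      The relationship described in this commit message can be sketched as
      follows. This is only an illustration, not the code from the commit:
      the `StageExtent` type and `first_stage_extent` helper are hypothetical
      names, and the split of `taps / 2 - 1` rows above and `taps / 2` rows
      below assumes an even-length kernel applied at unit step.

```c
/* Hypothetical helper, not part of the codebase: how many rows the first
 * (horizontal) stage must produce so that a second-stage (vertical) kernel
 * with `second_taps` taps can compute `h` output rows at unit step. */
typedef struct {
  int rows_above; /* extra rows needed above the block */
  int rows_below; /* extra rows needed below the block */
  int total_rows; /* rows the first stage has to filter */
} StageExtent;

static StageExtent first_stage_extent(int h, int second_taps) {
  StageExtent e;
  /* Assumption: even tap count, kernel centred between taps/2-1 and taps/2. */
  e.rows_above = second_taps / 2 - 1;
  e.rows_below = second_taps / 2;
  e.total_rows = h + e.rows_above + e.rows_below;
  return e;
}
```

      Under these assumptions, an 8-row block followed by a 12-tap second
      stage needs 8 + 5 + 6 = 19 first-stage rows, regardless of how long
      the first-stage kernel is.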
  7. 09 May, 2016 1 commit
      Fix dual filter type for high bit-depth · 9de916eb
      Jingning Han authored
      This commit fixes the compiler error in the high bit-depth inter
      predictor when the dual filter type experiment is turned on.
      
      Change-Id: I404a76a246477f2fcffc38a3275007d5dfe229cd
  8. 07 May, 2016 1 commit
  9. 30 Mar, 2016 1 commit
      Extend superblock size to 128x128 pixels. · 552d5cd7
      Geza Lore authored
      If --enable-ext-partition is used at build time, the superblock size
      (sometimes also referred to as coding unit (CU) size) is extended to
      128x128 pixels.
      
      Change-Id: Ie09cec6b7e8d765b7555ff5d80974aab60803f3a
  10. 26 Feb, 2016 1 commit
  11. 20 Feb, 2016 1 commit
      Fix 12 TAP convolution bug · 1e403064
      Angie Chiang authored
      Previously, we did 12-tap interpolation even when there was no sub-pixel
      offset. This could cause a bug because the decoder doesn't extend the
      border when there is no sub-pixel offset. In this situation, if we still
      interpolate, we access a border extension that doesn't exist, which
      causes a memory error.
      
      Change-Id: I55b879722f0a10c5d13261bd9617a75c826a2418
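      A minimal sketch of the idea behind this fix, assuming positions are
      stored in 1/16-pel units: when the fractional part is zero, copy the
      pixels instead of filtering, so nothing outside the block is read. The
      names `has_subpel` and `predict_row`, the 1-D layout, and the 7-bit
      kernel precision are assumptions made for illustration and are not
      taken from the commit.

```c
#include <stdint.h>
#include <string.h>

#define SUBPEL_BITS 4 /* assumed 1/16-pel precision */
#define SUBPEL_MASK ((1 << SUBPEL_BITS) - 1)

/* Nonzero when the position has a fractional (sub-pixel) part. */
static int has_subpel(int pos_q4) { return (pos_q4 & SUBPEL_MASK) != 0; }

/* Hypothetical 1-D predictor. `src` points at the integer position of the
 * first output pixel. The kernel is assumed to sum to 128 (7-bit precision)
 * and to have an even number of taps. */
static void predict_row(const uint8_t *src, uint8_t *dst, int w, int pos_q4,
                        const int16_t *kernel, int taps) {
  if (!has_subpel(pos_q4)) {
    /* Integer position: plain copy, touches only src[0 .. w-1]. */
    memcpy(dst, src, (size_t)w);
    return;
  }
  /* Fractional position: reads taps/2-1 pixels before and taps/2 pixels
   * after each output, so the caller must have extended the border. */
  for (int x = 0; x < w; ++x) {
    int sum = 0;
    for (int k = 0; k < taps; ++k)
      sum += kernel[k] * src[x - (taps / 2 - 1) + k];
    sum = (sum + 64) >> 7; /* round and drop the 7 fractional bits */
    dst[x] = (uint8_t)(sum < 0 ? 0 : (sum > 255 ? 255 : sum));
  }
}
```

      With this guard, a block at an integer position can be predicted from
      a buffer that has no border extension at all, which matches the decoder
      behaviour described in the commit message.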
  12. 06 Feb, 2016 1 commit