1. 12 Jul, 2016 1 commit
    • Yi Luo's avatar
      HBD convolution filtering (10/12 taps) SSE4.1 optimization · 8cacca73
      Yi Luo authored
      - For experiment EXT_INTERP under high bit depth.
      - Add unit test to verify bit-exact.
      - Speed performance improvement:
        On Xeon E5-2680, park_joy_1080p_12.y4m, 50 frames, encoding time
        drops from 6682503 ms to 5390270 ms.
      
      Change-Id: Iea4debf5414f3accf1eb5672abeab56a0539ac77
      8cacca73
  2. 09 Jul, 2016 1 commit
    • Yue Chen's avatar
      Fix assertion failures in mips+msa setting · 4ab19eac
      Yue Chen authored
      Directly call c functions, otherwise when EXT_TX is enabled, hybrid
      transform other than combination of DCT/ADST has not been implemented, thus
      will cause assertion failures in the switch loops in vp10_fhtnxn_msa() and
      vp10_ihtnxn_nxn_add_msa().
      
      BUG=webm:1239
      
      Change-Id: I2379a07e5406f9489edcd2f3205682f679c9b091
      4ab19eac
  3. 08 Jul, 2016 3 commits
  4. 07 Jul, 2016 5 commits
  5. 06 Jul, 2016 1 commit
  6. 04 Jul, 2016 1 commit
    • Geza Lore's avatar
      Refactoring in preparation for OBMC optimizations. · 007aa7dd
      Geza Lore authored
      - Use int32_t instead of int in vpx_obmc{variance,sad} functions
      - Remove weigthed_src and obmc mask strides and assume contiguous
        buffers. These inputs can always be packed as contiguous arrays.
      
      Change-Id: I74c09b3fb3337f13d39e13a9cb61e140536f345d
      007aa7dd
  7. 01 Jul, 2016 2 commits
    • Wei-ting Lin's avatar
      Remove reference frame buffer update for show_exsiting_frame · f9e38a7b
      Wei-ting Lin authored
      Originally we need to send the refresh flag and the virtual indices
      mapping for the reference frame buffer update for show_existing_frame to
      have the BWDREF_FRAME replace the LAST_FRAME.
      
      To remove sending this information, we update the the virtual indices
      of the reference frame buffer after the last_bipred_frame is encoded,
      and therefore the decoder will receive the updated reference mapping
      at the next non-show-existing frame.
      
      As a result, we can save 4 bytes per show-existing frame, and get 0.12,
      0.2, and 0.07 BDRATE improvement in lowres, derf, and midref test set
      respectively.
      
      Change-Id: I63d41ee6ea99884798f0778b789d2701e2f2d3e0
      f9e38a7b
    • Geza Lore's avatar
      Minor cleanup of inter mode search. · b04ea832
      Geza Lore authored
      Change-Id: I523a3b30eb80fc6c6ed83638fdb82cf65c22b2e5
      b04ea832
  8. 30 Jun, 2016 1 commit
    • Geza Lore's avatar
      Reject ext-inter compound modes based on modelled RD. · 532304e4
      Geza Lore authored
      Reject ext-inter compound modes before doing full rate distortion
      evaluation, if the corresponding single reference modes had a lower
      modelled RD.
      
      ext-inter speedup up to TBD.
      
      Coding performance: TBD
      
      Change-Id: I358bfb879c5ebe5e7afbf6f540cc784f8de14857
      532304e4
  9. 29 Jun, 2016 4 commits
  10. 28 Jun, 2016 2 commits
    • Geza Lore's avatar
      Remove skip_txfm optimization. · 92922be8
      Geza Lore authored
      Commit 0d6980d7 removed some use
      of the skip_txfm optimization, and the rest are not productive.
      
      The current use of this optimization is only used with --good
      and --cpu-used >= 3, however the overhead of this is higher than the
      speedup it yields.
      
      Removing this, and subsequently simplifying model_rd_for_sb yields
      a net encoder speedup:
      --cpu-used=0    ~1.5% faster
      --cpu-used=3    ~2.0% faster
      
      The code simplification is also significant.
      
      Change-Id: I1dd668c32de15a2e912c59c42379d0f9e1032ff8
      92922be8
    • Debargha Mukherjee's avatar
      Quantization fix for new-quant/var-tx · f3dfa0c3
      Debargha Mukherjee authored
      Also use the fp quantizer consistently
      
      lowres: -0.07 BDRATE improvement
      
      Change-Id: I9174f6ad54a74d38541004b99cb3689d0c09be55
      f3dfa0c3
  11. 27 Jun, 2016 2 commits
    • Jingning Han's avatar
      Disable trellis optimized quantization in the first-pass · 813201e1
      Jingning Han authored
      This resolves the use of uninitialized value in the first-pass
      encoding.
      
      Change-Id: I78bc19214a1bfde5c5641424550cbbe4e52cae99
      813201e1
    • Yi Luo's avatar
      Fix bugs in convolution filter optimization · 8404253f
      Yi Luo authored
      - Fix the over-writing bug in horizontal filtering as width = 2.
      - Fix 10-tap vertical filtering which no longer reads one row of
        pixel above the block.
      - Fix 10-tap filter zero padding.
      - Encoder speed slow down ~4.0%, compared to,
        81ad9536 Convolution vertical filter SSSE3 optimization
      
      Change-Id: I9bb294a4529300081c29bf284e6bc6eb081cc536
      8404253f
  12. 25 Jun, 2016 3 commits
  13. 24 Jun, 2016 3 commits
  14. 23 Jun, 2016 6 commits
  15. 22 Jun, 2016 5 commits
    • Jingning Han's avatar
      Refactor reference frame type defs · b605de07
      Jingning Han authored
      Move the reference frame type definitions to common/enums.h file.
      Replace hard coded numbers.
      Combine repeated definitions.
      
      Change-Id: I288e079a03e448014cc181bcdb3f88ee8ec8d139
      b605de07
    • Zoe Liu's avatar
      Remove unnecessary macros · cb2c037c
      Zoe Liu authored
      Change-Id: Id0975947b4e7b76b2c2464905f3b9a29245946c2
      cb2c037c
    • Yue Chen's avatar
      (Cosmetics) Remove unnecessary new parameters in obmc experiment · 02596589
      Yue Chen authored
      pred_variance in obmc experiment is equivalant to recon_variance in
      baseline
      
      Change-Id: Iba8fb9bd973898be5a0d87a507ceaf65c75bdc51
      02596589
    • Jingning Han's avatar
      Fix uninitialized context use case in supertx and var-tx · d2681556
      Jingning Han authored
      This commit fixes the use of uninitialized context values in the
      combination of supertx and var-tx.
      
      Change-Id: I2d36badf5c9806ea402ce3e19515cc299e6b79e8
      d2681556
    • Jingning Han's avatar
      Make drl support bi-directional reference frames · c2195c5b
      Jingning Han authored
      This commit refactors the reference frame structure used in the
      dynamic motion vector referencing system, and makes it support
      the bi-directional reference frames. This resolves unit test
      failure (enc/dec mismatch) when both are turned on.
      
      The compression performance (ref-mv + ext-refs) is improved by
      0.2% for lowres.
      
      Change-Id: I233624d8fccc1f69e82295f94de984ff056365dc
      c2195c5b