1. 27 Dec, 2017 1 commit
  2. 22 Dec, 2017 1 commit
    • Linfeng Zhang's avatar
      Add av1_get_nz_map_contexts_sse2() · 0ba23e86
      Linfeng Zhang authored
      10x - 50x faster than C code.
      
      av1_cost_coeffs_txb() is about 6% faster.
      
      av1_cost_coeffs() is about 3% faster.
      
      Change-Id: Ib9cbed02a65b9cb0c5deb7a5d99c95d0d8ba32c0
      0ba23e86
  3. 21 Dec, 2017 3 commits
  4. 20 Dec, 2017 1 commit
  5. 19 Dec, 2017 1 commit
  6. 18 Dec, 2017 1 commit
    • Cheng Chen's avatar
      JNT_COMP: add SIMD and interface for high bit-depth · bf3d4964
      Cheng Chen authored
      Add high bit-depth macro definitions:
      highbd_jnt_sad
      highbd_8(10/12)_jnt_sub_pixel_avg.
      
      Add SIMD functions:
      aom_highbd_jnt_comp_avg_pred_sse2
      aom_highbd_jnt_comp_avg_upsampled_pred_sse2
      
      This patch also solves the seg fault caused by low bit-depth and
      high bit-depth paths
      
      BUG=aomedia:967
      BUG=aomedia:944
      
      Change-Id: Iea69f114e81ca226a30d84a540ad846f1b94b8d6
      bf3d4964
  7. 15 Dec, 2017 1 commit
    • Johann's avatar
      add copyright to rtcd files · aecbba6d
      Johann authored
      Allows them to pass the license check in chromium.
      
      Based on libvpx e4b3f03
      
      BUG=chromium:795297
      
      Change-Id: I2bb49ecb62f20d7bc5093a1732b6a8228ef5c87f
      aecbba6d
  8. 14 Dec, 2017 1 commit
    • Urvang Joshi's avatar
      round_shift_array: Use SSE4 version everywhere. · 1ac47a7c
      Urvang Joshi authored
      Usage of CPU by round_shift_array goes from 2.01% to 1.04%.
      Overall encoding is slightly faster (~0.05%).
      
      This means some of the intermediate array have to be aligned.
      Also, these functions were moved to common header/source files.
      
      BUG=aomedia:1106
      
      Change-Id: I492c9b1f2e7339c6cb83cfe68a61218642654d1b
      1ac47a7c
  9. 13 Dec, 2017 4 commits
  10. 05 Dec, 2017 4 commits
  11. 04 Dec, 2017 4 commits
  12. 30 Nov, 2017 3 commits
  13. 29 Nov, 2017 3 commits
    • James Zern's avatar
      Unify highbd loopfilter function names · 684b7bd1
      James Zern authored
      Rename aom_highbd_lpf_horizontal_edge_8() to aom_highbd_lpf_horizontal_16().
      Rename aom_highbd_lpf_horizontal_edge_16() to aom_highbd_lpf_horizontal_16_dual().
      
      based on the same change from libvpx:
      7f1f35183 Unify loopfilter function names
      
      Change-Id: I40cd587e74e0fe02bae23e6c10280c8e269df1d6
      684b7bd1
    • Sebastien Alaiwan's avatar
      Remove MRC_TX experiment · 9f001f35
      Sebastien Alaiwan authored
      This experiment has been abandonned for AV1.
      
      Change-Id: Ib9a8eea6b0ad5580dc81836be8fc5e898bb9bb45
      9f001f35
    • Yaowu Xu's avatar
      Consider quantizer error in coefficient clamping · f013205a
      Yaowu Xu authored
      BUG=aomedia:1074
      
      Change-Id: Icb973b56ac60780ef4fa61bcd4114de37f6fd120
      f013205a
  14. 28 Nov, 2017 1 commit
    • Yi Luo's avatar
      Fix the dual loopfilter for cb4x4 · 771a80ab
      Yi Luo authored
      In cb4x4, dual loopfilter filters 2 * 4 = 8 pixels.
      This patch does not influence encoder/decoder since
      they are not applied in bit mask implementation.
      
      Change-Id: Ifdeb8990127de39143971156db69a69ee3bd3136
      771a80ab
  15. 27 Nov, 2017 1 commit
    • James Zern's avatar
      Unify loopfilter function names · 1dbe80bc
      James Zern authored
      Rename aom_lpf_horizontal_edge_8() to aom_lpf_horizontal_16().
      Rename aom_lpf_horizontal_edge_16() to aom_lpf_horizontal_16_dual().
      
      based on the same change from libvpx:
      7f1f35183 Unify loopfilter function names
      
      Change-Id: I4fda7a2e3a893fc3dee0779975e2d4145c32f5d2
      1dbe80bc
  16. 25 Nov, 2017 2 commits
    • Sebastien Alaiwan's avatar
      Remove dead convolve functions · 4e510826
      Sebastien Alaiwan authored
      Change-Id: I5ec79635c716b2d1f1b200dcc3067213f2eedd08
      4e510826
    • Sebastien Alaiwan's avatar
      Split big file into two · 20dadeae
      Sebastien Alaiwan authored
      The file sad.c alone takes 35 seconds to compile.
      This often happens to be on the build critical path.
      Split it into two source files so they can be compiled in parallel.
      
      Change-Id: I35636d8a3da9d67edb8dbf202fd5e7a687a6aaa9
      20dadeae
  17. 23 Nov, 2017 3 commits
  18. 22 Nov, 2017 2 commits
    • Cheng Chen's avatar
      JNT_COMP: add ssse3 implementations for sad_avg · d0179a6b
      Cheng Chen authored
      Add ssse3 implementations for the sad_avg c function at low bit-depth.
      With this, aom_jnt_sad c functions can all have simd implementations.
      This CL follows existing MACRO definitions for multiple combinations
      of block sizes.
      
      Change-Id: I882343684026525f5589a239337cfac2dd411e11
      d0179a6b
    • Cheng Chen's avatar
      JNT_COMP: SIMD implementation for aom_jnt_sub_pixel_avg · d286443c
      Cheng Chen authored
      Change function names and add SIMD implementation for two c functions:
      (1) var_filter_block2d_bil_first_pass
      (2) var_filter_block2d_bil_second_pass
      
      This CL allows aom_jnt_sub_pixel_avg_variance now in SIMD.
      
      Change-Id: Ib41ef13d62ae91a0ca481bcebb24568dcd4722c4
      d286443c
  19. 21 Nov, 2017 2 commits
  20. 20 Nov, 2017 1 commit
    • Monty Montgomery's avatar
      Move Daala TX to fixed coeff depth of 12 (Q4) · 358abfb7
      Monty Montgomery authored
      This patch activates all the preceeding work, moving Daala TX to a
      greater, fixed coefficient depth (12).  This reclaims the regression
      caused by going to Q3.
      
      subset-1:
      monty-rest-of-stack-rmscale-s1@2017-11-13T14:40:20.646Z ->
       monty-rest-of-stack-Q4-s1@2017-11-13T14:40:44.807Z
      
         PSNR | PSNR Cb | PSNR Cr | PSNR HVS |    SSIM | MS SSIM | CIEDE 2000
      -0.0745 |     N/A | -0.1040 |  -0.1017 | -0.0660 | -0.0522 |    -0.0806
      
      Change-Id: If2a0853b320d57c2fa3a66f919ceb2dc526d017f
      358abfb7