1. 02 Mar, 2017 6 commits
    • Debargha Mukherjee's avatar
      Some optimizations on integer affine estimation · 93105538
      Debargha Mukherjee authored
      1. Adds a limit on number of candidate samples used for the
      estimation.
      2. Adds a limit on max mv magnitude for use in the least-squares
      3. Makes some of the internal variables 32-bit.
      
      Impact on coding efficiency in the noise range.
      
      Change-Id: I8c1c3216368ceb2e3548660a3b8c159df54a8312
      93105538
    • Steinar Midtskogen's avatar
      Remove ASM_REGISTER_STATE_CHECK when testing v64/v128/v256 intrinsics · c20176e5
      Steinar Midtskogen authored
      Since the tested functions are always forced inline in regular use,
      ASM_REGISTER_STATE_CHECK doesn't make sense on this level (the test
      should rather be applied to unit tests checking functions making use
      of these inlined functions).  The test fails on Win64 because the
      Win64 ABI requires xmm6 to xmm15 to be preserved across function
      calls, but the ABI is only relevant for non-inlined functions.
      
      BUG=aomedia:371
      
      Change-Id: Icb795083f69465cf09ec8f6871899943efaeaab8
      c20176e5
    • David Barker's avatar
      Remove double rounding in selfguided filter · 7dcd7f5e
      David Barker authored
      In av1_selfguided_restoration, the values stored into 'dgd' are
      unnecessarily rounded twice. This patch replaces this by a single
      rounding operation.
      
      Change-Id: I188d283137b74823f5d5447d441250520d6ee294
      7dcd7f5e
    • Jingning Han's avatar
      Avoid the use of undefined marco value · b83e64ba
      Jingning Han authored
      Always define USE_TXTYPE_SEARCH_FOR_SUB8X8_IN_CB4X4 to avoid the
      use of undefined value.
      
      Change-Id: I0ad90c5b5316db231e9538487bb4591dfd6a9ce7
      b83e64ba
    • Yue Chen's avatar
      Use 3-tap spatial filter in FILTER_INTRA experiment · 8d8638a1
      Yue Chen authored
      3-tap recursive intra prediction filters are added.
      Macro USE_3TAP_INTRA_FILTER is set to 1 to use 3-tap by default.
      Coding gain of FILTER_INTRA experiment in AWCY, high delay 150f
      3-tap: 0.51%
      4-tap: 0.68%
      
      Change-Id: I44192dd08bfd8155f58a9b0b5cf1de88fceb762e
      8d8638a1
    • Sarah Parker's avatar
      Turn off global motion for sub8x8 blocks · ae7c458a
      Sarah Parker authored
      Lowres: 0.03% improvement, 1% improvement on waterfall_cif.y4m
      Midres: 0.085% overall improvement, 1.253% improvement on station2_480p25.y4m
      Change-Id: I3872934d978bb4ca828c6b9acd2fdb951d9da299
      ae7c458a
  2. 01 Mar, 2017 22 commits
  3. 28 Feb, 2017 9 commits
    • Angie Chiang's avatar
      Turn on SIMD implementation of av1_fht32x32 · e4f98f67
      Angie Chiang authored
      Change-Id: Ie1bfece43c81ee5d149ed25c3f7fd959a8f95030
      e4f98f67
    • Michael Bebenita's avatar
      Add SIMD code for PVQ search · 3a88de8f
      Michael Bebenita authored
      This reduces the runtime profile of pvq_search_rdo_double from 37%
      to 15% and improves overall encoding speed when PVQ is enabled by ~40%.
      The SIMD code is not bit accurate with the C version and introduces a
      slight PSNR regression on AWCY:
      
        PSNR | PSNR Cb | PSNR Cr | PSNR HVS | SSIM | MS SSIM | CIEDE 2000
      0.0607 |  0.1044 |     N/A |   0.0126 |  N/A | -0.0309 |        N/A
      
      Change-Id: Ie22cebc62df2e72618305f2268668d79167860c6
      3a88de8f
    • Angie Chiang's avatar
      Add av1_cost_coeffs_txb() for lv_map experiment · 47c72189
      Angie Chiang authored
      Change-Id: I44842387207b19f8e0c3894d3f4e8d0646a4cafd
      47c72189
    • Alex Converse's avatar
      Simplify rabs_read() · bff32ac0
      Alex Converse authored
      This is branchless on newer gcc and clang and is about 1% faster overall
      at cq-level=16 frame-parallel=1.
      
      Change-Id: I7f5608ab0f0abbc29aa3419a103addf945ea9f0a
      bff32ac0
    • Urvang Joshi's avatar
      SMOOTH_PRED: Use 8-bit weights. · 3e42acd4
      Urvang Joshi authored
      Using 8-bit weights gives similar results as 12-bit, with only noise
      level difference. Here's what 8-bit looks like compared to 12-bit:
      
      * AWCY Objective-1-fast:
                                high latency          low latency
      ALL keyframes             0.00                  0.01
      Video                     0.00                  0.04
      
      * Google sets:
      
      All Keyframes:
      lowres: 0
      midres: -0.001
      hdres: -0.001
      
      Video overall:
      lowres: 0
      midres: -0.063
      hdres: 0.026
      
      Change-Id: Ibed6015aa7cce12fcc6f314ffde76624df4ad2a1
      3e42acd4
    • Debargha Mukherjee's avatar
      Assign offsets correctly to compute warped motion · 246d2737
      Debargha Mukherjee authored
      Offsets for the least-squares for affine motion computation
      are now set at the top left corner of the current block.
      
      Improves stability and performance a little.
      
      Change-Id: I68ca7e74c6102502daa8ca3373af2b2dd59400c3
      246d2737
    • Jingning Han's avatar
      Disable compound mode in sub8x8 coding blocks · c41a549a
      Jingning Han authored
      Disable the support of compound prediction modes for sub8x8 codking
      blocks. Make the rate-distortion optimizations process account for
      such constraints.
      
      With the use 2x2 chroma prediction block, this makes the wrost case
      number of inter predictors same as vp9. It affects the coding
      gains by 0.35% for lowres, 0.17% for midres, and 0.08% for hdres.
      
      The encoding speed is up by 10%.
      
      Change-Id: Ieb2a83030676911baa403e586f1f800cbf485d81
      c41a549a
    • Yaowu Xu's avatar
      Use correct segment · 1e2aae1a
      Yaowu Xu authored
      Segmment based lossless flag is used in select transform size, this
      commit fixes a bug where wrong segment_id is used in such selection.
      
      BUG=aomedia:350
      
      Change-Id: Ibc981c779739849bac00447155180abbd319eb28
      1e2aae1a
    • Yaowu Xu's avatar
      Move asserts into correct scope · cdf8a14e
      Yaowu Xu authored
      The macro used in assert is defined under CONFIG_VAR_TX. This fixes a
      build issuse when --enable-var-tx and --enable-rd-debug are both on.
      
      Change-Id: I497fe4a8b1fa6c7b05ac2b41c97522f7bdedc0ce
      cdf8a14e
  4. 27 Feb, 2017 3 commits