1. 07 Oct, 2017 4 commits
    • Rupert Swarbrick's avatar
      Upscale frame correctly in foreach_rtile_in_tile · b66894ac
      Rupert Swarbrick authored
      The restoration tiles (rtiles) divide the upscaled frame, not the
      encoded one.
      Change-Id: I2d08fe926d694fee7064461685289d3fd1c1de0c
    • Debargha Mukherjee's avatar
      Remove the speed optimization for rd_stats_stack · 9245d89d
      Debargha Mukherjee authored
      This optimization for speed was useful only when max tx-size
      was 32x32. However with tx64x64 this was breaking certain assumptions
      causing huge drops in coding efficiency. So I am removing this
      optimization for now. This can be brought back latger as a speed feature.
      The removal of this optimzation brings back the loss when 32x64
      and 64x32 transforms are used.
      Change-Id: I15987ea9ff53fa36a2962fe5f156c30a11e809ed
    • Joe Young's avatar
      [intra-edge] Pad intra edge samples to avoid valgrind warning · 7cfd5343
      Joe Young authored
      The SSE4 function filter_intra_edge_sse4_1() reads data slightly
      past the initialized part of the array. Those data are discarded
      later, but causes a valgrind warning. This change avoids the warning
      by initializing the array an extra +16 positions.
      Change-Id: Ib610492cff91492ae379c5d62895773f8747c4bc
    • Luc Trudeau's avatar
      [CFL] Extract sum top row and left col from custom DC_PRED · 13281d4f
      Luc Trudeau authored
      To simplify high bit depth commit, the summing the top row and the left
      column are extracted out of cfl_dc_pred. This does not change the
      Change-Id: I5c9fe91df4942f736c5af29c1d93abb3a6c8501f
  2. 06 Oct, 2017 17 commits
    • Jingning Han's avatar
      Rework key frame intra mode context model · a45d842d
      Jingning Han authored
      Reduce the context model size for key frame modes from 30240 bits
      to 4500 bits, i.e., less than 1/6 of the original context model.
      The coding performance loss on key frame is 0.14% for lowres and
      noise level difference for video sequence. The loss on key frame
      for midres is 0.05% and noise level for whole video. The change
      on hdres kf coding is 0.015%.
      Change-Id: I9e36825e5c5ee6ba35038c3ca349ad1ad3429910
    • Debargha Mukherjee's avatar
      Avoid large stack allocations · 5d108a36
      Debargha Mukherjee authored
      When ext-partition and ncobmc-adapt-weight is on, avoid too large
      stack allocations.
      Change-Id: I8db74e45cac80c4e5dfd9e20cfc73d9978d1578e
    • Angie Chiang's avatar
      Reduce division operations in update_scan_prob · bbcd8f76
      Angie Chiang authored
      Change-Id: I923931a9dbf828eb13670511852d55c953b479c1
    • Sebastien Alaiwan's avatar
      Avoid left shift of negative value · 92acb81b
      Sebastien Alaiwan authored
      This is undefined behaviour in C99 and could mislead the optimizer.
      This fixes the ubsan warning, and still generates optimal code
      (i.e an inlined 'sar' instruction).
      Change-Id: I36b20a6780532b8c9379b9fbfd970933d56b1bc5
    • Alexander Bokov's avatar
      Predict skip flag to speed up the TX type search · 8829a24d
      Alexander Bokov authored
      Average speed-up (lowres):
      low bitrates: 6.6%
      mid bitrates: 2.5%
      high bitrates: 0.0%
      Average PSNR loss:
      lowres: 0.010%
      midres: 0.005%
      Change-Id: Id34fb247e5e31f04ca324c58142e4b5ac4edacda
    • Yi Luo's avatar
      Lowbd SMOOTH_PRED intrapred ssse3 optimization · 46ae1ea3
      Yi Luo authored
      On i7-6700:
      Predictor    ssse3 v. C
      4x4          ~1.3x
      4x8          ~1.9x
      8x4          ~2.3x
      8x8          ~3.4x
      8x16         ~4.1x
      16x8         ~4.6x
      16x16        ~5.2x
      16x32        ~5.6x
      32x16        ~4.2x
      32x32        ~4.7x
      Change-Id: Ic12383cf9d4446361d6355eb8a480a3c7602060e
    • Sebastien Alaiwan's avatar
      Explicit requirement about sizeof(tran_low_t) · 698af562
      Sebastien Alaiwan authored
      Here, we're testing CONFIG_HIGHBITDEPTH but what we really depend upon
      is the actual size of the coefficients.
      Change-Id: I33d71e4b38b4b83bb4232346f4d449f20bcf740e
    • RogerZhou's avatar
      Fixed a bug in hash ME when it is used in intrabc · 2b6456c1
      RogerZhou authored
      Change-Id: I4f158dd15e4a7a8abb27ad0ad6aed6f8f9fce4f6
    • Rupert Swarbrick's avatar
      Simplify the ALL_ZERO_FLAG logic in av1_rd_pick_intra_mode_sb · 799ff701
      Rupert Swarbrick authored
      Since the CONFIG_EXT_INTER #if/#endif lines have been removed, it's a
      bit clearer what's going on here and this patch cleans up the code.
      Firstly, the patch pulls the cheap checks on best_mbmode.ref_frame out
      to the front of the block, so we needn't call gm_get_motion_vector at
      all for compound predictions.
      Next, second element of the zeromv array is never used, so we needn't
      compute it.
      Finally, the patch removes the calls to lower_mv_precision. These
      shouldn't be needed, but it's not exactly obvious why not so the patch
      adds some comments to gm_get_motion_vector to explain what's going on
      and adds an assertion to make sure they are true. It also adds a call
      to integer_mv_precision on the early return path of
      gm_get_motion_vector, correcting an apparent bug when CONFIG_AMVR is
      This patch shouldn't make any difference to encoder or decoder
      Change-Id: I0b4a01063574d080bbf6d30187f4e1748c60939d
    • Angie Chiang's avatar
      Implement non-recursive av1_update_scan_order · da4bbb51
      Angie Chiang authored
      The performance difference is
      lowres: 0.02% gain
      midres: 0.07% gain
      Change-Id: I68a74462f41db3bf24573cf2a08c8b5b8aa13f5f
    • Debargha Mukherjee's avatar
      Turn Loop Restoration on by default · 6f03dcb8
      Debargha Mukherjee authored
      This is an adopted experiment.
      Change-Id: If33f700c12fb3fd0f8368714d5e9795aad3729d8
    • Yunqing Wang's avatar
      Fix loop-restoration sub-pixel filter function · 93d14afb
      Yunqing Wang authored
      This patch fixed loop-restoration subpixel_8t_ssse3 function
      so that it works with CONFIG_PIC.
      Change-Id: I178efae161a2b4c8388c141c378c99149ea3a847
    • Sebastien Alaiwan's avatar
      Fix warning about bitwise 'not' on boolean · cf26ee5a
      Sebastien Alaiwan authored
      Change-Id: I4732dbbb71a0db9ac284a4b2ae5f10816e0e9264
    • RogerZhou's avatar
      Extend IntraBC to 4x4 · ca86546f
      RogerZhou authored
      Change-Id: I3f30c35bcd1bc623ad0c34c4b954ff71b2fcfd00
    • Hui Su's avatar
      Remove entropy-stats code on decoder side · 08468197
      Hui Su authored
      Change-Id: I22f4ea3b8348816d4592645e9994b4b0ff29868c
    • Debargha Mukherjee's avatar
      Change tolerance for resize_test · 34e3a145
      Debargha Mukherjee authored
      Re-enable test with intra-edge
      Change-Id: I189398a2426a8ca66edaf15f91286a2aff9b47b0
    • Debargha Mukherjee's avatar
      Fix compile conflict w/ unpoison-partition-context · ab8bf508
      Debargha Mukherjee authored
      Fixes compile failures with unpoison-partition-context and
      Change-Id: Ibedab7d927d6bf7329a5fc5a3050aa6272d4b254
  3. 05 Oct, 2017 19 commits