1. 14 Jun, 2017 12 commits
  2. 13 Jun, 2017 9 commits
    • Yi Luo's avatar
      Add fast path quantizer AVX2 · 2d44b697
      Yi Luo authored
      - Function level improves 36% against sse2.
      - Encoder speeds up 2.6% at user level on i7-6700.
      
      Change-Id: I9e43ce60b1e0de8f532249e5c035851463d75dbb
      2d44b697
    • Yushin Cho's avatar
      Fix to use correct tx w/h sizes with daala-dist · 1c602b38
      Yushin Cho authored
      In function av1_xform_quant().
      
      For subset1, high delay mode:
         PSNR | PSNR Cb | PSNR Cr | PSNR HVS |    SSIM | MS SSIM | CIEDE 2000
      -0.0565 | -0.0975 |  0.0317 |  -0.0937 | -0.0930 | -0.1261 |    -0.0994
      
      Change-Id: I638c2a4c2ad359709069c1587b677a813fb589c2
      1c602b38
    • Fergus Simpson's avatar
      Make loop-restoration compatible w/ frame_superres · 9cd57cf8
      Fergus Simpson authored
      There were several places where loop_restoration used the encoded width
      and height while superres was active. This patch changes it to use the
      upscaled width and height, since loop_restoration is supposed to occur
      after superres has done its upscaling.
      
      Change-Id: I2b9bbb06b5370618758bf81d8eb63f2eef26af80
      9cd57cf8
    • David Barker's avatar
      Replace '#ifdef CONFIG_...' with '#if CONFIG_...' · 26762357
      David Barker authored
      This affects two places:
      * Fixes a compile error with frame-superres when
        highbitdepth is disabled.
      * Avoids including some supertx-related code when supertx
        is disabled
      
      BUG=aomedia:602
      
      Change-Id: Idfc478fd88ade91d48c93cfd8abdd2bea86de898
      26762357
    • David Barker's avatar
      Remove unused 'inter_mode_probs/cdf' · 846f1086
      David Barker authored
      Change-Id: I4ac551f52f9a5f589d58b643a9849ec17e3c11e8
      846f1086
    • Yushin Cho's avatar
      Fix a bug in daala-dist · 09b01a24
      Yushin Cho authored
      Fix the bug that height of a partition is used as a stride mistakenly.
      This fixes the regression caused by sub8x8 tx size rd search
      for a partition >= 8x8.
      
      Change-Id: I6114814dcec70fd5198f681c0a861bc9849286fd
      09b01a24
    • Thomas Davies's avatar
      AOM_QM: Fix bug with applying QMs to 1D and identity transforms. · be44e512
      Thomas Davies authored
      Only apply quantisation matrices when the transform is 2D.
      
      Change-Id: Iae9ac910c11199f7944a80d10d334db3b711059d
      be44e512
    • Zoe Liu's avatar
      Add encoder/decoder pipeline to support single ref comp modes · 85b66463
      Zoe Liu authored
      Now the single ref comp mode should work with WEDGE and
      COMPOUND_SEGMENT. For motion_var, the OBMC_CAUSAL mode uses the 2nd
      predictor if the neighboring block is single ref comp mode predicted.
      
      This patch removes the mode of SR_NEAREST_NEWMV and leaves four
      single ref comp modes in total:
      
      SR_NEAREST_NEARMV
      SR_NEAR_NEWMV
      SR_ZERO_NEWMV
      SR_NEW_NEWMV
      
      Change-Id: If6140455771f0f1a3b947766eccf82f23cc6b67a
      85b66463
    • Yushin Cho's avatar
      Another fix of daala-dist for cb4x4 · c0f6bf25
      Yushin Cho authored
      Daala-dist replaces the luma distortion of sub8x8 partitions with
      its own distortion thus requires to split the luma distortion only.
      Doing so, there has been a bug that INT_MAX64 value comes
      when the sub8x8 parition is skipped. This happened because the existing
      code does not initialize the rd_stats_y or tmp_rd_stats_y, i.e. rd_stat struct
      for luma only in several places.
      
      Change-Id: If229b53bb7a6cff0b8751138a32b1dcf02665624
      c0f6bf25
  3. 12 Jun, 2017 10 commits
    • Fergus Simpson's avatar
      frame_superres: Fix bad assert · abd4343a
      Fergus Simpson authored
      Commit 12311 had a misplaced assert set that was causing superres debug
      runs to fail. The asserts just needed to be moved to where they were
      relevant to fix the issue.
      
      Change-Id: Ic370686c7156fcaf9380d8d8fd9d35b892d77e46
      abd4343a
    • Sarah Parker's avatar
      Clean up hbd transform code · 30dfa883
      Sarah Parker authored
      Responding to some left over cosmetic comments from
      2b5cdb1cf87c933331a16cc0221455d0a8c255e1
      
      Change-Id: I42e126593526cedd6675adf35b9c1df78e1ddf54
      30dfa883
    • Yushin Cho's avatar
      Fix a compile warning with global-motion off · c9751c59
      Yushin Cho authored
      Change-Id: I8379e4055e9c2737f1ad310095d7a318e6e74b2f
      c9751c59
    • Yushin Cho's avatar
      Fix a compile warning with dual-filter off · 60baa359
      Yushin Cho authored
      Change-Id: I68bb6ea8e26efe44bbbfb0f1a956620578578d55
      60baa359
    • Fergus Simpson's avatar
      frame_superres: Post encode/decode upscaling · d2bcbb56
      Fergus Simpson authored
      This patch implements the post-encode and post-decode upscaling for the
      frame superresolution experiment to work.
      
      Upscaling happens after cdef and before loop restoration.
      
      For now, this patch forces on random-superres.
      
      The patch also cleans up some broken rate control hooks from VP9
      days, to be brought back later when the resize and superres tools
      are stable.
      
      Change-Id: If0a8f69224dfaa0f4ae7703bd429ea2af953c7a6
      d2bcbb56
    • Yue Chen's avatar
      supertx: code refactoring + resolve conflicts with baseline · 8e689e4b
      Yue Chen authored
      Refactoring: split prediction+extension for each plane, so we can
      handle luma/chroma supertx pred in different ways.
      Compatibility fix: fix conflicts with cb4x4 and chroma_sub8x8, now
      for chroma sub8x8 supertx, only the top-left(basic cb4x4) or the
      the bottom-right(cb4x4 + chroma_sub8x8) predictor will be used
      without any blending within a 8x8 unit.
      
      Change-Id: I6cf7b12768a82d3c7e01811ada02de84af9bd8ac
      8e689e4b
    • Zoe Liu's avatar
      Remove the extra cost threshold modes · b2c62bc7
      Zoe Liu authored
      Change-Id: Id55bb8421edc0ccdf87db46f7b9b36b0a93a4efa
      b2c62bc7
    • Zoe Liu's avatar
      Add encoder/decoder support for var-refs · 7b1ec7a9
      Zoe Liu authored
      Check the availability of the reference frames at the frame level at
      both encoder and decoder, and if a reference frame is not available
      for a specific video frame, remove the signaling of such reference
      frame info at the block level.
      
      This patch adds the consideration of the bit saving inside the RD
      optimization loop.
      
      Change-Id: I4c22f1b843b21c7d2b47e118c99c3ad615a3d4e4
      7b1ec7a9
    • Steinar Midtskogen's avatar
      Speed up CDEF parameter selection for cpu-used > 0 · b1555c93
      Steinar Midtskogen authored
      High delay cpu-used=4
         PSNR | PSNR Cb | PSNR Cr | PSNR HVS |    SSIM | MS SSIM | CIEDE 2000
       0.0579 |  0.1380 | -0.1975 |   0.0361 |  0.0226 |  0.0072 |     0.0470
      
      Low delay cpu-used=4
         PSNR | PSNR Cb | PSNR Cr | PSNR HVS |    SSIM | MS SSIM | CIEDE 2000
      -0.1016 | -0.0695 |  0.1013 |  -0.1324 | -0.0903 | -0.1260 |    -0.1710
      
      Change-Id: I5a66a1ffc2d1fb2a203065b7fbb2fd2bd2b281ad
      b1555c93
    • Jingning Han's avatar
      Fix high bit-depth quantization process · 6ae7b3d9
      Jingning Han authored
      Scale the rounding factor according to the scaling factor applied
      to the quantization step size. This resolves a compression
      performance regression in 32x32 and above transform size.
      
      BUG=aomedia:599
      
      Change-Id: Id3fc9a46c4a8843ff5d77ccaa59ee3112b12d7f4
      6ae7b3d9
  4. 11 Jun, 2017 2 commits
  5. 10 Jun, 2017 1 commit
    • Timothy B. Terriberry's avatar
      var_tx: Fix distortion calc. in av1_tx_block_rd_b · ab141115
      Timothy B. Terriberry authored
      This was hard-coding the assumption that the block size for the
      smallest TX size was also the smallest block size. This is no
      longer true since fe67ed6a landed.
      
      As a result, for TX blocks that overlapped the frame edge, it was
      only measuring distortion on the upper-left 2x2 part of each 4x4
      sub-block, causing the encoder to prefer larger transforms which
      cause such overlap and avoid transforms which do not, causing a
      regression.
      
      This patch uses the appropriate conversion table, which fixes the
      regression.
      
      BUG=aomedia:593
      
      Change-Id: Id253cf0f3a5252378e3f340b8350120639ff5c88
      ab141115
  6. 09 Jun, 2017 6 commits
    • David Barker's avatar
      Vectorize av1_convolve_2d() · 8295c7c7
      David Barker authored
      Includes a test case based on the warp filter tests
      
      Change-Id: I9abea53a088f68bb8a928ebd7cb96b3266a63c13
      8295c7c7
    • Jonathan Matthews's avatar
      Avoid integer overflow in inv_txfm2d_add_facade() · 284f9d06
      Jonathan Matthews authored
      Bug introduced in change: I34cdeaed2461ed7942364147cef10d7d21e3779c
      
      BUG=aomedia:591
      
      Change-Id: I49b9edd2bf5a482b5afea5d83d56e04a0086f797
      284f9d06
    • David Barker's avatar
      Unify high-precision convolve filters: convolve-round · 726a953c
      David Barker authored
      * Reduce bit widths of intermediate values where possible
      * Change ROUND_POWER_OF_TWO_SIGNED to ROUND_POWER_OF_TWO
        in av1(_highbd)_convolve_2d
      * Apply offsetting and bounds checking, to match the intended
        hardware implementation
      * Separate the implementations of av1(_highbd)_convolve_2d
        into compound-round and non-compound-round cases. This is because
        there are now a significant number of differences between the
        functions.
      
      Overall, this is expected to affect the bitstream and encoder output
      when convolve-round alone is enabled, but *not* when compound-round
      is enabled.
      
      Change-Id: I8c21e0645fd11f64c59552885f87f4a5dd40ccf7
      726a953c
    • David Barker's avatar
      Add 'do_average' to ConvolveParams structure · e64d51a9
      David Barker authored
      The 'ref' member of ConvolveParams currently serves two purposes:
      * To indicate which component of a compound we're currently predicting,
        eg. for fetching interpolation filters with dual-filter enabled.
      * To determine whether we should average into the destination buffer.
      
      But there are two cases where we want to separate these out:
      * In joint_motion_search, we want to try combining a fixed second
        prediction with various first predictions.
      * When searching masked interinter compounds, we want to predict
        each component separately then try different combinations.
      
      In these cases, we set 'ref' to 0 and use temporary variables to
      make sure we use the correct interpolation filters. But this is
      quite fragile.
      
      This patch separates out the two uses into separate members.
      This allows us to remove some temporary variables, but more
      importantly gives easy fixes to two bugs in
      build_inter_predictors_single_buf (used by rdopt):
      
      * We previously set ref=0 but didn't fix up the interpolation filters
      * For ZERO_ZEROMV modes, the second component would accidentally
        average into the (uninitialized!) second prediction buffer
      
      BUG=aomedia:577
      BUG=aomedia:584
      BUG=aomedia:595
      
      Change-Id: Ibc31d1ac701a029ea5efaa1197dd402bc4b7af1e
      e64d51a9
    • Urvang Joshi's avatar
      Enable greedy version of av1_optimize_b(). · 09302f5a
      Urvang Joshi authored
      This has been found to be better than the original version in both ways:
      (1) Better compression: lowres -0.229, midres -0.147
      (2) Faster too in my quick test over 5 different clips with 30
      frames: 2.7% to 10.5% faster.
      
      Change-Id: I4d46e0915d6e4b8e7bfc03d0c8b88cbe3351ca20
      09302f5a
    • Thomas Davies's avatar
      AOM_QM: Use 8-bit matrices and fix 2x2 transform sizes. · 92aa22a8
      Thomas Davies authored
      2x2 transforms are now hidden behind the CHROMA_2X2 macro,
      not the CB4X4 macro.
      
      Change-Id: I5d73c679fba486ccda98fa8dbb804a3902df6c8d
      92aa22a8