1. 14 Feb, 2018 19 commits
    • Peng Bin's avatar
      Refactor inv_cos_bit for speedup · 28744b5c
      Peng Bin authored
      Replace the last parameter cos_bit for all 1D inv_txfm funcions with
      a macro define, as it is actually always equal to 12. By changing it
      to const value, compiler can do further optimization.
      
      Change-Id: If8a9fd99c7ac7eb6f485dafbce22b4803efda62e
      28744b5c
    • Linfeng Zhang's avatar
      Implement fdct4x8_new_sse2 and fadst4x8_new_sse2 · 043f4964
      Linfeng Zhang authored
      Change-Id: I9ab260c5ca31fe7e06bfc0f806893463c5255c45
      043f4964
    • Linfeng Zhang's avatar
      Implement fdct4x4_new_sse2 · 1fffc1f4
      Linfeng Zhang authored
      Change-Id: I5b905d54f7e551b15dac5f9cfd4e6d96e19d52c3
      1fffc1f4
    • Linfeng Zhang's avatar
      Enable av1_lowbd_fwd_txfm2d_4x8_sse2 · 686c5979
      Linfeng Zhang authored
      Forgot to put it in the function table.
      
      Change-Id: I86192158f74df84fca6c4a3dbda8b01659f6c9bd
      686c5979
    • Linfeng Zhang's avatar
      Implement av1_lowbd_fwd_txfm2d_8x4_sse2 · 7bd00743
      Linfeng Zhang authored
      So far the implemented av1_lowbd_fwd_#x#_sse2 provides 10% encoder
      speed up on speed 1.
      
      Change-Id: I3dab438c4498059262b065300743ba1519db64b4
      7bd00743
    • Michelle Findlay-Olynyk's avatar
      hash_based_trellis speed feature update · dea531da
      Michelle Findlay-Olynyk authored
      Add speed feature that uses hash tables to
      reuse deltas from previously found optimized
      coefficients in av1_optimize_txb. This skips some
      expensive optimize_txb calls.
      
      Work stopped (no current benefit):
      Current setup has worse speed (~1%), and is
      turned off by default in speed_features.c.
      Overall PSNR was ~0.1% worse. See
      go/hashBasedTrellis for feature details.
      
      Requires hash_me.
      
      Change-Id: Id12ada82984ea6682e2b6f27534fa57f3a3abb4a
      dea531da
    • Ola Hugosson's avatar
      [NORMATIVE] Enable CONFIG_EXT_INTRA_MOD2 by default · 63f1216a
      Ola Hugosson authored
      CONFIG_EXT_INTRA_MOD2 was provisionally adopted 16th of January
      
      Change-Id: I11f96cdff29a772813f0cc7cd4cae684e9a07448
      63f1216a
    • Peng Bin's avatar
      Refactor pair_set_epi16 for speedup · 8b8aaffc
      Peng Bin authored
      Use _mm_set1_epi32 instead of _mm_set_epi16, less instructions produced
      by compiler. This patch also removes the duplicate define of the same
      function.
      
      Speed test results:
      1. Unittest for each test cases in SSE2/AV1LbdInvTxfm2d shows 60%~80%
      speedup (except those case with TX_TYPE include iidentity)
      2. A brief speed test shows that with this CL, for speed1 encoder speeds up
      ~3% and decoder speeds up ~1.8%.
      (Baseline is 18976fa5)
      
      Change-Id: I2b0e12973fda05a21d6b6eb0f0efe11df6edfb84
      8b8aaffc
    • Yaowu Xu's avatar
      Remove unused variables · cbfffa8e
      Yaowu Xu authored
      Change-Id: I5290f94da6c1a0319357f84b2ec70b4331a0e4af
      cbfffa8e
    • Yaowu Xu's avatar
      Remove two more LPF macros · 8ec5c077
      Yaowu Xu authored
      Change-Id: I60278e399f4f65aa63526e459947e88084f0e889
      8ec5c077
    • Yaowu Xu's avatar
      Remove CONFIG_PARALLEL_DEBLOCKING · 6d0ed3ed
      Yaowu Xu authored
      The experiment is fully adopted now.
      
      Change-Id: I27906d2af4c746ce55aa17f64d1c0ef281e23ab2
      6d0ed3ed
    • Imdad Sardharwalla's avatar
      Increase seg_feature_data_max[SEG_LVL_REF_FRAME] · e4cf4fa4
      Imdad Sardharwalla authored
      Previously, segments using SEG_LVL_REF_FRAME were unable to signal the choices
      of GOLDEN = 4, BWDREF = 5, ALTREF2 = 6 and ALTREF = 7, as
      seg_feature_data_max[SEG_LVL_REF_FRAME] was set to 3. This patch increases the
      value to 7 to account for these options.
      
      BUG=aomedia:951
      
      Change-Id: I9732fa2be96ead2d4b6efdbce34a92e43c7dd04e
      e4cf4fa4
    • Imdad Sardharwalla's avatar
      Prevent undefined behaviour for AMVR experiment · bf2cc016
      Imdad Sardharwalla authored
      Sequences starting with intra-only frames previously resulted in undefined
      behaviour with CONFIG_AMVR == 1, as seq_force_integer_mv was only read for
      keyframes.
      
      This patch makes changes as follows:
      
      - The syntax element force_screen_content_tools has been added to the
        SequenceHeader struct, and is read and written correspondingly
      
      - seq_force_integer_mv has been renamed to force_integer_mv and moved to the
        SequenceHeader struct, and is read and written correspondingly (provided that
        force_screen_content_tools != 0)
      
      - The conditional reading/writing of allow_screen_content_tools now happens for
        every frame after reading/writing error_resilient_mode (CONFIG_OBU == 1) or
        the sequence header (CONFIG_OBU == 0)
      
      - The conditional reading/writing of cur_frame_force_integer_mv now happens for
        every frame after reading/writing allow_screen_content_tools
      
      BUG=aomedia:1048
      
      Change-Id: I689476fc2fa781dc8ec6fc8da91926cc8cfd3dc2
      bf2cc016
    • Yunqing Wang's avatar
      Remove redundancy code · 8da5addb
      Yunqing Wang authored
      Removed redundancy code in ref_mv finding.
      
      Change-Id: I4f9d0257a217e764cce820c01d66985778010de7
      8da5addb
    • Yunqing Wang's avatar
      [NORMATIVE] Consolidate reference mv clamping · 3e225434
      Yunqing Wang authored
      Clamp_mv_ref happens in multiple places in ref_mv search, which can be
      convoluted as reported in issue 1124. This change is to consolidate
      the clamping into one place.
      
      Borg test result on lowres set:
      avg_psnr:    ovr_psnr:   ssim:
        0.000       0.000      0.001
      
      BUG=aomedia:1377
      BUG=aomedia:1124
      BUG=aomedia:857
      
      Change-Id: I1649d5b5f37683c9c30e493c6eed13a808ab543a
      3e225434
    • Jingning Han's avatar
      [NORMATIVE] Scale up mfmv ref step size in 64x64 block · 73190512
      Jingning Han authored
      When the coding block size has one side in length of 64 and above,
      scale up the mfmv reference search step size from 8 to 16 along
      that direction. The midres coding stats get 0.02% better. Among
      all the finished hdres points, no negative results showed up.
      
      BUG=aomedia:1379
      
      Change-Id: I70ab7a9f9d1cf365d8ed1e06dbede307b6bc46ec
      73190512
    • Jingning Han's avatar
      [NORMATIVE] Reduce spatial search region from 4 to 3 cols · 92446c52
      Jingning Han authored
      Reduce the ref mv search over spatial neighbors from 4 to 3
      columns.
      
      BUG=aomedia:1382
      
      Change-Id: I44eb96e2ff4243d720a5f4f68be504995ebd69b6
      92446c52
    • Zoe Liu's avatar
      Use the correct macro for the context number · 95dfd0bf
      Zoe Liu authored
      Change-Id: I94f739840b90688506ebf5a4e7cf355cd6fb75ed
      95dfd0bf
    • Hui Su's avatar
      Remove if() that is always true in select_tx_block() · 85e75f15
      Hui Su authored
      And some cosmetic changes.
      
      Change-Id: Ibbc62bbb023d80ceee83120b4f6eb2a95b1d1a64
      85e75f15
  2. 13 Feb, 2018 10 commits
  3. 12 Feb, 2018 11 commits