  1. 31 Oct, 2017 4 commits
    • Disable interintra in >32x32 blocks · 9ddd409f
      Yue Chen authored
      Disabling such cases will reduce search time without hurting
      coding performance (-0.001%).
      
      Change-Id: Iaa4385053fcf5bd59fb1f94d5583eb19cf792242
    • Remove experimental flag of MOTION_VAR · 1bc94fcc
      Sebastien Alaiwan authored
      This experiment has been adopted, so we can simplify the code
      by dropping the associated preprocessor conditionals.
      
      Change-Id: I2dce80e1e1b2116708b6ba9feeacaacc12af8fc4
    • [CFL] 4:2:2 High Bit Depth · c8323c02
      Luc Trudeau authored
      Change-Id: I9f752dedfba29de9a4cfdd285c4b6dc32bd1630d
    • [CFL] Add 4:2:2 Support · 0cfffa7f
      Luc Trudeau authored
      When 4:2:2 subsampling is used, both horizontal pixels are added and
      shifted left by 1 (instead of a right shift of 2 followed by a left
      shift of 3).
      
      Change-Id: Ib84c51cafabe0bd0de02dfe7868278e44d76f6db
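
      A minimal sketch of why fusing the two shifts matters, using only the
      shift amounts quoted in the message above. It is written in Python for
      brevity, the function names are hypothetical, and it is not the libaom
      code: it only shows that shifting right before shifting left discards
      low-order bits of the pixel sum, while a single net left shift keeps them.

```python
def two_step_shift(pixel_sum: int) -> int:
    """Right shift by 2, then left shift by 3 (the form the message replaces)."""
    return (pixel_sum >> 2) << 3


def fused_shift(pixel_sum: int) -> int:
    """Single left shift by the net amount (3 - 2 = 1)."""
    return pixel_sum << 1


# Hypothetical sums of neighbouring luma samples (illustrative values only).
for pixel_sum in (8, 9, 10, 11):
    print(pixel_sum, two_step_shift(pixel_sum), fused_shift(pixel_sum))
```

      The two forms agree only when the sum is a multiple of 4; otherwise the
      two-step form rounds the sum down before scaling it.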
  2. 30 Oct, 2017 7 commits
    • Remove compound-segment/wedge config flags · 371968cd
      Debargha Mukherjee authored
      Change-Id: I39cfbb135add0553cadf64481b13786831fbdddd
    • Correct scaling in av1_loop_restoration_corners_in_sb · 34f2b74d
      Rupert Swarbrick authored
      I'd got the scaling backwards. This gets it right and adds a comment
      explaining the calculation.
      
      Change-Id: Ife2913700cc73996c09b702b394832799c449a8c
    • Speed up inter frame rate-distortion optimization · cf842ad2
      Jingning Han authored
      The frame marker system lets one map the reference frame
      index into the natural order. This allows directly checking the
      efficacy of the reference frames given their locations
      relative to the current coding frame.

      This commit uses that property to filter reference frames that are
      less likely to contribute coding gains out of the rate-distortion
      optimization process. For example, it skips the check on the
      last2 / last3 frames when their actual location is further away
      from the golden frame.

      The AWCY results show a 0.6% performance regression. The encoding
      speed doubles.

      To use the speed-up, one needs to turn on the frame-marker experiment
      (until it is turned on by default) and enable the selective_ref_frame
      entry in the speed feature.
      
      Change-Id: Ifb03ed90acd980bbc7ff1c2e17982e21e68d2588
    • Remove experimental flag of GLOBAL_MOTION · 48795807
      Sebastien Alaiwan authored
      This experiment has been adopted, so we can simplify the code
      by dropping the associated preprocessor conditionals.
      
      Change-Id: I9c9d6ef5317798cbf237307a9754fe7e03bdda47
    • [CFL] Sub8x8 Validation Code Rewrite · c7af36d4
      Luc Trudeau authored
      The Sub8x8 validation code is changed to be more robust. The scope of the
      validation is narrowed to checking that all of the required content in
      the storage buffer was stored between CfL predictions. The early
      termination used in the current mode decision code does not allow
      validating more than that.
      
      This change does not change encoder output.
      
      BUG=aomedia:925
      
      Change-Id: I7f1ed84da5037dcfaaf5da9cf33b4b8d664d2352
    • Remove experimental flag for rect-tx · 11812967
      Debargha Mukherjee authored
      Change-Id: I0cc53a03f07a11a6f7ea0570ff4ee8cf7c18c5aa
    • loop-restoration: Remove special case in Wiener filter · 3acd3b5c
      David Barker authored
      Remove the special case handling for the topmost/bottommost
      rows in each processing unit. This causes slightly different
      effects depending on whether striped-loop-restoration is enabled.
      
      With striped-loop-restoration:
        Now that we explicitly fill out 3 rows of above/below pixels
        for each stripe, we don't need to use stepdown_wiener_kernel.
        Instead, the duplication of the topmost/bottommost pixels
        accomplishes the same task, while making the code much cleaner.
      
        This patch should not cause a change in output, except in a
        couple of cases which were already questionable. In particular,
        it fixes bug #953, where the Wiener filter could not handle
        small processing units (<4 rows high).
      
      Without striped-loop-restoration:
        The Wiener filter returns to using a full 3 pixels above/below
        the processing unit. In order to make sure there are enough
        pixels, we need to expand WIENER_BORDER_VERT to 3 pixels.
      
        This will result in a slight change in output, but should be
        fairly minor.
      
      BUG=aomedia:953
      
      Change-Id: I9530ef55909246f7ba488b7ecfd92d59e776b2f9
  3. 28 Oct, 2017 3 commits
    • Add new 4-point Type-VII DST to daala_tx. · 14a9cb1f
      Nathan E. Egge authored
      Replaces the lifting-based orthonormal 4-point Type-IV DST with an
      orthonormal 4-point Type-VII DST that has no iterative multiplies.
      
      Change-Id: I0a1f1a8d8cecce1c8002b7891baea601bc088690
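
      For reference, the textbook floating-point definition of an orthonormal
      N-point Type-VII DST can be sketched as below. This is not the integer
      daala_tx implementation from the commit; the function names are
      hypothetical, and the code only builds the 4-point basis and checks that
      it is orthonormal.

```python
import math


def dst7_matrix(n: int = 4) -> list[list[float]]:
    """Orthonormal n-point Type-VII DST basis; row k is the k-th basis function."""
    scale = math.sqrt(4.0 / (2 * n + 1))
    return [
        [scale * math.sin(math.pi * (2 * k + 1) * (j + 1) / (2 * n + 1))
         for j in range(n)]
        for k in range(n)
    ]


def gram(matrix: list[list[float]]) -> list[list[float]]:
    """Return matrix * matrix^T, which is the identity for an orthonormal basis."""
    return [[sum(a * b for a, b in zip(row_i, row_j)) for row_j in matrix]
            for row_i in matrix]


basis = dst7_matrix(4)
for row in gram(basis):
    print([round(value, 6) for value in row])
```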
    • Extend the eob context model · 35deaa73
      Jingning Han authored
      Account for 1-D/2-D transform kernels in the eob modeling. To
      maintain a smaller context cardinality, place the two 1-D transform
      kernels in the same category. The difference in directions should
      be largely covered by the scan order.
      
      This and the previous CLs on nz_map context modeling together
      improve the compression performance of the level-map coefficient
      coding system by 0.4% for lowres.
      
      Change-Id: I8c4f03ca01ce3d248950d04bd1266f445b4227a0
    • Account for rectangular transform block sizes in lv-map ctx · a24a6900
      Jingning Han authored
      Account for the rectangular transform block sizes in the non-zero
      map context model.
      
      Change-Id: I16cf21a4120c10c213df10950aeb4ef0ea40c477
  4. 27 Oct, 2017 11 commits
    • Ext-intra modification/tuning · 3ca43bf0
      Joe Young authored
      For ext-intra directional intra modes:
      
      1. Use neighbor block modes to modify edge filtering strength
         Coding gain (lowres/midres/hdres):
           (8 keyframes)
           PSNR: -0.19 -0.22 -0.10
           SSIM: -0.29 -0.27 -0.13
      
      2. Use context-based cdf to code angle_delta syntax
           (8 keyframes)
           PSNR: -0.20 -0.24 -0.27
           SSIM: -0.29 -0.33 -0.37
      
      3. Filter corner sample:
           (8 keyframes)
           PSNR: -0.01 -0.02 -0.05
           SSIM: -0.03 -0.04 -0.05
      
      Combined BD-rate improvement for 8 keyframes:
           PSNR: -0.40 -0.47 -0.40
           SSIM: -0.57 -0.60 -0.51
      
      Change-Id: Id47ac17b6bf91cd810b70cacfc5b457341f417f3
    • Superres: Fix writing/reading of denominator. · 8301018d
      Urvang Joshi authored
      Range is 9 to 16, and not 8 to 15.
      
      BUG=aomedia:972
      
      Change-Id: I7de6cea16a6377d9cd3b2af73efc841b42dad1fa
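
      A minimal sketch of coding a denominator in the corrected range. The
      3-bit field width and the helper names are assumptions made for
      illustration (the message only states the range 9 to 16); the point is
      that the coded value is an offset from 9, not from 8.

```python
SUPERRES_DENOM_MIN = 9   # smallest denominator, per the commit message
SUPERRES_DENOM_BITS = 3  # assumption: the 8 values 9..16 fit in a 3-bit field
SUPERRES_DENOM_MAX = SUPERRES_DENOM_MIN + (1 << SUPERRES_DENOM_BITS) - 1


def write_denominator(denom: int) -> int:
    """Return the coded offset for a denominator in 9..16."""
    if not SUPERRES_DENOM_MIN <= denom <= SUPERRES_DENOM_MAX:
        raise ValueError(f"denominator {denom} outside 9..16")
    return denom - SUPERRES_DENOM_MIN


def read_denominator(coded: int) -> int:
    """Recover the denominator from its coded offset."""
    if not 0 <= coded < (1 << SUPERRES_DENOM_BITS):
        raise ValueError(f"coded value {coded} does not fit the field")
    return coded + SUPERRES_DENOM_MIN


for denom in range(SUPERRES_DENOM_MIN, SUPERRES_DENOM_MAX + 1):
    print(denom, write_denominator(denom), read_denominator(write_denominator(denom)))
```

      With an offset of 8 instead of 9, the same field would cover 8 to 15,
      which is the off-by-one the commit title describes.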
    • 64X64: Keep top-left 32x32 only (other code path). · 693ae522
      Urvang Joshi authored
      Change-Id: Ib4faac1e7da40a351ec3abfe1f636a94c92ef0a3
    • Encoder: Reduce max resident set size by 23% · 5a69cd2d
      Urvang Joshi authored
      We reduce max stack size from 16 to 8.
      
      Memory reduction:
      - peak usage for 1080p video: 2.328 GB → 1.788 GB
      - sizeof(ref_mv_stack): 6144 → 3072
      - sizeof(MB_MODE_INFO_EXT): 6456 → 3384
      - sizeof(PICK_MODE_CONTEXT): 8056 → 5000
      - sizeof(PC_TREE): 201440 → 125040
      
      Compression performance is roughly neutral:
      - AWCY objective-1-fast: +0.03
      - Google lowres: 0.0
      - Google midres: -0.006
      
      BUG=aomedia:940
      
      Change-Id: Ifd38359c58e40b1c94552c5034618da8ce510f62
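
      The relative savings behind the figures above can be recomputed with a
      few lines of arithmetic. The numbers are copied from the message; the
      helper name is hypothetical.

```python
def percent_reduction(before: float, after: float) -> float:
    """Relative reduction from `before` to `after`, in percent."""
    return 100.0 * (before - after) / before


# (before, after) pairs as listed in the commit message.
figures = {
    "peak usage for 1080p video (GB)": (2.328, 1.788),
    "sizeof(ref_mv_stack)": (6144, 3072),
    "sizeof(MB_MODE_INFO_EXT)": (6456, 3384),
    "sizeof(PICK_MODE_CONTEXT)": (8056, 5000),
    "sizeof(PC_TREE)": (201440, 125040),
}

for name, (before, after) in figures.items():
    print(f"{name}: {percent_reduction(before, after):.1f}% smaller")
```

      The peak-usage figure works out to roughly 23%, matching the title, and
      the 3072-byte saving in MB_MODE_INFO_EXT equals the saving in
      ref_mv_stack.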
    • JNT_COMP: 4. add context and entropy read/write · 0a7f2f51
      Cheng Chen authored
      Change-Id: I0e6f7ab981e31f7120105515f6204568b6dc82d3
    • JNT_COMP: 3. rd select the best weight · ca6958c6
      Cheng Chen authored
      Select the best compound_idx in rd.
      The rate/cost for compound_idx and their ctx will be in patch 4.
      
      For now, there is a bug unless we encode one more time using the
      selected compound_idx. It remains an issue to be solved in the future.
      
      Change-Id: I5e1ba51da2b6ab5bacd8aba752dda43bd2257014
    • Add short_filter experiment · f02f8aef
      Zhijie Yang authored
      Reduce the motion interpolation filter taps for inter prediction
      blocks with widths or heights smaller than or equal to 4 to alleviate
      the memory bandwidth increase.
      
      AWCY HL: 0.01% Y, -0.20% U, -0.29% V (positive number means loss)
      
      Change-Id: Ic454340e20aea2f1aae622336990f24a9e5b54d8
    • striped-loop-restoration: Save/restore more context rows · fa1e4b2a
      David Barker authored
      Save and restore 3 rows above and below each stripe, instead of 2.
      The extra rows are filled with duplicates of the outermost context
      rows.
      
      This should not affect the encoder or decoder output in any way,
      as currently these outer rows are not used. But this will enable
      later patches to simplify the code and make it a closer match
      to the way things are described in the striped-loop-restoration
      design document.
      
      Change-Id: I8ae5433e321d6025c6dc1b473330f485f1599340
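
      The row duplication described above can be sketched as follows. This is
      an illustration under assumptions, not the libaom code: rows are plain
      Python lists ordered top to bottom, and the function name is
      hypothetical.

```python
def extend_context_rows(above, below):
    """Extend two saved context rows per side to three.

    `above` and `below` each hold two rows, ordered top to bottom. The extra
    row on each side duplicates the outermost row, i.e. the row farthest
    from the stripe.
    """
    above3 = [list(above[0])] + [list(row) for row in above]
    below3 = [list(row) for row in below] + [list(below[-1])]
    return above3, below3


above = [[1, 2], [3, 4]]   # above[1] touches the top of the stripe
below = [[5, 6], [7, 8]]   # below[0] touches the bottom of the stripe
above3, below3 = extend_context_rows(above, below)
print(above3)
print(below3)
```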
    • Accept all warped motion model settings · 163710c0
      Sebastien Alaiwan authored
      When needed, fall back to the regular interp filter at the reconstruction stage.
      
      Such bitstreams are valid.
      However, as we don't expect aomenc to generate them,
      print a helper warning.
      
      Change-Id: If30c8d8e478688d142abd857f4c35f3e8c68edb4
    • Fix bug when enabling 32-point DST in daala_tx. · 856d1798
      Nathan E. Egge authored
      Change-Id: I567420e45f54cfe991065614d0a8c0c4d637e116
    • Fixed build conflict (amvr,intrabc). · 10a0380a
      RogerZhou authored
      Change-Id: Ibfeb424bf0ebab7bbeb69f6f6df24a4f4924ec97
  5. 26 Oct, 2017 9 commits
    • striped-loop-restoration: Fix line buffer width · e7745025
      David Barker authored
      The last restoration unit in a tile is allowed to be up to 1.5x
      the nominal restoration unit size. This was not properly accounted
      for in the definition of RESTORATION_LINEBUFFER_WIDTH, leading to
      memory corruption whenever we hit a particularly wide restoration
      unit.
      
      Change-Id: I6e858278bf1e3304eedb5f974f1db6961245e7bf
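
      A sketch of how a last unit of up to 1.5x the nominal size can arise, and
      why a buffer sized for the nominal width is too small. The
      round-to-nearest unit count is an assumption about the allocation rule,
      and all function names are hypothetical; the real definition of
      RESTORATION_LINEBUFFER_WIDTH may include further terms.

```python
def count_units(tile_size: int, unit_size: int) -> int:
    """Assumed rule: round the unit count to nearest, with at least one unit."""
    return max((tile_size + unit_size // 2) // unit_size, 1)


def last_unit_size(tile_size: int, unit_size: int) -> int:
    """Size of the final unit, which absorbs any remainder."""
    return tile_size - (count_units(tile_size, unit_size) - 1) * unit_size


def linebuffer_width(max_unit_size: int) -> int:
    """Width that covers a unit of up to 1.5x the nominal size (rounded up)."""
    return (3 * max_unit_size + 1) // 2


unit = 64
for tile in (64, 95, 96, 159, 160):
    print(tile, count_units(tile, unit), last_unit_size(tile, unit))
print(linebuffer_width(unit))
```

      Under this rule a 95-pixel tile holds a single 95-pixel unit, wider than
      the nominal 64, so a line buffer sized for 64 pixels would overflow.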
    • Merge eob-first into lv-map · 3422ac17
      Jingning Han authored
      Change-Id: Ib36a8df1a3ebddbf4320fb7b9b5537041bddc3a3
    • Clean up br-node in lv-map · 36773c7a
      Jingning Han authored
      Use br-node approach, which can be easily turned into multi-symbol
      if desired.
      
      Change-Id: I40df5178ab299af24d347d91f01a88dbfc9305a6
    • Consolidate lv-map experiment · 00803a77
      Jingning Han authored
      Change-Id: I2ae2a33574bc3072561e696a31e0ea2e0770afa9
    • Remove dead functions · 2457ec8c
      Sebastien Alaiwan authored
      Change-Id: Idcb0a6660ac3b34eb79c216d71c8a71ffb863669
    • Collect coeff level distribution in symbolrate · 9c168370
      Angie Chiang authored
      Change-Id: If77800c0904b5e004508274acb32ae46a641405b
    • Count superblock num in symbol rate accounting · d9af8ac3
      Angie Chiang authored
      Change-Id: Id955e62c89b44781cef6b562fbc1e5782fccf95e
    • Stop loop rest units from straddling tile boundaries · bcb65fe6
      Rupert Swarbrick authored
      With this patch, restoration units are allocated within each tile as
      if it were its own image. Arrays of information that need one entry
      per restoration unit are laid out in tiles, with rsi->units_per_tile
      units for each tile.
      
      Change-Id: I485c17166f33e24d281079b3138b76f98f0fe081
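
      One plausible reading of the per-tile layout, sketched as an index
      computation. Only units_per_tile comes from the message; the function
      name and the flat-array layout are assumptions for illustration.

```python
def unit_index(tile_idx: int, unit_in_tile: int, units_per_tile: int) -> int:
    """Flat array index of a restoration unit when each tile owns a
    contiguous block of `units_per_tile` entries."""
    if not 0 <= unit_in_tile < units_per_tile:
        raise IndexError("unit_in_tile out of range for this tile")
    return tile_idx * units_per_tile + unit_in_tile


units_per_tile = 6  # hypothetical value
for tile_idx in range(3):
    print(tile_idx, [unit_index(tile_idx, u, units_per_tile)
                     for u in range(units_per_tile)])
```

      Because every tile reserves the same number of entries, no unit's
      information is shared between two tiles.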
    • Fix a bug in the DAALA_TX 4-point DST functions. · b634e7ed
      Nathan E. Egge authored
      The OD_FDST_4() and OD_IDST_4() macros were written for use in the
      OD_FDCT_8_ASYM macro, which took asymmetrically scaled input and,
      after running an asymmetric butterfly step, passed it through to
      the 4-point Type-II DCT and 4-point Type-IV DST.
      Because the DST implementations were never tested as standalone
      transforms, some of the signs from the butterfly step ended up inside
      the DST macros.
      These extra operations will be addressed in a follow-up patch.
      
      Change-Id: I5ad1dee7b903d3a6dc3d512ae430841244851bc0
  6. 25 Oct, 2017 6 commits
    • Fix reference frame mvs access · 058d0889
      Jingning Han authored
      Resolve an enc/dec mismatch issue when tmv is off and mfmv is on.
      
      Change-Id: Ia64005acd85f51d3162baafab1540095ad06187d
    • av1_rtcd_defs.pl: deduplicate HBD/LBD · 27427722
      Sebastien Alaiwan authored
      There's no change to the generated file.
      
      Change-Id: I77e9d78d22d084bc77dbf1dc5b8b99368cd2444e
    • Optimizations for filter_intra · 57b8ff68
      Yue Chen authored
      Reduce number of modes from 10 to 6, and disable fi modes in UV.
      To reduce complexity, apply filter directly without subtracting
      the estimated means.
      
      Change-Id: Iaf78d92d31e4a7cc30ea7863b57a9611c5f503e6
    • striped_loop_restoration bug fixes · 54671902
      Ola Hugosson authored
      * The above/below buffers did not fit the extra replication pixels to the right and left
      * The Wiener filter stripe has to be at least 4 pixels high (because of the
        split into above/mid/below parts)
      
      Change-Id: I360bef114c7ceb439e11b76bd4724af15e051348
    • [CFL] Switch to txfm_rd_in_plane in alpha search · 1f8d0950
      David Michael Barr authored
      This is more precise than the dist functions it replaces.
      
      Results on Subset1 (compared with previous commit with CfL enabled)
        PSNR | PSNR Cb | PSNR Cr | PSNR HVS |   SSIM | MS SSIM | CIEDE 2000
      0.0634 | -0.9188 | -0.9429 |   0.0609 | 0.0722 |  0.0593 |    -0.3226
      
      Change-Id: I955a7d7eceea50482edb40b0d1041b300e3c9042
    • Remove dead struct member · dea4d313
      Sebastien Alaiwan authored
      Change-Id: Id228c94fbe6005ac37a59bb8c23cfb0f95f97af0