1. 30 Nov, 2017 8 commits
    • Michelle Findlay-Olynyk's avatar
      Add speed feature use_fast_interpolation_filter... · a3eb912b
      Michelle Findlay-Olynyk authored
      Applies to speed >=1. Instead of searching all dual filter space
      {R,Sm,Sh}x{R,Sm,Sh}, only check {R}x{R,Sm,Sh} followed by
      {R,Sm,Sh}x{best of prev R,Sm,Sh}.
      
      Saves ~6% of cycles by reducing av1_convolve_2d_sse2, with 0.023
      overall psnr drop.
      
      Change-Id: I82d7a6321b335293124a007ff4c87f0e260052e1
      a3eb912b
    • Luc Trudeau's avatar
      [CFL] Clean cfl.h include · d183b646
      Luc Trudeau authored
      cfl.h is no longer included in the onyxc_int.h. cfl.h is now included in
      each file that uses cfl functions.
      
      Change-Id: I80a2e57159217b0ee20b2562fc9711d929472b8c
      d183b646
    • Luc Trudeau's avatar
      [CFL] CfL Initialization Simplification · 1e84af52
      Luc Trudeau authored
      The CfL context is now stored inside MACROBLOCKD instead of
      MACROBLOCKD only storing a pointer to the CfL context.
      The intent is to avoid race conditions as MACROBLOCKD is stored
      inside ThreadData. This change also simplifies CfL Initialization.
      
      Change-Id: I991503716b21fc9aca60caddb2008b8bff397e6d
      1e84af52
    • Debargha Mukherjee's avatar
      Do not use avx2 quantizer for 4:1 transforms · 24e31043
      Debargha Mukherjee authored
      Change-Id: I4b1a386ff989d63c7230923b36cb230b18466143
      24e31043
    • Jingning Han's avatar
      Make adaptive scan order support multi-threading · 0bd3bf69
      Jingning Han authored
      Count the non-zero appearance per tile and update per frame. This
      resolves an enc/dec mismatch in multi-threading coding.
      
      BUG=aomedia:1073
      
      Change-Id: I474f05913e6c3f75ca3fa1057bd4809ef51af164
      0bd3bf69
    • Sebastien Alaiwan's avatar
      rdopt.c: inline temporary · feca25a5
      Sebastien Alaiwan authored
      Change-Id: I9e79d37b83d0185e3079ce4f7c7277840fc0d753
      feca25a5
    • Hui Su's avatar
      Make the signaling of allow_screen_content_tools consistent · df89ee3a
      Hui Su authored
      in keyframe and all-intra frame.
      
      Change-Id: I09d6f83efd3c4fe3f9a464d6a0e22e2e68220a8d
      df89ee3a
    • Debargha Mukherjee's avatar
      Refactor/Change the entropy context for transforms · b3eda2f4
      Debargha Mukherjee authored
      The change makes the entropy context for transforms use
      the same mechanism as with and without lv_map.
      For the non-lv-map case the context is now based on the
      the larger transform dim for 2:1 rect transforms. The context
      is now the average for 4:1 rect transforms for both lv-map and
      non-lv-map cases.
      There is one small fix for level map for getting the correct
      rate when skip is set.
      
      BDRATE: lowres, 30 frames, speed 1: -0.15% gain for the
      non-lv-map case on the baseline.
      
      Change-Id: I06a583d33bef68202d72a88e077f8d31cc5e7fe4
      b3eda2f4
  2. 29 Nov, 2017 17 commits
  3. 28 Nov, 2017 5 commits
  4. 27 Nov, 2017 3 commits
    • Jingning Han's avatar
      Use sub-frame statistics for adaptive scan order update · 025c6c41
      Jingning Han authored
      Skip the last SB row counting for per frame adaptive scan order.
      This allows enough time window for HW decoder to process the
      scan order update for next frame decoding.
      
      Change-Id: I8a3b48fe452c68c921d55dc76cc787f0a8e00e29
      025c6c41
    • Debargha Mukherjee's avatar
      Add option to disable split partitions for chroma · 891a8774
      Debargha Mukherjee authored
      When the flag DISABLE_VARTX_FOR_CHROMA is on chroma is
      constrained to always use the largest transform size
      for the prediction unit size.
      This is meant to simnplify the logic for transform size
      selection for chroma with hopefully no loss.
      
      Results:
      lowres 30 frames, speed 1: -0.038% (a slight improvement).
      lowres 30 frames, speed 0: 0.000% (noise level difference).
      
      Change-Id: I14dd5b1983d908bd98e59b7d252e11f5755c97e6
      891a8774
    • Debargha Mukherjee's avatar
      Add chroma bsize modifications correctly · 3aa28111
      Debargha Mukherjee authored
      Change-Id: I807dc86489bc9219a24801841e66b555a2d7a51f
      3aa28111
  5. 25 Nov, 2017 1 commit
  6. 24 Nov, 2017 1 commit
    • Rupert Swarbrick's avatar
      Disallow 4:1/1:4 partitions for BLOCK_128x128 · eb123933
      Rupert Swarbrick authored
      The change can be temporarily reverted by setting ALLOW_128X32_BLOCKS
      to 1 (for easy "is this better?" testing: we'll get rid of the option
      once it's clear what we're doing).
      
      Since this means we have to modify the "num_partition_types"
      calculation in functions like read_partition, we move it into a helper
      function called partition_cdf_length(). Doing this makes the bodies of
      read_partition and write_partition somewhat simpler than they were.
      
      Change-Id: I3573f62f71dc3344aed2dcb3e423e034eb56a7c5
      eb123933
  7. 23 Nov, 2017 3 commits
    • Rupert Swarbrick's avatar
      Refactor to allow optimization in SGR code · 13927866
      Rupert Swarbrick authored
      The first stage of the selfguided filter is to generate box sums of
      the input image (and its squares). This is done with a pair of
      integral images, which are the same for both calls in
      apply_selfguided_restoration.
      
      This patch refactors things so that av1_selfguided_restoration
      calculates both "flt" buffers, allowing it to reuse the integral
      images that it calculated.
      
      Change-Id: Ica2f6f66e41bea38eb1a135c78c1d7ddab434d8e
      13927866
    • Rupert Swarbrick's avatar
      Get rid of the highbd versions of the SGR code · 625e50bd
      Rupert Swarbrick authored
      This doesn't have a big performance impact, and it's rather simpler
      just having one version of everything.
      
      Change-Id: I5fa5e7640a63d0ccb0c371f266c6eee99d9520f9
      625e50bd
    • Rupert Swarbrick's avatar
      Remove unused highpass filter from SGR code · 7cf60961
      Rupert Swarbrick authored
      Change-Id: Ifac3a3bf620061865b82b986d6b16bcabd96a187
      7cf60961
  8. 22 Nov, 2017 2 commits
    • Frederic Barbier's avatar
      [idct] Fix initialization of tx_set_type · 33b39f01
      Frederic Barbier authored
      Previous assumption on reduced_tx_set_used=0 led to many assertion
      failures and prevented signalling reduced_tx_set_used equal to 1.
      
      BUG=aomedia:1053
      
      Change-Id: If9a9dff8d01ba3ec942e06559c153f06d34555f9
      33b39f01
    • Cheng Chen's avatar
      JNT_COMP: add ssse3 implementations for sad_avg · d0179a6b
      Cheng Chen authored
      Add ssse3 implementations for the sad_avg c function at low bit-depth.
      With this, aom_jnt_sad c functions can all have simd implementations.
      This CL follows existing MACRO definitions for multiple combinations
      of block sizes.
      
      Change-Id: I882343684026525f5589a239337cfac2dd411e11
      d0179a6b