1. 30 Nov, 2017 8 commits
    • Add speed feature use_fast_interpolation_filter... · a3eb912b
      Michelle Findlay-Olynyk authored
      Applies to speed >=1. Instead of searching all dual filter space
      {R,Sm,Sh}x{R,Sm,Sh}, only check {R}x{R,Sm,Sh} followed by
      {R,Sm,Sh}x{best of prev R,Sm,Sh}.
      Saves ~6% of cycles by reducing the time spent in
      av1_convolve_2d_sse2, at the cost of a 0.023 overall PSNR drop.
      Change-Id: I82d7a6321b335293124a007ff4c87f0e260052e1
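The two-pass search described in this commit can be sketched as follows. This is a minimal illustration, not the encoder's code: the names `fast_dual_filter_search`, `pair_cost_fn`, and `table_cost` are hypothetical, the assignment of "first" and "second" to filter directions is an assumption, and skipping the repeated (R, best) pair in the second pass is a choice made here for brevity.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical filter indices: R = regular, Sm = smooth, Sh = sharp. */
enum { FILTER_R = 0, FILTER_SM = 1, FILTER_SH = 2, NUM_FILTERS = 3 };

/* Hypothetical callback returning the cost of one (first, second) pair. */
typedef int64_t (*pair_cost_fn)(int first, int second, void *ctx);

typedef struct {
  int first;
  int second;
  int64_t cost;
  int evaluations; /* number of cost_fn calls made */
} FilterSearchResult;

/* Example callback: looks the cost up in a 3x3 table passed through ctx. */
static int64_t table_cost(int first, int second, void *ctx) {
  const int64_t(*table)[NUM_FILTERS] = (const int64_t(*)[NUM_FILTERS])ctx;
  return table[first][second];
}

/* Two-pass search: 3 + 2 = 5 evaluations instead of the full 3 x 3 = 9. */
static FilterSearchResult fast_dual_filter_search(pair_cost_fn cost_fn,
                                                  void *ctx) {
  FilterSearchResult best = { FILTER_R, FILTER_R, INT64_MAX, 0 };
  assert(cost_fn != NULL);

  /* Pass 1: {R} x {R, Sm, Sh}. */
  for (int s = 0; s < NUM_FILTERS; ++s) {
    const int64_t c = cost_fn(FILTER_R, s, ctx);
    ++best.evaluations;
    if (c < best.cost) {
      best.second = s;
      best.cost = c;
    }
  }

  /* Pass 2: {Sm, Sh} x {best second filter from pass 1}.
   * (R, best) was already evaluated in pass 1, so it is skipped here. */
  const int best_second = best.second;
  for (int f = 0; f < NUM_FILTERS; ++f) {
    if (f == FILTER_R) continue;
    const int64_t c = cost_fn(f, best_second, ctx);
    ++best.evaluations;
    if (c < best.cost) {
      best.first = f;
      best.cost = c;
    }
  }
  return best;
}
```

Because only part of the pair space is visited, the search can miss the global minimum, which is consistent with the small quality drop the commit reports in exchange for fewer convolutions.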
    • [CFL] Clean cfl.h include · d183b646
      Luc Trudeau authored
      cfl.h is no longer included in onyxc_int.h. Instead, it is included
      directly in each file that uses CfL functions.
      Change-Id: I80a2e57159217b0ee20b2562fc9711d929472b8c
    • [CFL] CfL Initialization Simplification · 1e84af52
      Luc Trudeau authored
      The CfL context is now stored inside MACROBLOCKD instead of
      MACROBLOCKD only storing a pointer to the CfL context.
      The intent is to avoid race conditions as MACROBLOCKD is stored
      inside ThreadData. This change also simplifies CfL Initialization.
      Change-Id: I991503716b21fc9aca60caddb2008b8bff397e6d
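The ownership change described in this commit can be illustrated with a small sketch. All type and field names here (`CflCtxSketch`, `MacroblockdSketch`, `ThreadDataSketch`, `cfl_init_sketch`) are hypothetical stand-ins, not the real libaom definitions. The point is only that a context stored by value travels with its enclosing per-thread structure, so two workers cannot end up sharing one context through a copied pointer.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical, heavily reduced stand-in for the CfL context. */
typedef struct {
  int is_initialized;
  int subsampling_x;
  int subsampling_y;
} CflCtxSketch;

/* The context is a member stored by value, not a pointer to shared state. */
typedef struct {
  CflCtxSketch cfl;
} MacroblockdSketch;

/* Each worker thread owns one of these, and therefore its own context. */
typedef struct {
  MacroblockdSketch xd;
} ThreadDataSketch;

/* Initialization needs no allocation and no pointer hookup. */
static void cfl_init_sketch(CflCtxSketch *cfl, int ss_x, int ss_y) {
  assert(cfl != NULL);
  cfl->subsampling_x = ss_x;
  cfl->subsampling_y = ss_y;
  cfl->is_initialized = 1;
}
```

Copying a `ThreadDataSketch` for a new worker copies the context too, so later writes by one worker are invisible to the other.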
    • Do not use avx2 quantizer for 4:1 transforms · 24e31043
      Debargha Mukherjee authored
      Change-Id: I4b1a386ff989d63c7230923b36cb230b18466143
    • Make adaptive scan order support multi-threading · 0bd3bf69
      Jingning Han authored
      Count non-zero occurrences per tile and update once per frame. This
      resolves an enc/dec mismatch in multi-threaded coding.
      Change-Id: I474f05913e6c3f75ca3fa1057bd4809ef51af164
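A minimal sketch of the "count per tile, update per frame" pattern follows. The names `TileScanCountsSketch`, `tile_count_block`, `frame_merge_tile_counts`, and the block size of 16 coefficients are assumptions made for illustration. The idea shown is that each tile writes only to its own counters, and the per-frame totals are produced by a single merge after all tiles finish, so the result does not depend on thread scheduling.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define SK_NUM_COEFFS 16 /* hypothetical: coefficients in one 4x4 block */

/* Per-tile counters: only the thread coding this tile touches them. */
typedef struct {
  uint32_t non_zero_count[SK_NUM_COEFFS];
} TileScanCountsSketch;

/* Record which coefficient positions of one block are non-zero. */
static void tile_count_block(TileScanCountsSketch *tile,
                             const int16_t *coeffs) {
  assert(tile != NULL && coeffs != NULL);
  for (int i = 0; i < SK_NUM_COEFFS; ++i) {
    if (coeffs[i] != 0) ++tile->non_zero_count[i];
  }
}

/* Once per frame: sum the tile counters in a fixed order. */
static void frame_merge_tile_counts(uint32_t *frame_counts,
                                    const TileScanCountsSketch *tiles,
                                    int num_tiles) {
  assert(frame_counts != NULL && tiles != NULL);
  for (int i = 0; i < SK_NUM_COEFFS; ++i) frame_counts[i] = 0;
  for (int t = 0; t < num_tiles; ++t) {
    for (int i = 0; i < SK_NUM_COEFFS; ++i) {
      frame_counts[i] += tiles[t].non_zero_count[i];
    }
  }
}
```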
    • rdopt.c: inline temporary · feca25a5
      Sebastien Alaiwan authored
      Change-Id: I9e79d37b83d0185e3079ce4f7c7277840fc0d753
    • Make the signaling of allow_screen_content_tools consistent · df89ee3a
      Hui Su authored
      in key frames and all-intra frames.
      Change-Id: I09d6f83efd3c4fe3f9a464d6a0e22e2e68220a8d
    • Refactor/Change the entropy context for transforms · b3eda2f4
      Debargha Mukherjee authored
      The change makes the entropy context for transforms use
      the same mechanism with and without lv_map.
      For the non-lv-map case, the context is now based on
      the larger transform dimension for 2:1 rect transforms. For
      4:1 rect transforms, the context is now the average in both
      the lv-map and non-lv-map cases.
      There is also one small fix to level map so that the correct
      rate is obtained when skip is set.
      BDRATE: lowres, 30 frames, speed 1: -0.15% (a gain) for the
      non-lv-map case on the baseline.
      Change-Id: I06a583d33bef68202d72a88e077f8d31cc5e7fe4
  2. 29 Nov, 2017 17 commits
  3. 28 Nov, 2017 5 commits
  4. 27 Nov, 2017 3 commits
    • Use sub-frame statistics for adaptive scan order update · 025c6c41
      Jingning Han authored
      Skip the last SB row when counting for the per-frame adaptive scan
      order. This leaves a large enough time window for a HW decoder to
      process the scan order update before decoding the next frame.
      Change-Id: I8a3b48fe452c68c921d55dc76cc787f0a8e00e29
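The sub-frame counting rule can be sketched as a loop that stops one superblock row early. The function name `accumulate_scan_counts` and the flat `row_counts` layout are hypothetical. Under this sketch a frame with a single SB row contributes no statistics at all, and the commit message does not say whether the real code behaves that way.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* row_counts holds num_sb_rows * num_coeffs entries, one row of counters
 * per superblock (SB) row. The last SB row is deliberately left out. */
static void accumulate_scan_counts(const uint32_t *row_counts,
                                   int num_sb_rows, int num_coeffs,
                                   uint32_t *frame_counts) {
  assert(row_counts != NULL && frame_counts != NULL);
  for (int i = 0; i < num_coeffs; ++i) frame_counts[i] = 0;
  for (int r = 0; r < num_sb_rows - 1; ++r) { /* note: excludes last row */
    for (int i = 0; i < num_coeffs; ++i) {
      frame_counts[i] += row_counts[r * num_coeffs + i];
    }
  }
}
```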
    • Add option to disable split partitions for chroma · 891a8774
      Debargha Mukherjee authored
      When the flag DISABLE_VARTX_FOR_CHROMA is on, chroma is
      constrained to always use the largest transform size
      for the prediction unit size.
      This is meant to simplify the logic for transform size
      selection for chroma, hopefully with no loss.
      lowres 30 frames, speed 1: -0.038% (a slight improvement).
      lowres 30 frames, speed 0: 0.000% (noise level difference).
      Change-Id: I14dd5b1983d908bd98e59b7d252e11f5755c97e6
    • Add chroma bsize modifications correctly · 3aa28111
      Debargha Mukherjee authored
      Change-Id: I807dc86489bc9219a24801841e66b555a2d7a51f
  5. 25 Nov, 2017 1 commit
  6. 24 Nov, 2017 1 commit
    • Disallow 4:1/1:4 partitions for BLOCK_128x128 · eb123933
      Rupert Swarbrick authored
      The change can be temporarily reverted by setting ALLOW_128X32_BLOCKS
      to 1 (for easy "is this better?" testing: we'll get rid of the option
      once it's clear what we're doing).
      Since this means we have to modify the "num_partition_types"
      calculation in functions like read_partition, we move it into a helper
      function called partition_cdf_length(). Doing this makes the bodies of
      read_partition and write_partition somewhat simpler than they were.
      Change-Id: I3573f62f71dc3344aed2dcb3e423e034eb56a7c5
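A helper of the kind this commit describes might look like the sketch below. Only the names `partition_cdf_length` and `ALLOW_128X32_BLOCKS` come from the commit message. The block-size enum, the counts 4 and 10, and the assumption that the two 4:1/1:4 partition types sit at the end of the partition list (so the CDF can simply be shortened by two) are illustrative assumptions, not taken from the source.

```c
#include <assert.h>

#ifndef ALLOW_128X32_BLOCKS
#define ALLOW_128X32_BLOCKS 0 /* set to 1 to restore the old behaviour */
#endif

/* Hypothetical block sizes, ordered from small to large. */
enum {
  SK_BLOCK_8X8,
  SK_BLOCK_16X16,
  SK_BLOCK_32X32,
  SK_BLOCK_64X64,
  SK_BLOCK_128X128,
  SK_BLOCK_SIZES
};

/* Assumed counts: 4 basic partition types, 10 with the extended set. */
enum { SK_PARTITION_TYPES = 4, SK_EXT_PARTITION_TYPES = 10 };

/* Number of symbols in the partition CDF for a given block size. */
static int partition_cdf_length_sketch(int bsize) {
  assert(bsize >= 0 && bsize < SK_BLOCK_SIZES);
  if (bsize <= SK_BLOCK_8X8) return SK_PARTITION_TYPES;
  if (!ALLOW_128X32_BLOCKS && bsize == SK_BLOCK_128X128) {
    /* Drop the two 4:1 / 1:4 partition types for 128x128. */
    return SK_EXT_PARTITION_TYPES - 2;
  }
  return SK_EXT_PARTITION_TYPES;
}
```

Keeping this in one helper means a reader and a writer of the partition symbol call the same function and cannot disagree on the alphabet size.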
  7. 23 Nov, 2017 3 commits
    • Refactor to allow optimization in SGR code · 13927866
      Rupert Swarbrick authored
      The first stage of the selfguided filter is to generate box sums of
      the input image (and its squares). This is done with a pair of
      integral images, which are the same for both calls in
      This patch refactors things so that av1_selfguided_restoration
      calculates both "flt" buffers, allowing it to reuse the integral
      images that it calculated.
      Change-Id: Ica2f6f66e41bea38eb1a135c78c1d7ddab434d8e
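The box-sum stage mentioned in this commit relies on the standard integral image technique, sketched below. The function names `build_integral_images` and `box_sum` and the buffer layout (one extra zero row and column) are assumptions for illustration and do not mirror the actual SGR code. The sketch shows why sharing is worthwhile: the pair of integral images is built once, after which any box sum of pixels or squared pixels costs four lookups.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Build integral images of src and of src squared.
 * Both outputs hold (height + 1) * (width + 1) entries, with a leading
 * zero row and zero column so box sums need no edge special cases. */
static void build_integral_images(const uint8_t *src, int width, int height,
                                  int src_stride, uint32_t *sum,
                                  uint32_t *sum_sq) {
  const int istride = width + 1;
  assert(src != NULL && sum != NULL && sum_sq != NULL);
  for (int x = 0; x <= width; ++x) sum[x] = sum_sq[x] = 0;
  for (int y = 0; y < height; ++y) {
    uint32_t row = 0, row_sq = 0; /* running sums along the current row */
    sum[(y + 1) * istride] = 0;
    sum_sq[(y + 1) * istride] = 0;
    for (int x = 0; x < width; ++x) {
      const uint32_t v = src[y * src_stride + x];
      row += v;
      row_sq += v * v;
      sum[(y + 1) * istride + x + 1] = sum[y * istride + x + 1] + row;
      sum_sq[(y + 1) * istride + x + 1] = sum_sq[y * istride + x + 1] + row_sq;
    }
  }
}

/* Sum over the box [x0, x1) x [y0, y1) using four lookups. */
static uint32_t box_sum(const uint32_t *integral, int width, int x0, int y0,
                        int x1, int y1) {
  const int istride = width + 1;
  return integral[y1 * istride + x1] + integral[y0 * istride + x0] -
         integral[y0 * istride + x1] - integral[y1 * istride + x0];
}
```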
    • Get rid of the highbd versions of the SGR code · 625e50bd
      Rupert Swarbrick authored
      This doesn't have a big performance impact, and it's simpler
      to have just one version of everything.
      Change-Id: I5fa5e7640a63d0ccb0c371f266c6eee99d9520f9
    • Remove unused highpass filter from SGR code · 7cf60961
      Rupert Swarbrick authored
      Change-Id: Ifac3a3bf620061865b82b986d6b16bcabd96a187
  8. 22 Nov, 2017 2 commits
    • [idct] Fix initialization of tx_set_type · 33b39f01
      Frederic Barbier authored
      The previous assumption that reduced_tx_set_used=0 led to many
      assertion failures and prevented signalling reduced_tx_set_used
      equal to 1.
      Change-Id: If9a9dff8d01ba3ec942e06559c153f06d34555f9
    • JNT_COMP: add ssse3 implementations for sad_avg · d0179a6b
      Cheng Chen authored
      Add ssse3 implementations for the sad_avg C function at low bit-depth.
      With this, the aom_jnt_sad C functions can all have SIMD
      implementations. This CL follows the existing macro definitions for
      multiple combinations of block sizes.
      Change-Id: I882343684026525f5589a239337cfac2dd411e11
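For readers unfamiliar with the pattern, a plain C reference of a "SAD against an averaged prediction", plus a macro that stamps out one function per block size, might look like the sketch below. The names `sad_avg_ref`, `SAD_AVG_MXN`, and the generated `sadMxN_avg_sketch` functions are hypothetical. The sketch uses a simple rounded average of the two predictions and assumes `second_pred` is stored contiguously with a stride equal to the block width. The JNT_COMP functions named in the commit may weight the two predictions differently, which is not modelled here.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Sum of absolute differences between src and the rounded average of
 * ref and second_pred, at 8 bits per sample. */
static unsigned int sad_avg_ref(const uint8_t *src, int src_stride,
                                const uint8_t *ref, int ref_stride,
                                const uint8_t *second_pred, int width,
                                int height) {
  unsigned int sad = 0;
  assert(src != NULL && ref != NULL && second_pred != NULL);
  for (int y = 0; y < height; ++y) {
    for (int x = 0; x < width; ++x) {
      const int avg = (ref[x] + second_pred[x] + 1) >> 1;
      sad += (unsigned int)abs(src[x] - avg);
    }
    src += src_stride;
    ref += ref_stride;
    second_pred += width; /* assumed contiguous, stride == width */
  }
  return sad;
}

/* One wrapper per block size, generated by a macro. */
#define SAD_AVG_MXN(m, n)                                                  \
  static unsigned int sad##m##x##n##_avg_sketch(                           \
      const uint8_t *src, int src_stride, const uint8_t *ref,              \
      int ref_stride, const uint8_t *second_pred) {                        \
    return sad_avg_ref(src, src_stride, ref, ref_stride, second_pred, m,   \
                       n);                                                 \
  }

SAD_AVG_MXN(4, 4)
SAD_AVG_MXN(8, 4)
```

A SIMD version would keep the same per-size signatures, so callers can swap implementations through a function pointer table without other changes.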