1. 01 Mar, 2017 22 commits
  2. 28 Feb, 2017 9 commits
    • Turn on SIMD implementation of av1_fht32x32 · e4f98f67
      Angie Chiang authored
      Change-Id: Ie1bfece43c81ee5d149ed25c3f7fd959a8f95030
    • Add SIMD code for PVQ search · 3a88de8f
      Michael Bebenita authored
      This reduces the share of runtime spent in pvq_search_rdo_double from 37%
      to 15% and improves overall encoding speed when PVQ is enabled by ~40%.
      The SIMD code is not bit accurate with the C version and introduces a
      slight PSNR regression on AWCY:
      
        PSNR | PSNR Cb | PSNR Cr | PSNR HVS | SSIM | MS SSIM | CIEDE 2000
      0.0607 |  0.1044 |     N/A |   0.0126 |  N/A | -0.0309 |        N/A
      
      Change-Id: Ie22cebc62df2e72618305f2268668d79167860c6
    • Add av1_cost_coeffs_txb() for lv_map experiment · 47c72189
      Angie Chiang authored
      Change-Id: I44842387207b19f8e0c3894d3f4e8d0646a4cafd
    • Simplify rabs_read() · bff32ac0
      Alex Converse authored
      This is branchless on newer gcc and clang and is about 1% faster overall
      at cq-level=16 frame-parallel=1.
      
      Change-Id: I7f5608ab0f0abbc29aa3419a103addf945ea9f0a
    • SMOOTH_PRED: Use 8-bit weights. · 3e42acd4
      Urvang Joshi authored
      Using 8-bit weights gives results similar to 12-bit weights, with only
      noise-level differences. Here's how 8-bit compares to 12-bit:
      
      * AWCY Objective-1-fast:
                                high latency          low latency
      ALL keyframes             0.00                  0.01
      Video                     0.00                  0.04
      
      * Google sets:
      
      All Keyframes:
      lowres: 0
      midres: -0.001
      hdres: -0.001
      
      Video overall:
      lowres: 0
      midres: -0.063
      hdres: 0.026
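
The following is a minimal sketch, not the libaom code, of why reducing weight precision from 12 to 8 bits can change a blended prediction so little. It assumes the predictor is a fixed-point weighted average of two samples; the function names and the example weight are hypothetical.

```python
def blend(a, b, weight, bits):
    """Blend samples a and b with a fixed-point weight in [0, 2**bits]."""
    scale = 1 << bits
    rounding = scale >> 1  # round to nearest when shifting back down
    return (weight * a + (scale - weight) * b + rounding) >> bits


def quantize_weight(w, bits):
    """Quantize a real-valued weight in [0, 1] to `bits` bits of precision."""
    return int(round(w * (1 << bits)))


def max_blend_difference(w):
    """Largest |8-bit result - 12-bit result| over all 8-bit sample pairs."""
    w8 = quantize_weight(w, 8)
    w12 = quantize_weight(w, 12)
    return max(
        abs(blend(a, b, w8, 8) - blend(a, b, w12, 12))
        for a in range(256)
        for b in range(256)
    )
```

In this toy setup the two precisions never disagree by more than one code value for 8-bit samples, which is in line with the noise-level differences reported above.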
      
      Change-Id: Ibed6015aa7cce12fcc6f314ffde76624df4ad2a1
    • Assign offsets correctly to compute warped motion · 246d2737
      Debargha Mukherjee authored
      Offsets for the least-squares computation of affine motion
      are now set at the top-left corner of the current block.
      
      Improves stability and performance a little.
      
      Change-Id: I68ca7e74c6102502daa8ca3373af2b2dd59400c3
    • Disable compound mode in sub8x8 coding blocks · c41a549a
      Jingning Han authored
      Disable support for compound prediction modes in sub8x8 coding
      blocks. Make the rate-distortion optimization process account for
      this constraint.
      
      With the use of 2x2 chroma prediction blocks, this makes the worst-case
      number of inter predictors the same as in VP9. It affects the coding
      gains by 0.35% for lowres, 0.17% for midres, and 0.08% for hdres.
      
      The encoding speed is up by 10%.
      
      Change-Id: Ieb2a83030676911baa403e586f1f800cbf485d81
    • Use correct segment · 1e2aae1a
      Yaowu Xu authored
      The segment-based lossless flag is used when selecting the transform
      size. This commit fixes a bug where the wrong segment_id was used in
      that selection.
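
A rough illustration of this class of bug, assuming (as in VP9/AV1 lossless coding) that a lossless segment is restricted to the smallest transform size. The names below are hypothetical and do not mirror the actual libaom functions.

```python
# Hypothetical transform-size identifiers, smallest to largest.
TX_4X4, TX_8X8, TX_16X16, TX_32X32 = range(4)


def select_tx_size(segment_id, lossless_by_segment, largest_tx_size):
    """Pick a transform size for a block belonging to `segment_id`.

    Assumption: lossless segments must use TX_4X4; other segments may use
    the largest size allowed for the block.
    """
    if lossless_by_segment[segment_id]:
        return TX_4X4
    return largest_tx_size


# Segment 1 is lossless, segment 0 is not.
lossless_by_segment = [False, True]

# Looking up the block's own segment gives the restricted size...
correct = select_tx_size(1, lossless_by_segment, TX_32X32)
# ...while looking up some other segment's flag silently picks a size
# that is not valid for a lossless block.
wrong = select_tx_size(0, lossless_by_segment, TX_32X32)
```

The point of the sketch is only that the lossless flag is per segment, so the lookup must use the segment_id of the block being coded.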
      
      BUG=aomedia:350
      
      Change-Id: Ibc981c779739849bac00447155180abbd319eb28
    • Move asserts into correct scope · cdf8a14e
      Yaowu Xu authored
      The macro used in the assert is defined under CONFIG_VAR_TX. This fixes a
      build issue when --enable-var-tx and --enable-rd-debug are both on.
      
      Change-Id: I497fe4a8b1fa6c7b05ac2b41c97522f7bdedc0ce
  3. 27 Feb, 2017 9 commits
    • Remove redundant return in set_offsets · 44701f2c
      Angie Chiang authored
      Change-Id: Idf8f03052a7e21b8a273986204038545573d7962
    • Better block center in gm_get_motion_vector fn · f6dd3c68
      Debargha Mukherjee authored
      Also supports homography models for future experiments.
      
      Change-Id: I4510540f54133e063891ed491c95c087222f7810
    • Remove unnecessary #ifdef · d152fc04
      Adrian Grange authored
      The line of code is already within the scope
      of an #if CONFIG_EC_MULTISYMBOL.
      
      Change-Id: I62e28c8586f5d04a1e1be4ea5a2551d3123fde9f
    • Adds macro to test cb4x4 w/o sub8x8 txtype search · 094c9439
      Debargha Mukherjee authored
      USE_TXTYPE_SEARCH_FOR_SUB8X8_IN_CB4X4 macro added to turn
      tx_type search on/off for sub8x8 in cb4x4 mode.
      
      The purpose is mainly to analyze the coding gains from cb4x4
      but this later can be made into a speed feature as well.
      
      Change-Id: Ic22026c373eebba87f324689ac5686a2844315b6
    • Integerize warped motion computation · e6eb3b53
      Debargha Mukherjee authored
      Integerizes computation of the least squares for warped motion.
      The model is restricted to only Affine. Affine seems easiest
      to compute and integerize since it can be split into two 3-dim
      least squares problems, as opposed to rotation-zoom which needs
      a 4-dim least-squares problem to be solved.
      The current implementation requires only one division per block.
      
      BDRATE impact is minimal. The upgrade to the affine model improves
      coding efficiency, but integerization also degrades efficiency a
      little. Overall there is a net gain of about -0.07% BDRATE on
      the lowres set.
      BDRATE lowres: -1.113% with --enable-warped-motion vs. without
      (up from -1.044%).
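
As a sketch of the structure described above (not the integerized libaom implementation), the affine model x' = a*x + b*y + c, y' = d*x + e*y + f can be fitted as two 3-parameter least-squares problems that share one 3x3 normal matrix. Solving via the adjugate means the reciprocal of the determinant is the only division, and it can be reused for both problems. All function names are hypothetical, and exact rationals stand in for the fixed-point arithmetic a codec would use.

```python
from fractions import Fraction


def normal_matrix(points):
    """Build A^T A for rows [x, y, 1] over the source points."""
    sxx = sum(x * x for x, y in points)
    sxy = sum(x * y for x, y in points)
    syy = sum(y * y for x, y in points)
    sx = sum(x for x, y in points)
    sy = sum(y for x, y in points)
    n = len(points)
    return [[sxx, sxy, sx], [sxy, syy, sy], [sx, sy, n]]


def adjugate_and_det(m):
    """Return the adjugate and determinant of a 3x3 matrix."""
    (a, b, c), (d, e, f), (g, h, i) = m
    adj = [
        [e * i - f * h, c * h - b * i, b * f - c * e],
        [f * g - d * i, a * i - c * g, c * d - a * f],
        [d * h - e * g, b * g - a * h, a * e - b * d],
    ]
    det = a * adj[0][0] + b * adj[1][0] + c * adj[2][0]
    return adj, det


def fit_affine(src, dst):
    """Fit [[a, b, c], [d, e, f]] mapping integer src points to dst points."""
    adj, det = adjugate_and_det(normal_matrix(src))
    if det == 0:
        raise ValueError("source points are degenerate (collinear)")
    inv_det = Fraction(1, det)  # the single division, shared by both fits
    params = []
    for k in (0, 1):  # k = 0 fits x', k = 1 fits y'
        rhs = [
            sum(x * t[k] for (x, y), t in zip(src, dst)),
            sum(y * t[k] for (x, y), t in zip(src, dst)),
            sum(t[k] for t in dst),
        ]
        params.append(
            [inv_det * sum(adj[r][c] * rhs[c] for c in range(3)) for r in range(3)]
        )
    return params
```

For example, calling fit_affine with source points and their images under a known affine map recovers that map exactly, because the rational arithmetic introduces no rounding. A rotation-zoom model couples the x' and y' equations through shared parameters, which is why it cannot be split this way.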
      
      Change-Id: I6b9216ac0737d76f59054293eabee48e17739ec4
    • Move cmake build setup for testing into test/test.cmake. · 4db04d36
      Tom Finegan authored
      - Move source list vars.
      - Split source list vars into common/decoder/encoder sources.
      - Move target definitions into function.
      - Split targets into common/decoder/encoder targets.
      - Update CMakeLists.txt to include test.cmake and call
        setup_aom_test_targets() at the appropriate time.
      
      BUG=https://bugs.chromium.org/p/aomedia/issues/detail?id=76
      
      Change-Id: Icd9ce67593c2de7ebd5c8ef921e31517b6d20945
    • Remove const from int ext_tx_set · 7640f5f3
      Yaowu Xu authored
      The variable is assigned a value later in the function.
      
      Change-Id: I93f283a134499a050b46d9dcd6f0c0b4e8d54049
    • Prefer using get_tx_size() · 7fcfee40
      Angie Chiang authored
      Change-Id: Ifcdd3ce2953c1ecb1d0962da412a4b5ba2cda912
    • Correct a macro · 345a22db
      Yaowu Xu authored
      --enable-lowbitdepth defines the flag CONFIG_LOWBITDEPTH, not
      CONFIG_AOM_LOWBITDEPTH.
      
      Change-Id: Ifa1c12847bee4978d08d010f4fc3601d75e59c31