1. 10 Sep, 2016 1 commit
  2. 09 Sep, 2016 2 commits
    • Nathan E. Egge's avatar
      Decoder performance improvement with daala_ec. · 39d69bc0
      Nathan E. Egge authored
      Cherry-pick Daala b5020bee:
       Remove redundant test in od_ec_decode_bool_q15().
      Using a test that decodes 100M random binary symbols, making this change
       produced a speed up of 8.81% with gcc-4.9.3 and 3.71% with clang-3.7.1,
       both compiled with -O2.
      
      Change-Id: If6d0077a56121a575ae53bcd4d1d9b7d800a317d
      39d69bc0
    • Alex Converse's avatar
      Remove custom rans types · 11206c60
      Alex Converse authored
      Change-Id: Ic74d9d8850b8c80a51e55e425bbf472a67e2653f
      11206c60
  3. 08 Sep, 2016 1 commit
    • Steinar Midtskogen's avatar
      Reduce memory footprint for CLPF decoding. · eb5794da
      Steinar Midtskogen authored
      Instead of having CLPF write to an entire new frame and
      copy the result back into the original frame, make the
      filter able to work in-place by keeping a buffer of size
      frame_width*filter_block_size and delay the write-back
      by one filter_block_size row.
      
      This reduces the cycles spent in the filter to ~75%.
      
      Change-Id: I78ca74380c45492daa8935d08d766851edb5fbc1
      eb5794da
  4. 07 Sep, 2016 2 commits
    • Nathan E. Egge's avatar
      Change return type of tell and tell_frac to uint32_t. · c1a87fad
      Nathan E. Egge authored
      The bit accounting functions aom_reader_tell() and aom_reader_tell_frac()
       return the number of bits and 1/8th bits respectively.
      This patch changes the return type from ptrdiff_t which is signed to
       uint32_t which is unsigned.
      The size_t type is not used since we only care about the number of bits
       or 1/8 bits per entropy coder context and we don't expect to code more
       than 512 megabits per tile.
      
      Change-Id: I84a119d1f52829dcbdb66a92656eacca06e42b11
      c1a87fad
    • Michael Bebenita's avatar
      Bit accounting. · e6b12944
      Michael Bebenita authored
      This patch adds bit account infrastructure to the bit reader API.
      When configured with --enable-accounting, every bit reader API
      function records the number of bits necessary to decoding a symbol.
      Accounting symbol entries are collected in global accounting data
      structure, that can be used to understand exactly where bits are
      spent (http://aomanalyzer.org). The data structure is cleared and
      reused each frame to reduce memory usage. When configured without
      --enable-accounting, bit accounting does not incur any runtime
      overhead.
      
      All aom_read_xxx functions now have an additional string parameter
      that specifies the symbol name. By default, the ACCT_STR macro is
      used (which expands to __func__). For more precise accounting,
      these should be replaced with more descriptive names.
      
      Change-Id: Ia2e1343cb842c9391b12b77272587dfbe307a56d
      e6b12944
  5. 06 Sep, 2016 3 commits
  6. 04 Sep, 2016 1 commit
  7. 03 Sep, 2016 1 commit
  8. 02 Sep, 2016 2 commits
  9. 01 Sep, 2016 6 commits
  10. 25 Aug, 2016 1 commit
  11. 24 Aug, 2016 1 commit
    • Nathan E. Egge's avatar
      Revert code formatting of OD_UNIFORM_CDFS_Q15. · 53c1bbbf
      Nathan E. Egge authored
      The formatting of OD_UNIFORM_CDFS_Q15[] in entcode.c is helpful for
       for understanding what is contained in the array (e.g., the uniform
       probability distributions of small sizes 2 through 16).
      This patch reverts the change made in f4b2926d and adds linter hints to
       ignore the formatting.
      
      Change-Id: I2ad9fe6673b86e6067cb97b40f0f0e69a119cdf5
      53c1bbbf
  12. 23 Aug, 2016 3 commits
  13. 22 Aug, 2016 5 commits
  14. 19 Aug, 2016 1 commit
    • Alex Converse's avatar
      Skip the empty symbol in ANS symbol lookup. · 25729f1c
      Alex Converse authored
      There are lots of potential ways to speed up symbol lookup, but this is
      faster than what is checked in, and matches what was used in the
      benchmark.
      
      Longterm the empty symbol should still be removed.
      
      Change-Id: I2fb9347be578a07ea6f0615c19f50ddd5aace140
      25729f1c
  15. 18 Aug, 2016 2 commits
    • Nathan E. Egge's avatar
      Rename aom_read_tree_cdf() to aom_read_symbol(). · 9ac7a9dc
      Nathan E. Egge authored
      Change-Id: Ifc065eb3eee0960be2e085e7e18478bf91bf56dd
      9ac7a9dc
    • Nathan E. Egge's avatar
      Fix warning when discarding const qualifier. · b6ec6381
      Nathan E. Egge authored
      Cherry-pick Daala 211c2a41: Clean up EC tell() and tell_frac() functions.
      Add a const qualifier to the od_ec_enc and od_ec_dec parameters of
       the od_ec_enc_tell(), od_ec_enc_tell_frac(), od_ec_dec_tell(), and
       od_ec_dec_tell_frac() functions.
      Add an OD_WARN_UNUSED_RESULT to od_ec_enc_tell_frac().
      
      Change-Id: Ia50e2fd75e98d8a03d993449d658b695cf56e6fb
      b6ec6381
  16. 16 Aug, 2016 1 commit
    • clang-format's avatar
      apply clang-format · 6885324b
      clang-format authored
      post DerivePointerAlignment change
      
      Change-Id: Ibdabea5710ea364f45d70fff018d260355124817
      6885324b
  17. 15 Aug, 2016 1 commit
  18. 10 Aug, 2016 1 commit
  19. 01 Aug, 2016 1 commit
    • Yue Chen's avatar
      Add weighted motion search for obmc predictor · 72d3ba8a
      Yue Chen authored
      Also port SIMD optimization of weighted sad/variance functions to
      av1.
      Coding gain improvement: 0.339/0.413/0.328 (lowres/midres/hdres)
      Current coding gain: 2.437/2.428/2.294
      Encoding time overhead: 17% (soccer_cif), 30% (ped_1080p25), was
      12% and 18% without motion search
      
      Change-Id: I101d6ce729f769853756edc8ced6f3a2b8d8f824
      72d3ba8a
  20. 29 Jul, 2016 1 commit
  21. 26 Jul, 2016 1 commit
    • Yue Chen's avatar
      Port SIMD optimization for obmc blending functions to av1 · 2478bed5
      Yue Chen authored
      SIMD optimization for 1d blending functions in obmc mode, and some
      code refactoring and cleanup.
      
      (ped_1080p25.y4m, 150 frame, 2000 tb)
      Encoding time overhead: +18.8% -> +18.1%
      Decoding time overhead: +21.3% -> +8.7%
      Change-Id: I9d856c32136e7e0e6e24ab5520ef901d7b1ee9c8
      2478bed5
  22. 20 Jul, 2016 1 commit
  23. 18 Jul, 2016 1 commit