1. 13 Jun, 2014 1 commit
  2. 12 Jun, 2014 1 commit
    • Jingning Han's avatar
      Fast computation path for forward transform and quantization · ccba289f
      Jingning Han authored
      This commit enables a fast path computational flow for forward
      transformation. It checks the sse and variance of prediction
      residuals and decides if the quantized coefficients are all
      zero, dc only, or more. It then selects the corresponding coding
      path in the forward transformation and quantization stage.
      
      It is currently enabled in rtc coding mode. Will do it for rd
      coding mode next.
      
      In speed -6, the runtime for pedestrian_area 1080p at 1000 kbps
      goes down from 14234 ms to 13704 ms, i.e., about 4% speed-up.
      Overall coding performance for rtc set is changed by -0.18%.
      
      Change-Id: I0452da1786d59bc8bcbe0a35fdae9f623d1d44e1
      ccba289f
  3. 19 May, 2014 1 commit
    • Jingning Han's avatar
      Adjust the forward 16x16 DCT computation steps · 7f547336
      Jingning Han authored
      This commit adjusts the forward 16x16 DCT computation steps to
      simplify the register level operations. It fixes the corresponding
      sse2 version accordingly.
      
      Change-Id: I72a9c25b8ca9442fc5e113f47cd701ae55aa7f08
      7f547336
  4. 13 Feb, 2014 1 commit
  5. 06 Feb, 2014 1 commit
  6. 28 Jan, 2014 1 commit
  7. 21 Nov, 2013 1 commit
    • Jingning Han's avatar
      Take out assertion from inverse transforms · 272c82c1
      Jingning Han authored
      Separate the rounding and right shift operations of forward transform
      from those of inverse transform. Take out the assertion check from
      inverse transforms. If the transform coefficients were constructed to
      cause intermediate steps of inverse transform overflow, the codec will
      just let it overflow without breaking the decoding flow.
      
      Change-Id: Ia7ce15dfd1a73b4abbaa78cbc74ec718523c5b1b
      272c82c1
  8. 15 Nov, 2013 1 commit
    • Jingning Han's avatar
      Take out assertion from inverse transforms · bdc43711
      Jingning Han authored
      Separate the rounding and right shift operations of forward transform
      from those of inverse transform. Take out the assertion check from
      inverse transforms. If the transform coefficients were constructed to
      cause intermediate steps of inverse transform overflow, the codec will
      just let it overflow without breaking the decoding flow.
      
      Change-Id: I73cfc3706c4e840fc543a77cbc4cdb0b05d07730
      bdc43711
  9. 26 Oct, 2013 1 commit
    • Dmitry Kovalev's avatar
      Adding fht{4x4, 8x8, 16x16} functions. · ae2f732e
      Dmitry Kovalev authored
      Adding these functions to encapsulate tx_type check. Changing TX_TYPE to
      int to match the declaration in vo9_rtch.h.
      
      Change-Id: I6f3a2df6e35595ca73b6aaa9e3909ee7bc3fd16f
      ae2f732e
  10. 24 Oct, 2013 1 commit
  11. 23 Oct, 2013 4 commits
  12. 21 Oct, 2013 1 commit
  13. 18 Oct, 2013 2 commits
  14. 17 Oct, 2013 1 commit
  15. 15 Oct, 2013 1 commit
  16. 11 Oct, 2013 1 commit
  17. 10 Oct, 2013 1 commit
    • Dmitry Kovalev's avatar
      Consistent names for FDCT functions. · fc82dbb4
      Dmitry Kovalev authored
      Renames:
        fdct4_1d   -> fdct4
        fadst4_1d  -> fadst4
        fdct8_1d   -> fdct8
        fadst8_1d  -> fadst8
        fdct16_1d  -> fdct16
        fadst16_1d -> fadst16
      
      "_1d" suffix is redundant, so removing it. The same will happen with idct
      in the next change sets.
      
      Change-Id: Ibf421cd2f569146c6079269df7a31819c098265e
      fc82dbb4
  18. 04 Oct, 2013 1 commit
  19. 24 Sep, 2013 1 commit
    • Yaowu Xu's avatar
      Rename defined constants · 6037f179
      Yaowu Xu authored
      The change is to better reflect the nature of the constants.
      
      Change-Id: Icabac6e9bceefbdb3f03f8218f88ef75943c30fb
      6037f179
  20. 19 Sep, 2013 1 commit
  21. 01 Sep, 2013 1 commit
    • Jingning Han's avatar
      Fix 32x32 forward transform SSE2 version · 3cf46fa5
      Jingning Han authored
      This commit fixed the potential overflow issue in the SSE2
      implementation of 32x32 forward DCT. It resolved the corrupted
      coded frames in the border of scenes.
      
      Change-Id: If87eef2d46209269f74ef27e7295b6707fbf56f9
      3cf46fa5
  22. 03 Jul, 2013 1 commit
    • Jingning Han's avatar
      Refactor SSE2 8x8 functional units · 2cb75c96
      Jingning Han authored
      These serve as building blocks for SSE2 8x8 and 16x16 ADST/DCT
      hybrid transform coding.
      
      Change-Id: I4089a754c66e0c986f67d9b8ec4dfb9627ad430d
      2cb75c96
  23. 29 Jun, 2013 1 commit
  24. 25 Jun, 2013 1 commit
    • Jingning Han's avatar
      Add 8x8 dct/adst unit tests · ab362621
      Jingning Han authored
      This commit enables 8x8 DCT and hybrid transform unit tests. It
      also tunes the forward hybrid transform rounding opertions for
      more precise round-trip performance.
      
      Change-Id: If05c1ce59d75d641b9c6c91527d02d3a6ef498c3
      ab362621
  25. 18 Jun, 2013 1 commit
    • Jingning Han's avatar
      Make fdct32 computation flow within 16bit range · a41a4860
      Jingning Han authored
      This commit makes use of dual fdct32x32 versions for rate-distortion
      optimization loop and encoding process, respectively. The one for
      rd loop requires only 16 bits precision for intermediate steps.
      The original fdct32x32 that allows higher intermediate precision (18
      bits) was retained for the encoding process only.
      
      This allows speed-up for fdct32x32 in the rd loop. No performance
      loss observed.
      
      Change-Id: I3237770e39a8f87ed17ae5513c87228533397cc3
      a41a4860
  26. 30 May, 2013 1 commit
    • Yaowu Xu's avatar
      Changed to use a new variant of WHT · 042e70e4
      Yaowu Xu authored
      The commit changed to use a new variant of Walsh-Hadamard Transform
      by Tim Terriberry. This new variant has the best compression among a
      number of variants that developed by Tim.
      
      Change-Id: Icb3a88515463cfc644b17ca046fcd139db2557e9
      042e70e4
  27. 27 May, 2013 1 commit
  28. 16 Apr, 2013 2 commits
  29. 15 Mar, 2013 1 commit
    • Christian Duvivier's avatar
      Faster vp9_short_fdct16x16. · 4418b790
      Christian Duvivier authored
      Scalar path is about 1.5x faster (3.1% overall encoder speedup).
      SSE2 path is about 7.2x faster (7.8% overall encoder speedup).
      
      Change-Id: I06da5ad0cdae2488431eabf002b0d898d66d8289
      4418b790
  30. 13 Mar, 2013 1 commit
    • Yaowu Xu's avatar
      removed reference to "LLM" and "x8" · 00555263
      Yaowu Xu authored
      The commit changed the name of files and function to remove obselete
      reference to LLM and x8.
      
      Change-Id: I973b20fc1a55149ed68b5408b3874768e6f88516
      00555263
  31. 28 Feb, 2013 1 commit
    • Christian Duvivier's avatar
      Faster vp9_short_fdct8x8. · c129203f
      Christian Duvivier authored
      Scalar path is about 1.4x faster (4% overall encoder speedup).
      SSE2 path is about 7x faster (13% overall encoder speedup).
      
      Change-Id: I7e85d8225a914a74c61ea370210414696560094d
      c129203f
  32. 27 Feb, 2013 1 commit
    • Dmitry Kovalev's avatar
      Code cleanup. · 347f3a0a
      Dmitry Kovalev authored
      Fixing code style, using array lookup instead of switch statements for
      forward hybrid transforms (in the same way as for their inverses).
      Consistent usage of ROUND_POWER_OF_TWO macro in appropriate places.
      
      Change-Id: I0d3822ae11f928905fdbfbe4158f91d97c71015f
      347f3a0a
  33. 26 Feb, 2013 2 commits
    • Yaowu Xu's avatar
      Improve 32x32 forward dct · 66d94ac1
      Yaowu Xu authored
      The commit improves the 32x32 forward dct implementation:
      1. change to use same constants and rounding as other forward dcts
      2. select rounding to specifically minimize the roundtrip error, which
      improved average 19/block to .77/block using 100000 random input.
      
      Test showed a small but consistent gain on all test sets, about .15%
      
      Change-Id: If0afd6a71880a522f60c1c234be0462092c2eb53
      66d94ac1
    • Dmitry Kovalev's avatar
      Changing pitch value meaning for fht and iht transforms. · 9bf3f751
      Dmitry Kovalev authored
      Pitch now means the number of elements, not the number of bytes.
      
      Change-Id: Idb9f2f012e39b09d596a3cc1802305a80b7c13af
      9bf3f751
  34. 25 Feb, 2013 1 commit
    • Jingning Han's avatar
      Improving the forward 16x16 ADST/DCT accuracy · 65821d66
      Jingning Han authored
      Increase the first stage dynamic range by 4 times, and reduce it
      back with proper rounding before applying the second stage. Hence
      it still fits in the given dynamic range and slightly improves
      the key frame coding performance.
      
      Change-Id: Ia4c5907446f20a95dc3de079c314b3ad1221d8aa
      65821d66