1. 07 Feb, 2013 2 commits
    • Jingning Han's avatar
      Butterfly ADST based hybrid transform · d15e1da4
      Jingning Han authored
      Refactor the 8x8 inverse hybrid transform. It is now consistent
      with the new inverse DCT. Overall performance loss (due to the
      use of this variant ADST, and the rounding errors in the butterfly
      implementation) for std-hd is -0.02.
      
      Fixed BUILD warning.
      
      Devise a variant of the original ADST, which allows butterfly
      computation structure. This new transform has kernel of the
      form: sin((2k+1)*(2n+1) / (4N)). One of its butterfly structures
      using floating-point multiplications was reported in Z. Wang,
      "Fast algorithms for the discrete W transform and for the discrete
      Fourier transform", IEEE Trans. on ASSP, 1984.
      
      This patch includes the butterfly implementation of the inverse
      ADST/DCT hybrid transform of dimension 8x8.
      
      Change-Id: I3533cb715f749343a80b9087ce34b3e776d1581d
      d15e1da4
    • Ronald S. Bultje's avatar
      Use fdct8x4 instead of fdct4x4 where the block size allows it. · 5cfd82bc
      Ronald S. Bultje authored
      This allows for faster SIMD implementations in the future (currently
      there is no speed impact).
      
      Change-Id: I732647e9148b5dcb44e6bc8728138f0141218329
      5cfd82bc
  2. 05 Feb, 2013 1 commit
    • Ronald S. Bultje's avatar
      [WIP] Add column-based tiling. · 1407bdc2
      Ronald S. Bultje authored
      This patch adds column-based tiling. The idea is to make each tile
      independently decodable (after reading the common frame header) and
      also independendly encodable (minus within-frame cost adjustments in
      the RD loop) to speed-up hardware & software en/decoders if they used
      multi-threading. Column-based tiling has the added advantage (over
      other tiling methods) that it minimizes realtime use-case latency,
      since all threads can start encoding data as soon as the first SB-row
      worth of data is available to the encoder.
      
      There is some test code that does random tile ordering in the decoder,
      to confirm that each tile is indeed independently decodable from other
      tiles in the same frame. At tile edges, all contexts assume default
      values (i.e. 0, 0 motion vector, no coefficients, DC intra4x4 mode),
      and motion vector search and ordering do not cross tiles in the same
      frame.
      t log
      
      Tile independence is not maintained between frames ATM, i.e. tile 0 of
      frame 1 is free to use motion vectors that point into any tile of frame
      0. We support 1 (i.e. no tiling), 2 or 4 column-tiles.
      
      The loopfilter crosses tile boundaries. I discussed this briefly with Aki
      and he says that's OK. An in-loop loopfilter would need to do some sync
      between tile threads, but that shouldn't be a big issue.
      
      Resuls: with tiling disabled, we go up slightly because of improved edge
      use in the intra4x4 prediction. With 2 tiles, we lose about ~1% on derf,
      ~0.35% on HD and ~0.55% on STD/HD. With 4 tiles, we lose another ~1.5%
      on derf ~0.77% on HD and ~0.85% on STD/HD. Most of this loss is
      concentrated in the low-bitrate end of clips, and most of it is because
      of the loss of edges at tile boundaries and the resulting loss of intra
      predictors.
      
      TODO:
      - more tiles (perhaps allow row-based tiling also, and max. 8 tiles)?
      - maybe optionally (for EC purposes), motion vectors themselves
        should not cross tile edges, or we should emulate such borders as
        if they were off-frame, to limit error propagation to within one
        tile only. This doesn't have to be the default behaviour but could
        be an optional bitstream flag.
      
      Change-Id: I5951c3a0742a767b20bc9fb5af685d9892c2c96f
      1407bdc2
  3. 14 Jan, 2013 1 commit
    • Ronald S. Bultje's avatar
      Remove compound intra-intra experiment. · c9071601
      Ronald S. Bultje authored
      This experiment gives little gains and adds relatively much code
      complexity (and it hinders other experiments), so let's get rid of
      it.
      
      Change-Id: Id25e79a137a1b8a01138aa27a1fa0ba4a2df274a
      c9071601
  4. 26 Dec, 2012 1 commit
  5. 12 Dec, 2012 1 commit
    • Scott LaVarnway's avatar
      Improved vp9_ihtllm_c · b575394e
      Scott LaVarnway authored
      As suggested by Yaowu, we can use eob to reduce the complexity
      of the vp9_ihtllm_c function.  For the 1080p test clip used, the decoder
      performance improved by 17%.
      
      Change-Id: I32486f2f06f9b8f60467d2a574209aa3a3daa435
      b575394e
  6. 29 Nov, 2012 4 commits
    • Jim Bankoski's avatar
      unused variable · 6e02947e
      Jim Bankoski authored
      Change-Id: I1302a6eaa840d419e8bb9ad0673e42ef139d3fee
      6e02947e
    • Jim Bankoski's avatar
      ihtllm moves to rtcd · 030e268a
      Jim Bankoski authored
      clears up some warnings
      
      Change-Id: I9899637497c6ad7519f098e055ab98580ae6d688
      030e268a
    • Jim Bankoski's avatar
      more rtcd cleanup · 13dbf1fb
      Jim Bankoski authored
      Change-Id: Ieefd76e164ca4aa87597da0412977614ddfbacb7
      13dbf1fb
    • Deb Mukherjee's avatar
      Fixing 8x8/4x4 ADST for intra modes with tx select · 0742b1e4
      Deb Mukherjee authored
      This patch allows use of 8x8 and 4x4 ADST correctly for Intra
      16x16 modes and Intra 8x8 modes when the block size selected
      is smaller than the prediction mode. Also includes some cleanups
      and refactoring.
      
      Rebase.
      
      Change-Id: Ie3257bdf07bdb9c6e9476915e3a80183c8fa005a
      0742b1e4
  7. 28 Nov, 2012 1 commit
  8. 27 Nov, 2012 1 commit
    • John Koleszar's avatar
      Add vp9_ prefix to all vp9 files · fcccbcbb
      John Koleszar authored
      Support for gyp which doesn't support multiple objects in the same
      static library having the same basename.
      
      Change-Id: Ib947eefbaf68f8b177a796d23f875ccdfa6bc9dc
      fcccbcbb
  9. 25 Nov, 2012 1 commit
  10. 15 Nov, 2012 1 commit
  11. 10 Nov, 2012 1 commit
    • Deb Mukherjee's avatar
      New b-intra mode where direction is contextual · d01357bb
      Deb Mukherjee authored
      Preliminary patch on a new 4x4 intra mode B_CONTEXT_PRED where the
      dominant direction from the context is used to encode. Various decoder
      changes are needed to support decoding of B_CONTEXT_PRED in conjunction
      with hybrid transforms since the scan order and tokenization depends on
      the actual direction of prediction obtained from the context. Currently
      the traditional directional modes are used in conjunction with the
      B_CONTEXT_PRED, which also seems to provide the best results.
      
      The gains are small - in the 0.1% range.
      
      Change-Id: I5a7ea80b5218f42a9c0dfb42d3f79a68c7f0cdc2
      d01357bb
  12. 01 Nov, 2012 3 commits
  13. 31 Oct, 2012 6 commits
  14. 30 Oct, 2012 1 commit
    • Yunqing Wang's avatar
      Convert the transforms to integer forms. · 8a9e4e8a
      Yunqing Wang authored
      Converted the forward and inverse transforms to integer forms.
      
      Modify #define TEST_INT 1/0
      in the code to call integer/float version of transforms.
      
      The tests showed that average OVERALL PSNR loss was less than 0.1%.
      
      Change-Id: I1dfa4eeab6412597e3b970ce299cf0e116a917e6
      8a9e4e8a
  15. 29 Oct, 2012 2 commits
  16. 22 Oct, 2012 2 commits
  17. 21 Oct, 2012 1 commit
  18. 19 Oct, 2012 1 commit
    • Deb Mukherjee's avatar
      Some cleanups and fixes. · f3208f36
      Deb Mukherjee authored
      Separates the logic on transform type selection previously spread out
      over a number of files into a separate function. Currently the tx_type
      field in b_mode_info is not used, but still left in there to eventually
      use for signaling the transform type in the bitstream.
      
      Also, now for tx_type = DCT_DCT, the regular integer DCT is used, as
      opposed to the floating point DCT used in conjuction with hybrid
      transform.
      
      Results change somewhat due to the transform change, but are within
      reasonable limits. The hd/std-hd sets are slightly up, while derf/yt
      are slightly down.
      
      Change-Id: I5776840c2239ca2da31ca6cfd7fd1148dc5f9e0f
      f3208f36
  19. 16 Oct, 2012 1 commit
  20. 15 Oct, 2012 3 commits
    • Ronald S. Bultje's avatar
      Remove test from B_MODE_INFO. · 23a64c89
      Ronald S. Bultje authored
      It is essentially a duplicate of mode for RD-only purposes. Removing it
      saves us 4 bytes per B_MODE_INFO, or ~0.5MB for a 1080p video encode.
      
      Change-Id: I0a54db5f51658b3946d7efb1ca6e8cfbda0cdf88
      23a64c89
    • Ronald S. Bultje's avatar
      Minor refactoring in encodeintra.c. · a0ad521c
      Ronald S. Bultje authored
      Merge code blocks for different transform sizes; use MACROBLOCKD as a
      temp variable where that leads to smaller overall source code; remove
      duplicate code under #if CONFIG_HYBRIDTRANSFORM/#else blocks. Some style
      changes to make it follow the style guide a little better.
      
      Change-Id: I1870a06dae298243db46e14c6729c96c66196525
      a0ad521c
    • Ronald S. Bultje's avatar
      Remove duplicate or unused code in encoder/encodemb.c. · 238ed517
      Ronald S. Bultje authored
      Also make some minor stylistic changes to bring the code closer to
      the style guide. Remove distinction between inter and intra transform
      functions, since both do exactly the same thing except for the check
      against SPLITMV for the second-order transform. Remove some commented
      out debug code. Remove 8x8/16x16 transform code in encode_inter16x16y(),
      since the first-pass only uses 4x4 anyway.
      
      Change-Id: Ife54816ff759825b9141f95dc2ba43c253c14dba
      238ed517
  21. 14 Oct, 2012 4 commits
  22. 11 Oct, 2012 1 commit