1. 31 Oct, 2011 1 commit
    • Paul Wilkins's avatar
      Segmentation Entropy and tweaks. · 795c6dd2
      Paul Wilkins authored
      Some correction for entropy impact of segment signaled (EOB and ref frame)
      
      Other slight tweaks.
      
      Derf VBR average gain now over 1% (best over 7%)
      One YT test clip has gains of circa 30% (VBR)
      
      There is still an issue with noisy clips where making the background static
      and coded with 0,0 can have a negative effect, especially at low Q.
      This is probably because of the loss of smoothing by fractional pixel filters.
      
      Change-Id: I7a225613c98067b96f8fc7a7e36f95d465b2b834
      795c6dd2
  2. 27 Oct, 2011 1 commit
    • Paul Wilkins's avatar
      Resolve build problem · afb52f65
      Paul Wilkins authored
      Resolved experimental branch build problem when seg_features not configured.
      
      Change-Id: Ia0f9b460a26dc3eac9844ee595a7b196e9faf6a5
      afb52f65
  3. 24 Oct, 2011 3 commits
    • Yaowu Xu's avatar
      added a last stage rounding for 8x8 inverse dct · a66c945c
      Yaowu Xu authored
      Prior to the added rounding, tests on randomly generated data showed
      that forward-inverse transform round trip errors are about 3.02/block
      for input range [-10,10] and 2.68/block for input range [-256, 255].
      The added rounding reduced the errors to 0.031/block for input range
      [-10,10] and 0.037/block for input range [-256, 255].
      
      Maximum round trip error on for any pixel position is 1.
      
      The average errors are calculated based on 100,000 blocks of randomly
      with the specified ranges.
      
      Paul mentioned in discussion that the change was not clear on why we
      need change the rounding, so Patch 2 intends to make the rationale
      obvious in code, it merged the two separate shifts into one, and the
      two separate rounding factors into one. Patch 1 and 2 have same
      numerical test results.
      
      Change-Id: Ic5e2f5463de17253084d8b2398c4a210194b20de
      a66c945c
    • Paul Wilkins's avatar
      Segmentation Features; · 23701f4f
      Paul Wilkins authored
      Only encode sign bit for feature data that can have a sign.
      
      Tweaks to the test segmentation rules so that it now actually gives
      a net benefit on the derf set of about 0.4% though much higher
      on some clips at the low end.
      
      Change-Id: I8e61f1aebf41c9037db7e67e2f8975aa18a0c986
      23701f4f
    • Paul Wilkins's avatar
      Further segment feature extensions. · 01ce04bc
      Paul Wilkins authored
      This quite large check in includes the following:
      
      Merge in some code from Ronald (mbgraph.c) that scans a Gf/arf group.
      This is used as a basis for a simple segmentation for the normal frames
      in a gf/arf group. This code also uses satd functions from Yaowu.
      
      Adds functionality for coding the latest possible position of an EOB for
      blocks in the segment. (Currently 0-15 only, hence just for 4x4 dct).
      Where the EOB position is 0 this acts like "skip" and the normal coding
      of skip at the per mb level is disabled.
      
      Added functions (seg_common.c) for setting and reading segment feature
      elements. These may want to be optimized away at some point but while the
      mecahnism is in a state of flux they provide a single location for making
      changes and keep things a bit cleaner.
      
      This is still proof of concept code. Currently the tested feature set:-
      
      Quantizer,
      Loop Filter level,
      Reference frame,
      Prediction Mode,
      EOB end stop.
      
      TBD:-
      
      Add functions for setting and reading the feature data with range
      and validity checking.
      
      Handling of signed and unsigned feature data. At the moment all is assumed
      to be signed and a sign bit is coded but many cannot be negative.
      
      Correct handling of EOB feature with intra coded blocks.
      
      Testing/trapping of legal/illegal ref frame and mode combinations.
      
      Transform size switch plus merge and test with 8c8 DCT work
      
      Merge and test with Sumans Segmenation coding optimizations
      
      Change-Id: Iee12e83661c7abbd1e0ce6810915eb4ec35e2d8e
      01ce04bc
  4. 14 Oct, 2011 1 commit
    • Yaowu Xu's avatar
      fixed the wrong rounding in inverse haar transform · 152ce6b2
      Yaowu Xu authored
      Given the current forward haar transform:
       f0 = I0 + I1 + I2 + I3
       f1 = I0 + I1 - I2 - I3
       f2 = I0 - I1 + I2 - I3
       f3 = I0 - I1 - I2 + I3
      the output of the inverse haar prior rounding:
       i0 = f0 + f1 + f2 + f3 = I0 * 4;
       i1 = f0 + f1 - f2 - f3 = I1 * 4;
       i2 = f0 - f1 + f2 - f3 = I2 * 4;
       i3 = f0 - f1 - f2 + f3 = I3 * 4;
      As all the numbers are 4 multiples, simply >>2 always produces prefect
      results in term of forward-inverse transform round trip error.
      
      Change-Id: Id6658b00ea819ee61cfeef8c5985d4cd3e77f44e
      152ce6b2
  5. 08 Oct, 2011 1 commit
    • Yaowu Xu's avatar
      fixed a decoder bug · 3ca84969
      Yaowu Xu authored
      When 8x8 transform is enabled, the decoder does an extra reconstruct
      on MBs that are coded using 8x8. This commit fixed the logic around
      the decoding of mb encoded with 8x8 transform.
      
      Change-Id: I6926557c9ef00eecb375f62946f7e140c660bf6f
      3ca84969
  6. 30 Sep, 2011 2 commits
    • Paul Wilkins's avatar
      Segment coding of mode and reference frame. · 156b221a
      Paul Wilkins authored
      Proof of concept test code that encodes mode and reference
      frame data at the segment level.
      
      Decode-able bit stream but some issues not yet resolved.
      As it this helps a little on a couple of clips but hurts on most as
      the basis for segmentation is unsound.
      
      To build and test, configure with
      --enable-experimental --enable-segfeatures
      
      Change-Id: I22a60774f69273523fb152db8c31f4b10b07c7f4
      156b221a
    • Paul Wilkins's avatar
      Experimental: segfeature added. · 45e49e6e
      Paul Wilkins authored
      New setting added to configure script
      45e49e6e
  7. 20 Sep, 2011 3 commits
    • Fritz Koenig's avatar
      Move neon only arm functions under arm/neon. · bd0c3409
      Fritz Koenig authored
      These files don't contain generic arm code, so should
      only be compiled by neon.
      
      Change-Id: Ie712823aa04d4235e7cfe7a3b725e73ee4c3e564
      bd0c3409
    • Tero Rintaluoma's avatar
      NEON FDCT updated to match current C code · 0c2529a8
      Tero Rintaluoma authored
      - Removed fast_fdct4x4_neon and fast_fdct8x4_neon
      - Uses now short_fdct4x4 and short_fdct8x4
      - Gives ~1-2% speed-up on Cortex-A8/A9
      
      Change-Id: Ib62f2cb2080ae719f8fa1d518a3a5e71278a41ec
      0c2529a8
    • Tero Rintaluoma's avatar
      Fixed armv5te multiplications · 3c19bc3f
      Tero Rintaluoma authored
      Rd and Rm registers should be different in 'mul'. This register
      combination results in unpredictable behaviour. GCC will give
      a warning and RVCT an error in this case.
      
      Restriction applies only to armv5 targets and not for armv6 and above.
      
      Change-Id: I378d17c51e1f16a6820814fbed43e115aaabb03e
      3c19bc3f
  8. 19 Sep, 2011 3 commits
    • Stefan Holmer's avatar
      Fix necessary for input partitions iface to match the RTP profile · e529a825
      Stefan Holmer authored
      These changes fixes a glitch between the RTP profile and the input
      partitions interface. Since there's no way for the user to know the
      actual number of partitions, the decoder have to read the
      multi_token_paritition bits also when input partitions mode is
      enabled.
      
      Included are also a couple of fixes for issues with independent
      partitions and uninitialized memory reads.
      
      Change-Id: I6f93b15287d291169ed681898ed3fbcc5dc81837
      e529a825
    • Tero Rintaluoma's avatar
      Updated ARMv6 forward transforms to match C · 4c3ad66b
      Tero Rintaluoma authored
      - Updated walsh transform to match C
        (based on Change Id24f3392)
      - Changed fast_fdct4x4 and 8x4 to short_fdct4x4 and 8x4
        correspondingly
      
      Change-Id: I704e862f40e315b0a79997633c7bd9c347166a8e
      4c3ad66b
    • Tero Rintaluoma's avatar
      NEON walsh transform updated to match C · 2a4b2a00
      Tero Rintaluoma authored
      Modified original patch If2f07220885c4c3a0cae0dace34ea0e36124f001
      according to comments. Scheduled code a little bit to prevent some
      interlocks.
      
      Change-Id: I338f02b881098782f82af63d97f042b85e63e902
      2a4b2a00
  9. 16 Sep, 2011 4 commits
    • Yaowu Xu's avatar
      enable selecting&transmitting to for intra mode entropy · 1d44e7ce
      Yaowu Xu authored
      This commit added a 3 bit index to the bitstream, the index is used to
      look into the intra mode coding entropy context table. The commit uses
      the mode stats to calculate the cost of transmitting modes using 8
      possible entropy distributions, and selects the distribution that
      provides the lowest cost to do the actual mode coding.
      
      Initial test show this provides additional .2%~.3% gain over quantizer
      adaptive intra mode coding. So the adaptive intra mode coding provides
      a total of .5%(psnr) to .6% gain(ssim) combined for all-key-encoding
      
      To build and test, configure with
      --enable-experimental --enable-qimode
      
      Change-Id: I7c41cd8bfb352bc1fe7c5da1848a58faea5ed74a
      1d44e7ce
    • Yaowu Xu's avatar
      add quantizer adaptive intra mb mode encoding · aac2c126
      Yaowu Xu authored
      make intra mode coding entropy distribution adaptive to baseQindex, an
      encoding test on hd clips with all key frame shows universal gain on
      all clips in both .2%(psnr) and (ssim).3%.
      
      To build and test, configure with
      --enable-experimental --enable-qimode
      
      Change-Id: Iaa69241b984d4fdd8baa6d77ee78c0140f5ac00a
      aac2c126
    • Yaowu Xu's avatar
      add 8x8 intra prediction modes · ca6b85aa
      Yaowu Xu authored
      Patch 1 to Patch 3 is an initial implementation of 8x8 intra prediction
      modes, here are with the following assumptions:
      a. 8x8 has 4 prediction modes DC, H, V and TM
      b. UV 4x4 block use the same mode as corresponding 8x8 area
      c. i8x8 modes are enabled for key frame only for now
      Patch 4:
      d. removed debug code from previous patches
      Patch 5:
      e. added stats code to collect entropy stats and further cleaned up
      Patch 6:
      f. changed mode stats code to collect finer stats of modes
      Patch 7:
      g. normalized i8x8 modes distribution to total at 256 (8bits).
      Patch 8:
      h. fixed a bug in decoder and removed debug printf output.
      Patch 9:
      i. more cleanups to address paul's comment
      Patch 10:
      j. messy rebase/merges to bring the commit up to date.
      
      Tests on HD clips encoded with all key frame showing consistent gain
      on all clips and all metrics:~0.5%(psnr) and 0.6%(ssim):
      http://www.corp.google.com/~yaowu/no_crawl/i8x8hd_allkey_fixedq.html
      
      To build and test, configure with:
      --enable-experimental --enable-i8x8
      
      Change-Id: I9813fe07ae48cab5fdb5d904bca022514ad01e7f
      ca6b85aa
    • Stefan Holmer's avatar
      Fixes the boundary checks for extrapolated and interpolated MVs. · b854bbd8
      Stefan Holmer authored
      Change-Id: I5b47d39d1604f2650d2f2d1ca2a3f40843c8e1ea
      b854bbd8
  10. 15 Sep, 2011 1 commit
    • Paul Wilkins's avatar
      Segment Feature Signaling · ceb51742
      Paul Wilkins authored
      Plumbing for tuning new segment features on and off.
      
      Change-Id: If86cd6f103296b73030e8af7cf85c5b9bbffdbaf
      ceb51742
  11. 13 Sep, 2011 4 commits
    • Paul Wilkins's avatar
      Reverse coding order for segment features: · 1741cc7a
      Paul Wilkins authored
      Code all the features for one segment (grouped together)
      then all for the next etc. etc. rather than grouping the
      data by feature.
      
      Change-Id: I2a65193b3a70aca78f92e855e35d8969d857b6dd
      1741cc7a
    • Scott LaVarnway's avatar
      Fixed encoder crash · 5bc7b3a6
      Scott LaVarnway authored
      caused by the "Removed bmi copy to/from BLOCKD" commit.
      
      Change-Id: I9fae71bdc34c8ecc07bb81cd3ccf498b91ce3ec7
      5bc7b3a6
    • Paul Wilkins's avatar
      Change to segment_feature_data[][] structure. · 1c24442a
      Paul Wilkins authored
      This data structure is  now [Segment ID][Features]
      rather than [Features][Segment_ID]
      
      I propose as a separate modification to make the experimental
      bit stream reflect this such that all the features for a segment
      are coded together.
      
      Change-Id: I581e4e3ca2033bdbdef3d9300977a8202f55b4fb
      1c24442a
    • Paul Wilkins's avatar
      Segment Features: · dfbc61f3
      Paul Wilkins authored
      Some basic plumbing added for a range of segment level features.
      MB_LVL_* changed to SEG_LVL_* to better reflect meaning.
      
      Change-Id: Iac96da36990aa0e40afc0d86e990df337fd0c50b
      dfbc61f3
  12. 03 Sep, 2011 1 commit
    • Yaowu Xu's avatar
      fixed a decoder bug · 1fa77e28
      Yaowu Xu authored
      the bug appears to be introduced from a merge at Commit:62400028..
      
      Change-Id: I332d78b673f12e5ef2b0cdd6bab57dc2c8af7a72
      1fa77e28
  13. 31 Aug, 2011 1 commit
  14. 30 Aug, 2011 1 commit
  15. 25 Aug, 2011 1 commit
    • Yunqing Wang's avatar
      Minor modification on key frame decision · 1f20202e
      Yunqing Wang authored
      This change makes sure that no key frame recoding in real-time mode
      even if CONFIG_REALTIME_ONLY is not configured.
      
      Change-Id: Ifc34141f3217a6bb63cc087d78b111fadb35eec2
      1f20202e
  16. 24 Aug, 2011 4 commits
    • Fritz Koenig's avatar
      Quiet warning by removing unused variable. · 4797a972
      Fritz Koenig authored
      fwd_boost_score was not being computed or
      referenced, so remove declaration.
      
      Change-Id: Iece36cde1ec113e3c6afaff1407d24cdf12bd0a8
      4797a972
    • Scott LaVarnway's avatar
      Removed bmi copy to/from BLOCKD · b870947d
      Scott LaVarnway authored
      for SPLITMV and B_PRED modes.  Modified code to use the bmi
      found in mode_info_context instead of BLOCKD.  On the decode
      side, the uvmvs are calculated only when required, instead of
      every macroblock.  This is WIP. (bmi should eventually be
      removed from BLOCKD)
      Small performance gains noticed for RT encodes and decodes.(VGA)
      
      Change-Id: I2ed7f0fd5ca733655df684aa82da575c77a973e7
      b870947d
    • Fritz Koenig's avatar
      Fix naming of sse2 idct functions. · 112bd4e2
      Fritz Koenig authored
      Prepend idct function names with vp8_
      so that under profiling they show up
      associated with libvpx.
      
      Change-Id: I4fe357b50236cb7730a4cc00164c0a3487a1d8b4
      112bd4e2
    • Johann's avatar
      Fix data accesses for simple loopfilters · 85358d04
      Johann authored
      The data that the simple horizontal loopfilter reads is aligned, treat
      it accordingly.
      
      For the vertical, we only use the bottom 4 bytes, so don't read in 16
      (and incur the penalty for unaligned access).
      
      This shows a small improvement on older processors which have a
      significant penalty for unaligned reads.
      
      postproc_mmx.c is unused
      
      Change-Id: I87b29bbc0c3b19ee1ca1de3c4f47332a53087b3d
      85358d04
  17. 23 Aug, 2011 1 commit
    • Fritz Koenig's avatar
      Use local labels for jumps/loops in x86 assembly. · c5f890af
      Fritz Koenig authored
      Prepend . to local labels in assembly code.  This
      allows non unique labels within a file.  Also
      makes profiling information more informative
      by keeping the function name with the loop name.
      
      Change-Id: I7a983cb3a5ba2413d5dafd0a37936b268fb9e37f
      c5f890af
  18. 22 Aug, 2011 2 commits
  19. 19 Aug, 2011 2 commits
    • Fritz Koenig's avatar
      Reclasify optimized ssim calculations as SSE2. · 01376858
      Fritz Koenig authored
      Calculations were incorrectly classified as either
      SSE3 or SSSE3.  Only using SSE2 instructions.
      Cleanup function names and make non-RTCD code work
      as well.
      
      Change-Id: I29f5c2ead342b2086a468029c15e2c1d948b5d97
      01376858
    • Alpha Lam's avatar
      Copy less when active map is in use · 4e8d35a4
      Alpha Lam authored
      When active map is specified and the current frame is not a key frame,
      golden frame nor a altref frame then copy only those active regions.
      
      This significantly reduces encoding time by as much as 19% on the test
      system where realtime encoding is used. This is particularly useful
      when the frame size is large (e.g. 2560x1600) and there's only a few
      action macroblocks.
      
      Change-Id: If394a813ec2df5a0201745d1348dbde4278f7ad4
      4e8d35a4
  20. 17 Aug, 2011 1 commit
    • Paul Wilkins's avatar
      Small boost to every other frame. · 744f4823
      Paul Wilkins authored
      Instead of a single mid GF boost apply a few extra bits to
      every other frame. This gives a very small average metrics
      improvement on both derf and YT sets.
      
      Also use min GF interval as min KF interval.
      
      Change-Id: Iee238b8cae0ffaed850a5a944ac825cee18da485
      744f4823
  21. 16 Aug, 2011 1 commit
    • Scott LaVarnway's avatar
      Faster vp8_default_coef_probs · 19987dcb
      Scott LaVarnway authored
      Copies from a generated table instead of building the
      default coeff probabilities during runtime.
      
      Change-Id: I4d9551ea3a2d7d4a4f7ce9eda006495221a8de50
      19987dcb
  22. 15 Aug, 2011 1 commit