1. 02 Jul, 2013 4 commits
  2. 01 Jul, 2013 6 commits
  3. 29 Jun, 2013 7 commits
  4. 28 Jun, 2013 11 commits
    • Dmitry Kovalev's avatar
      Cosmetic reordering of FRAME_CONTEXT members. · 228b8232
      Dmitry Kovalev authored
      Change-Id: Id641e5188adf55e53e606e5813ae45feaf7abbd2
      228b8232
    • Dmitry Kovalev's avatar
      Cleanup inside vp9_decodemv.c. · 15fefced
      Dmitry Kovalev authored
      Adding read_skip_coeff function. Renaming decode_mv to read_mv for
      consistency with another function names. Removing redundant function
      arguments. Renaming kfread_modes to read_intra_mode_info, read_mb_modes_mv
      to read_inter_mode_info, vp9_decode_mb_mode_mv to vp9_read_mode_info,
      vp9_decode_mode_mvs_init to vp9_prepare_read_mode_info. Inlining function
      mb_mode_mv_init inside vp9_prepare_read_mode_info.
      
      Change-Id: Ifee05d333da4cd331d4aff40ce41ccd9b70e494a
      15fefced
    • Jingning Han's avatar
      Fix switch statement in 8x8 transform · 9def7f72
      Jingning Han authored
      Change-Id: I7c46354c4983feb5f6202c3ab4a1d9534da7e30f
      9def7f72
    • Ronald S. Bultje's avatar
      Inline vp9_get_coef_context() (and remove vp9_ prefix). · d00b8e5f
      Ronald S. Bultje authored
      Makes cost_coeffs() a lot faster:
      4x4: 236 -> 181 cycles
      8x8: 888 -> 588 cycles
      16x16: 3550 -> 2483 cycles
      32x32: 17392 -> 12010 cycles
      
      Total encode time of first 50 frames of bus (speed 0) @ 1500kbps goes
      from 2min51.6 to 2min43.9, i.e. 4.7% overall speedup.
      
      Change-Id: I16b8d595946393c8dc661599550b3f37f5718896
      d00b8e5f
    • Dmitry Kovalev's avatar
      Removing CONFIG_DEBUG checks on assertions. · 8e6ce6bb
      Dmitry Kovalev authored
      Adding CHECK_MEM_ERROR macro to vp9_common.h and removing two duplicated
      ones from vp9_onyx_int.h and vp9_onyxd_int.h.
      
      Change-Id: I916afec61b3019f18193135dac7c35ed0f89b8b6
      8e6ce6bb
    • Ronald S. Bultje's avatar
      Minor change to prevent one level of dereference in cost_coeffs(). · e3ce2b2a
      Ronald S. Bultje authored
      4x4: 234 -> 236 cycles
      8x8: 878 -> 888 cycles
      16x16: 3664 -> 3550 cycles
      32x32: 18134 -> 17392 cycles
      
      Change-Id: I37a51bfbb0060a3a54f09c6045c14a989811ed78
      e3ce2b2a
    • Ronald S. Bultje's avatar
      Some minor optimizations for cost_coeffs(). · 91d223bd
      Ronald S. Bultje authored
      Cycle timings for first 3 frames of bus (speed 0) at 1500kbps:
      4x4: 298 -> 234 cycles
      8x8: 1227 -> 878 cycles
      16x16: 23426 -> 18134 cycles
      32x32: 4906 -> 3664 cycles
      
      Total encode time of first 50 frames of bus @ 1500kbps (speed 0) goes
      from 3min0.7 to 2min51.6 seconds, i.e. 5.3% faster.
      
      Change-Id: I68a0e1b530b0563b84a67342cca4b45146077e95
      91d223bd
    • Ronald S. Bultje's avatar
      Make coefficient skip condition an explicit RD choice. · af660715
      Ronald S. Bultje authored
      This commit replaces zrun_zbin_boost, a method of biasing non-zero
      coefficients following runs of zero-coefficients to be rounded towards
      zero, with an explicit skip-block choice in the RD loop.
      
      The logic is basically that if individual coefficients should be rounded
      towards zero (from a RD point of view), the trellis/optimize loop should
      take care of it. If whole blocks should be zero (from a RD point of
      view), a single RD check is much more efficient than a complete
      serialization of the quantization loop.
      
      Quality change: derf +0.5% psnr, +1.6% ssim; yt +0.6% psnr, +1.1% ssim.
      SIMD for quantize will follow in a separate patch. Results for other
      test sets pending.
      
      Change-Id: Ife5fa641163ac5150ac428011e87188f1937c1f4
      af660715
    • Yaowu Xu's avatar
      Minor cleanups · 8b9eea0a
      Yaowu Xu authored
      Change-Id: I379617c1c731a686b3f7e032b8805860c1055b12
      8b9eea0a
    • Yaowu Xu's avatar
      Optimize partition search order · 1374a06b
      Yaowu Xu authored
      This commit change the partition search order to allow checking of
      rectangular partition to be done after square partitions. It also
      added a speed feature to skip rectangular partition check when
      NONE is better than SPLIT in RD sense.
      
      This feature roughly speed up encoder by 1.5X with loss on compression
      -0.91% on cif set
      -0.56% on stdhd set
      
      Change-Id: I0d2d06993041aa9ea9073fcc39c54f73a127dfa4
      1374a06b
    • Ronald S. Bultje's avatar
      Fix tile independence with both column tiling and static_thresh set. · fd4eed3b
      Ronald S. Bultje authored
      Change-Id: I0b2be0ec2c410a527f88b95a44f24ac967b2dac1
      fd4eed3b
  5. 27 Jun, 2013 5 commits
    • Dmitry Kovalev's avatar
      Decoder's code cleanup. · 3231da0a
      Dmitry Kovalev authored
      Using vp9_set_pred_flag function instead of custom code, adding
      decode_tokens function which is now called from decode_atom,
      decode_sb_intra, and decode_sb.
      
      Change-Id: Ie163a7106c0241099da9c5fe03069bd71f9d9ff8
      3231da0a
    • Frank Galligan's avatar
      Add Neon optimized loop filter functions. · 1d6dc1b7
      Frank Galligan authored
      - Added vp9_loop_filter_horizontal_edge_neon and
        vp9_loop_filter_vertical_edge_neon.
      - The functions are based off the vp8 loopfilter
        functions.
      - Matches x86 md5 checksum.
      
      Change-Id: Id1c4dddb03584227e5ecd29f574a6ac27738fdd0
      1d6dc1b7
    • Ronald S. Bultje's avatar
      Inline quantize so idiv instruction gets removed from inner loop. · 7a049be6
      Ronald S. Bultje authored
      Encoding time of first 50 frames of bus @ 1500kbps (speed 0) goes from
      3min15.0 to 3min10.9, i.e. 2.1% faster overall.
      
      Change-Id: If592ee99be09bcd34a7c8498347f44e7305e982c
      7a049be6
    • Jingning Han's avatar
      Make intra predictor reference buffer configurable · 861cb06c
      Jingning Han authored
      This commit enables configurable reference buffer pointer for intra
      predictor. This allows later removal of spatial dependency between
      blocks inside a 64x64 superblock in the rate-distortion optimization
      loop.
      
      Change-Id: I02418c2077efe19adc86e046a6b49364a980f5b1
      861cb06c
    • Jingning Han's avatar
      Make update_partition_context faster · 92479d95
      Jingning Han authored
      Use vpx_memset for updating the partition contexts. Thanks to Noah
      for pointing out the need of refactoring in this part.
      
      Change-Id: I67fb78429d632298f1cd8a0be346cc76f79392a6
      92479d95
  6. 26 Jun, 2013 7 commits