  1. Aug 20, 2013
  2. Aug 19, 2013
    • Moving plane_block_idx from vp9_blockd.h to vp9_quantize.c. · 569ca37d
      Dmitry Kovalev authored
      Change-Id: Ib8af21f2e7f603c2fb407e5d15a3bba64b545b49
    • Enable early termination in uv rd loop · 3275ad70
      Jingning Han authored
      This commit enables early termination in the rate-distortion
      optimization search loop for chroma components. When the cumulative
      rd cost exceeds the current best value, skip the remaining per-block
      transform, quantization, and coefficient-cost computation and continue
      to the next prediction mode.
      
      For bus_cif at 2000 kbps, the average run-time goes down from
        168546 ms -> 164678 ms (2% speed-up) at speed 0
         36197 ms ->  34465 ms (4% speed-up) at speed 1
      
      Change-Id: I9d3043864126e62bd0166250d66b3170d520b3c0
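The early-termination idea in the commit above can be sketched as follows. This is a minimal illustration, not the libvpx code: the `BlockCost` struct, the `rd_cost` helper (a simplified `lambda * rate + distortion`), and `accumulate_rd` are hypothetical names introduced here.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical per-block result; stands in for the outcome of the
 * per-block transform, quantization, and coefficient costing. */
typedef struct {
  int64_t rate;
  int64_t dist;
} BlockCost;

/* Simplified rate-distortion cost (an assumption for this sketch,
 * not libvpx's RDCOST macro). */
static int64_t rd_cost(int64_t lambda, int64_t rate, int64_t dist) {
  return lambda * rate + dist;
}

/* Accumulates rate and distortion block by block. Returns 0 as soon as
 * the running rd cost exceeds best_rd, so the caller can move on to the
 * next prediction mode. Returns 1 and writes *total if every block was
 * processed. *visited reports how many blocks were actually costed. */
static int accumulate_rd(const BlockCost *blocks, size_t n, int64_t lambda,
                         int64_t best_rd, int64_t *total, size_t *visited) {
  int64_t rate = 0;
  int64_t dist = 0;
  size_t i;
  for (i = 0; i < n; ++i) {
    rate += blocks[i].rate;
    dist += blocks[i].dist;
    if (rd_cost(lambda, rate, dist) > best_rd) {
      *visited = i + 1; /* early exit: remaining blocks are skipped */
      return 0;
    }
  }
  *visited = n;
  *total = rd_cost(lambda, rate, dist);
  return 1;
}
```

The saving comes from the blocks that are never visited: once a mode is known to be worse than the current best, the rest of its per-block work cannot change the decision.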
    • Using plane_bsize instead of bsize. · 2e3478a5
      Dmitry Kovalev authored
      This change set is intermediate. The next one will remove all repeated
      plane_bsize calculations, because plane_bsize will be passed as an
      argument to foreach_transformed_block_visitor.
      
      Change-Id: Ifc12e0b330e017c6851a28746b3a5460b9bf7f0b
    • Fix potential use of uninitialized value · b34ce043
      Jingning Han authored
      Initialize the best mode and tx_size values in the rate-distortion
      optimization search loop.
      
      Change-Id: Ibfb5c0895691f172abcd4265c23aef4cb99fa8af
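To illustrate why the initialization in the commit above matters, here is a small sketch under stated assumptions. The `Candidate` struct and `pick_best` function are hypothetical, and the default value 0 for mode and transform size is chosen only for illustration. If no candidate beats the incoming bound, the outputs are still well defined because they are set before the loop.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical search candidate for this sketch. */
typedef struct {
  int mode;
  int tx_size;
  int64_t rd;
} Candidate;

/* Picks the candidate with the lowest rd cost below best_rd_so_far.
 * The outputs are initialized up front, so they never hold garbage
 * when the loop finds no improvement. */
static void pick_best(const Candidate *c, size_t n, int64_t best_rd_so_far,
                      int *best_mode, int *best_tx_size) {
  int64_t best_rd = best_rd_so_far;
  size_t i;
  *best_mode = 0;    /* assumed default mode */
  *best_tx_size = 0; /* assumed default transform size */
  for (i = 0; i < n; ++i) {
    if (c[i].rd < best_rd) {
      best_rd = c[i].rd;
      *best_mode = c[i].mode;
      *best_tx_size = c[i].tx_size;
    }
  }
}
```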
  3. Aug 17, 2013
    • Fix the returned distortion value in rd_pick_intra · f67919ae
      Jingning Han authored
      Return the distortion value in vp9_rd_pick_intra_mode_sb as the sum of
      dist_y and dist_uv. Remove the right shift on dist_uv so that the result
      is consistent with that of vp9_rd_pick_inter_mode_sb.
      
      Change-Id: I9d564e242d9add38e32595d33b0e0dddb1d55e5b
  4. Aug 16, 2013
  5. Aug 15, 2013
    • Added resizing & initialization of last frame segment map · d5bec522
      Adrian Grange authored
      When the frame size changes, the last frame segment map must
      be resized to match and initialized to 0.
      
      Change-Id: Idc10de109f55dbe9af3a6caae355a2974712243d
    • Refactor rd loop for chroma components · 68369ca8
      Jingning Han authored
      This commit makes the rate-distortion optimization search of chroma
      components consistent across all block sizes and removes redundant
      code.
      
      Change-Id: I7e76f54d045e8efdd41d84a164c71f55b484471b
    • Converting code from using ss_txfrm_size to tx_size. · bb3b817c
      Dmitry Kovalev authored
      Updated function signatures:
        txfrm_block_to_raster_block
        txfrm_block_to_raster_xy
        extend_for_intra
        vp9_optimize_b
      
      Change-Id: I7213f4c4b1b9ec802f90621d5ba61d5e4dac5e0a
    • Using { 0 } for initialization instead of memset. · 6f4fa44c
      Dmitry Kovalev authored
      Change-Id: I4fad357465022d14bfc7e13b348c6da267587314
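The two zero-initialization styles named in the commit above can be compared side by side. This is a generic C sketch; `ExampleStats` is a hypothetical struct, not a libvpx type.

```c
#include <string.h>

/* Hypothetical struct used only for this comparison. */
typedef struct {
  int rate;
  int dist;
  int skip;
  unsigned char ctx[4];
} ExampleStats;

/* Older style: declare first, then clear every byte with memset. */
static ExampleStats zero_with_memset(void) {
  ExampleStats s;
  memset(&s, 0, sizeof(s));
  return s;
}

/* Initializer style: the first member is set to 0 explicitly and all
 * remaining members are zero-initialized by the language rules. */
static ExampleStats zero_with_initializer(void) {
  ExampleStats s = { 0 };
  return s;
}
```

Both leave every member zero. The initializer form keeps declaration and initialization in one place and cannot be given a wrong size argument. The one difference is that `memset` also clears padding bytes, which `{ 0 }` does not guarantee.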
    • Renaming d27 predictor to d207. · 81d7bd50
      Dmitry Kovalev authored
      The 27-degree intra predictor is actually a 207-degree predictor, so rename it.
      
      Change-Id: Ife96a910437eb80ccdc0b7a5b7a62c77542ae5be
    • vp9: neon: optimise vp9_wide_mbfilter_neon · 67e53716
      Mans Rullgard authored
      Break up long dependency chains to improve instruction scheduling.
      
      Change-Id: I0e0cb66943df24af920767bb4167b25c38af9630
    • Moving segmentation struct from MACROBLOCKD to VP9_COMMON. · b7616e38
      Dmitry Kovalev authored
      VP9_COMMON is the right place for the segmentation struct because it
      holds global segmentation parameters, not something specific to
      macroblock processing.
      
      Change-Id: Ib9ada0c06c253996eb3b5f6cccf6a323fbbba708
    • Remove unused RDCOST_8X8 macro · b0646f9e
      Jingning Han authored
      Change-Id: I17c7d7eaa60fe69c543403c340f7c1078bfd339f
    • Speed feature to skip split partition based on var · 24856b6a
      Deb Mukherjee authored
      Adds a speed feature to disable split partition search based on a
      given threshold on the source variance. A tighter threshold derived
      from the threshold provided is used to also disable horizontal and
      vertical partitions.
      
      Results on derfraw300:
      threshold = 16, psnr = -0.057%, speedup ~1% (football)
      threshold = 32, psnr = -0.150%, speedup ~4-5% (football)
      threshold = 64, psnr = -0.570%, speedup ~10-12% (football)
      
      Results on stdhdraw250:
      threshold = 32, psnr = -0.18%, speedup is somewhat larger than on derf
      because of a larger number of smoother blocks at higher resolution.
      
      Based on these results, a threshold of 32 is chosen for speed 1,
      and a threshold of 64 is chosen for speeds 2 and above.
      
      Change-Id: If08912fb6c67fd4242d12a0d094783a99f52f6c6
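The threshold selection described in the commit above can be sketched as follows. Several details are assumptions made for illustration only: the names `PartitionSearch`, `split_var_threshold`, and `choose_partition_search` are hypothetical, the feature is assumed to be off at speed 0, the comparison is assumed to be "variance below threshold", and the tighter threshold is assumed to be half the given one (the commit message does not say how it is derived).

```c
/* Which partition types remain in the search (hypothetical struct). */
typedef struct {
  int do_split; /* 1 if split partitions are still searched */
  int do_rect;  /* 1 if horizontal/vertical partitions are still searched */
} PartitionSearch;

/* Thresholds from the commit message: 32 at speed 1, 64 at speed 2 and
 * above. Returning 0 at speed 0 (feature disabled) is an assumption. */
static unsigned int split_var_threshold(int speed) {
  if (speed <= 0) return 0;
  return speed == 1 ? 32u : 64u;
}

/* Disables split search for low-variance (smooth) source blocks, and
 * also disables rectangular partitions under a tighter threshold. */
static PartitionSearch choose_partition_search(unsigned int source_var,
                                               int speed) {
  const unsigned int thresh = split_var_threshold(speed);
  PartitionSearch ps = { 1, 1 };
  if (thresh == 0) return ps;
  if (source_var < thresh) ps.do_split = 0;
  if (source_var < thresh / 2) ps.do_rect = 0; /* assumed derivation */
  return ps;
}
```

Under these assumptions, a block with source variance 20 at speed 1 skips the split search but still tries horizontal and vertical partitions, while a block with variance 10 skips both.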
    • Unify luma and chroma rd-cost estimation · ec01f52f
      Jingning Han authored
      This commit unifies the rate-distortion cost calculation process of
      luma and chroma components. It allows early termination to be enabled
      later in the rd search loop of chroma components, consistent with the
      handling of luma.
      
      Change-Id: I2e52a7c6496176bf2a5e3ef338d34ceb8aad9b3d
    • vp9_dx_iface: check for NULL/0-size input · 20395189
      James Zern authored
      avoids a crash caused by issue #585
      
      Change-Id: I301595ee0227699b0da6f0dad6d870dd546e94ef
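The kind of guard the commit above describes can be sketched like this. It is not the actual vp9_dx_iface code: `ExStatus`, `EX_INVALID_PARAM`, and `example_decode` are hypothetical names, and the error code returned by the real decoder is not stated in the commit message.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical status codes for this sketch. */
typedef enum {
  EX_OK = 0,
  EX_INVALID_PARAM = 1
} ExStatus;

/* Rejects a NULL buffer or a zero-length buffer before any byte is read,
 * so the decode path never dereferences invalid input. */
static ExStatus example_decode(const uint8_t *data, size_t data_sz) {
  if (data == NULL || data_sz == 0) return EX_INVALID_PARAM;
  /* ... actual decoding would start here ... */
  (void)data[0]; /* safe: at least one byte is known to exist */
  return EX_OK;
}
```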
  6. Aug 14, 2013
  7. Aug 13, 2013
  8. Aug 12, 2013
    • SSE2 high precision 32x32 forward DCT · 78136edc
      Jingning Han authored
      Enable the SSE2 implementation of the high precision 32x32 forward DCT.
      The intermediate stacks are 32 bits wide. The run-time goes down from
      32126 cycles to 13442 cycles.
      
      Change-Id: Ib5ccafe3176c65bd6f2dbdef790bd47bbc880e56