    • More optimizations for cost_coeffs(). · e20fcd95
      Ronald S. Bultje authored
      4x4:    163 ->  123 cycles (33% faster)
      8x8:    491 ->  399 cycles (23% faster)
      16x16: 1889 -> 1763 cycles (7% faster)
      32x32: 8311 -> 8180 cycles (1.6% faster)
      Overall encoding time of first 50 frames of bus (speed 0) @ 1500kbps
      goes from 1min4.33 to 1min3.00, i.e. 2.11% faster.
      Change-Id: Ib52d1dbb5649b14de769d3e7a74af67440b5284f
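The percentages quoted in this commit match a throughput ratio (old / new - 1), not a reduction in cycle count. The sketch below shows both readings so the figures can be rechecked. The function names are hypothetical and are not taken from the codebase.

```c
#include <assert.h>

/* Speed-up as a throughput ratio: how much more work fits in the same time.
 * 163 -> 123 cycles gives about 32.5%, which the commit rounds to 33%. */
double speedup_percent(double old_cost, double new_cost) {
  assert(new_cost > 0.0);
  return (old_cost / new_cost - 1.0) * 100.0;
}

/* Alternative reading: fraction of the original cost that was removed.
 * 163 -> 123 cycles gives about 24.5% under this definition. */
double cost_saved_percent(double old_cost, double new_cost) {
  assert(old_cost > 0.0);
  return (1.0 - new_cost / old_cost) * 100.0;
}
```

Applied to the encode times, `speedup_percent(64.33, 63.00)` gives roughly 2.11, which agrees with the overall figure reported in the commit message.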
    • vp9: apply loopfilter inline if possible · 38a4412e
      James Zern authored
      excludes tiled content currently
      Change-Id: I44155253e8d6771e5e039d663be5f21cc9d0355d
    • Adding update_tx_counts function. · b2fc6fa9
      Dmitry Kovalev authored
      Moving common encoder/decoder code to update_tx_counts. Also renaming
      vp9_get_pred_probs_tx_size to get_tx_probs2 and adding get_tx_probs to
      call vp9_get_pred_context_tx_size inside read_selected_tx_size only once
      (twice before).
      Change-Id: Ia50247f3893de88ef8e9041b0d44be44a40aaa4d
    • VP9_COMMON: remove unused temp_scale_frame · 0a58f462
      James Zern authored
      Change-Id: I696a0dca1d02d365e283029d1d077710bd5680e0
    • VP9_COMMON: drop cur_tile_{row,col}_idx · ccf6710d
      James Zern authored
      these were only being written in one location and never read.
      Change-Id: If59f3c09aa1485cf89bac0099a8a79e99688b5d1
    • Using update_ct and update_ct2 functions for probability update. · 8c5ca9ff
      Dmitry Kovalev authored
      Update logic for both mode and mvref was the same, so using MODE_COUNT_SAT,
      MODE_MAX_UPDATE_FACTOR, update_ct, update_ct2 for both cases. Removing
      function update_tx_ct because it was identical to update_mode_ct2.
      Change-Id: Iff566be27dbd6cde4c2ec04e8d988f207046b8f0
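As an illustration of why one helper can serve both the mode and the mvref case, here is a minimal sketch of a count-saturated probability update. It assumes the update blends the previous probability with the one implied by the observed counts, with a weight that grows with the number of observations up to a saturation point. All names and constant values here are hypothetical and are not the actual `update_ct` or `update_ct2` code.

```c
#include <assert.h>

typedef unsigned char sketch_prob; /* 8-bit probability in [1, 255] */

/* Illustrative values only; the real constants are defined in the codec. */
#define SKETCH_COUNT_SAT 20
#define SKETCH_MAX_UPDATE_FACTOR 128

/* Probability of the "0" branch implied by two counts, clamped to [1, 255]. */
sketch_prob sketch_binary_prob(unsigned int n0, unsigned int n1) {
  const unsigned int den = n0 + n1;
  unsigned int p;
  if (den == 0) return 128; /* no observations: fall back to one half */
  p = (n0 * 256 + den / 2) / den;
  if (p < 1) p = 1;
  if (p > 255) p = 255;
  return (sketch_prob)p;
}

/* Blend pre_prob toward the observed probability. The blend weight scales
 * with the observation count and stops growing once count_sat is reached. */
sketch_prob sketch_merge_prob(sketch_prob pre_prob, const unsigned int ct[2],
                              unsigned int count_sat,
                              unsigned int max_update_factor) {
  assert(count_sat > 0 && max_update_factor <= 256);
  {
    const unsigned int new_prob = sketch_binary_prob(ct[0], ct[1]);
    const unsigned int total = ct[0] + ct[1];
    const unsigned int count = total < count_sat ? total : count_sat;
    const unsigned int factor = max_update_factor * count / count_sat;
    return (sketch_prob)((pre_prob * (256 - factor) + new_prob * factor + 128) >> 8);
  }
}
```

Because the saturation count and maximum update factor are parameters, two call sites that use the same constants can share the function, which is the kind of consolidation the commit describes.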
    • Optimize operation flow in sub8x8 rd loop · 409e77f2
      Jingning Han authored
      Stack the rate-distortion statistics in the sub8x8 rd loop. This allows
      the encoder to skip the forward transform, quantization, and coeff cost
      estimation, in the sub8x8 rd optimization search, if the motion
      vector(s) are of integer pixel value, and have been tested in the
      previous prediction filter type rd loops of the same block.
      This gives about 2% speed-up for bus_cif at 2000 kbps, for speed 0.
      Its efficacy depends on how frequently the motion search selects an
      integer motion vector.
      Change-Id: Iee15d4283ad4adea05522c1d40b198b127e6dd97
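The reuse described in this commit can be sketched as a small per-block cache. The sketch assumes an integer-pel motion vector yields the same prediction under every interpolation filter type, so the rate and distortion from an earlier filter pass still apply. The types, names, and 1/8-pel motion vector units are hypothetical and do not come from the encoder source.

```c
#include <assert.h>

typedef struct { int row, col; } sketch_mv; /* assumed 1/8-pel units */

typedef struct {
  int valid;      /* nonzero once stats have been stored */
  sketch_mv mv;   /* motion vector the stats were computed for */
  int rate;       /* coefficient cost estimate */
  long long dist; /* distortion after transform and quantization */
} sketch_rd_entry;

/* Integer-pel when both components have no fractional bits. */
int sketch_is_integer_mv(sketch_mv mv) {
  return ((mv.row | mv.col) & 7) == 0;
}

/* Record stats produced by a full transform/quantize/cost pass. */
void sketch_rd_store(sketch_rd_entry *e, sketch_mv mv, int rate, long long dist) {
  assert(e);
  e->valid = 1;
  e->mv = mv;
  e->rate = rate;
  e->dist = dist;
}

/* Returns 1 and fills rate/dist when an earlier filter pass already tested
 * this integer motion vector; returns 0 when the full pass is still needed. */
int sketch_rd_lookup(const sketch_rd_entry *e, sketch_mv mv,
                     int *rate, long long *dist) {
  assert(e && rate && dist);
  if (!e->valid || !sketch_is_integer_mv(mv)) return 0;
  if (e->mv.row != mv.row || e->mv.col != mv.col) return 0;
  *rate = e->rate;
  *dist = e->dist;
  return 1;
}
```

A caller would try `sketch_rd_lookup` first and run the forward transform, quantization, and cost estimation only on a miss. This is why the gain depends on how often the motion search lands on integer vectors.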
