Commits · 0c39318a8bc549451fc7c128a844b5337db90af7 · Xiph.Org / aom-rav1e

Nov 06, 2013

Missing _ means no sse3 for vp9_h_predictor_32x32. · 0c39318a

Paul Wilkins authored 11 years ago

Error in script means vp9_h_predictor_32x32 sse3 version
is not enabled.

Change-Id: Ia43672740da1ecdfb7fcd420490ef424b04accc4

0c39318a

Nov 04, 2013

Splitting partition_probs array into two arrays. · dde8069e

Dmitry Kovalev authored 11 years ago

We only update partition_probs for inter frames but they are constant
for key frames. It is not necessary to have constants inside frame
context and copy them every time. This change reduces FRAME_CONTEXT size
by at least 48 bytes.


Change-Id: If70a53be51043f37fe7d113853217937710932a7

dde8069e

Remove unused member variables from VP9_COMP · a0a6590e

Adrian Grange authored 11 years ago

Removed three members from the VP9_COMP data structure:
inter_zz_count, gf_bad_count, gf_update_recommended.

These were part of the VP8 real-time mode implementation
that was removed from the initial VP9 codecbase.

Change-Id: I866b083b88ef02c74837277d50ce532ca88492f3

a0a6590e

Nov 03, 2013

Add second ref frame check back in rdcost hist · 2de7cbe9

Jingning Han authored 11 years ago

Update best_inter_rd and best_inter_ref_frame only in single ref
frame case.

Change-Id: Id56825b231a62d6852bd83811410c05a7569f715

2de7cbe9

Nov 02, 2013
- vp9 ssse3 d207_predictor_32x32: add missing GLOBAL() · 2d980b80
  James Zern authored 11 years ago
  
  removes a textrel for sh_b23456789abcdefff Change-Id: I80cb9dfd8e49a0fe884c8ff76472275b3a00cb57
  2d980b80
Nov 01, 2013

Removing 'new' probability calculation from convert_distribution(). · df19c6b6

Dmitry Kovalev authored 11 years ago

We don't have to calculate 'new' probability in convert_distribution()
because it is enough to calculate only 'new' counters which could be used
to calculate probability if necessary. That's why removing a lot of unused
temporary probability arrays and reducing number of get_binary_prob()
calls.

Change-Id: I4e14eb7203d1ace61bbddefd6b9b6326be83ba63

df19c6b6

Convert filter kernel choice to lookup · 0f76ba55

Yaowu Xu authored 11 years ago

Also removed unused declaration related 6 tap filter

Change-Id: Ic17f516141d885157918505f4204081e4c951fad

0f76ba55

Two optimizations: · a272530b

Yaowu Xu authored 11 years ago

1. Reduced the size memset based on eob for 32x32 transform. The reset
of non-zero coefficient should probably go into where they are read in
inverse transform functions. (TODO)
2. Removed a redundant level of indirection.
vp9_iht4x4_add() checks transform type and call vp9_iht4x4_16_add()
for tranforms other than DCT_DCT. In this case, the DCT_DCT case
has been already handled here.

Change-Id: Iacbc77da761f0b308df5acea0f20c9add9f33d20

a272530b

Oct 31, 2013

simplify read_coef_prob() · a49e77af
Yaowu Xu authored 11 years ago
```
Change-Id: I529c634db4f81ba5386092c126f53312b1e51b2b
```
a49e77af

Cleaning up read_skip_coeff() function. · 970eb39b

Dmitry Kovalev authored 11 years ago

Making code easier to read and avoiding calculation of skip context twice.

Change-Id: I42c376b1a1811bc842bf6420bf81d2de7a1bf980

970eb39b

Cleanup. Adding const to function pointer arguments. · 7c524bbe
Dmitry Kovalev authored 11 years ago
```
Change-Id: I12c67c8c0fa1aa7fb3f7d6cc2ef65be29c4ea292
```
7c524bbe

Reducing the number of foreach_transformed_block() calls. · 47b6030d

Dmitry Kovalev authored 11 years ago

The change doesn't affect the bitstream. It changes the order or function
calls and affects how we reconstruct intra- and inter-blocks. Speed up is
about 1...1.5%.

For intra-blocks:
  Before:
    for each transform block read tokens
    for each transform block do prediction
    for each transform block do inverse transform
  Now:
    for each transform block
      read tokens
      do prediction
      do inverse transform

For inter-blocks:
  Before:
    for each transform block read tokens
    for each transform block do inverse transform
  Now:
    for each transform block
      read tokens
      do inverse transform

Change-Id: I12a79bf1aa5a18c351b8010369bd3ff1deae1570

47b6030d

mb_lpf_horizontal_edge AVX2 optimization · 54f92056

Tamar Levy authored 11 years ago

This CL contains two AVX2 optimized loop filter functions,
mb_lpf_horizontal_edge_w_avx2_8 and mb_lpf_horizontal_edge_w_avx2_16.

Change-Id: I604e4fe6e99752b7800c2ea98721d97f7e0b931b

54f92056

Oct 30, 2013

Updates to 1-pass: · b26ce8b1

Marco Paniconi authored 11 years ago

   -Don't reduce maxQ for gold/alt in CBR mode.

   -Fix to min/maxQ for first/initial key frame.

   -Add more speeds to datarate test and reduce the starting bitrate for test.

Change-Id: Id2a333d76dd3f6a51b322ca984588e2a22159c58

b26ce8b1

Replacing (SWITCHABLE_FILTERS + 1) with SWITCHABLE_FILTER_CONTEXTS. · 6761872e
Dmitry Kovalev authored 11 years ago
```
Change-Id: I9781a62bc1a4cd9176554d1271d87dbcafda9cb0
```
6761872e

Enable all-zero coeff block index for sub8x8 blk · 8c8381d5

Jingning Han authored 11 years ago

This commit makes zcoeff_blk cache the case where the entire block
is quantized to be zero (without applying zero-forcing) in the rate-
distortion optimization loop, and skip the forward DCT, quantization,
inverse DCT, and reconstruction process in the encode_block stage.

It now works for all the block sizes, including sub8x8 blocks.

Change-Id: I5ae60a9c436ba3637d11666733554bec4580ef98

8c8381d5

Reducing the number of recursive calls. · 2901bf2d

Dmitry Kovalev authored 11 years ago

Both decode_modes_sb and decode_modes_b had conditions to immediately
return at the beginning. Eliminating these conditions here and calling
these functions only to do a real work. Also unrolling loop for
PARTITION_SPLIT.

Change-Id: I2fc41cb74ac491f045a2f04fe68d30ff4aaa555d

2901bf2d

vp9/decode: align tile worker data allocation · 54c2854f

James Zern authored 11 years ago

fixes a crash in assembly on 32-bit linux/windows

Change-Id: I0c27e6c0ece9732b5eb2ee5b59ff42c3c8016c50

54c2854f

Fix x_offset_q4/y_offset_q4 calculation · 9ed2d0a5

Yunqing Wang authored 11 years ago

"<< SUBPEL_BITS" needs to be added in the calculation. Call
set_scaled_offsets() to calculate x_offset_q4 and y_offset_q4.

Change-Id: Ied130ea771510e918f51cd1dc3abe57f4c0962b5

9ed2d0a5

vp9: add multi-threaded tile decoder · fb484524

James Zern authored 11 years ago

tiles are decoded in parallel within a single frame

Change-Id: I7aca87cb1c239b74eceef72bdc9f672faebac373

fb484524

vp9/decode: add get_tile() · 6b00202f

James Zern authored 11 years ago

factorizes the code in decode_tiles(). reading the offsets backwards
wasn't doing anything to prove tile independence

Change-Id: I0395d3c77205852ebdc55efedc68291e93cef85c

6b00202f

Oct 29, 2013

Adding const to vp9_quantize_b_{32x32,} parameters. · 065972f9
Dmitry Kovalev authored 11 years ago
```
Change-Id: I56f8c50ac382202f66040cd9cfaa05d889572fc7
```
065972f9
CL for adding AVX-AVX2 support in libvpx. · e6863ef3
Erik Niemeyer authored 11 years ago
```
Change-Id: Idc03f3fca4bf2d0afd33631ea1d3caf8fc34ec29
```
e6863ef3

Fixing clang warning. · cd94eee4

Dmitry Kovalev authored 11 years ago

Warning was: "implicit conversion from enumeration type 'VPX_SCALING_MODE'
(aka 'enum vpx_scaling_mode_1d') to different enumeration type
'VPX_SCALING'".

Change-Id: I45689e439a8775bc1e7534d0ea1ff7c729f2c7f5

cd94eee4

vp9_decodframe.c: use vpx_memset instead of cast · dc799a87
Johann Koenig authored 11 years ago
```
Fix warning with -Wstrict-aliasing=1

Change-Id: Idfac09be1ab328923883e63436577f1018c895b8
```
dc799a87

Fixing wrongly initialized tx_type variable. · e6dcf2ae

Dmitry Kovalev authored 11 years ago

Wrong value was used in get_tx_type_4x4() function, so making
initialization before that call.

Change-Id: Ief30bb1e0c03b2f23d993bbf9ae18d7150ba9a83

e6dcf2ae

Correct handling of show_bit in uncompressed header. · 156de9c3

Dmitry Kovalev authored 11 years ago

"keyframe" variable in the current code actually means that previous
frame is a keyframe because cm->frame_type has not been initialized
in read_uncompressed_header.

Change-Id: I5645b0816c70abdef5dfc70113018d06276dac77

156de9c3

vp9_decode_frame: group assignments/setup calls · d39f279d

James Zern authored 11 years ago

group error checking at the top followed by allocations, setup then
decode.

Change-Id: I877d21326bb767885520511ecea70e5fd1e28054

d39f279d

Removing is_intra_mode() function. · aa76cd1e

Dmitry Kovalev authored 11 years ago

It is enough to check just block type: intra or inter. Intra block implies
intra prediction mode, and inter block implies inter mode.

Change-Id: I3cf98731a3935f670a3cd8e2b2443483eb944be4

aa76cd1e

Making get_tx_counts() similar to get_tx_probs(). · fa1ac00a
Dmitry Kovalev authored 11 years ago
```
Change-Id: I5b17f40e515c4bcf9ebef5380270a214af4e0115
```
fa1ac00a

Oct 28, 2013

Adding {read, write}_partition() instead of check_bsize_coverage(). · 19cf72ed
Dmitry Kovalev authored 11 years ago
```
Making partition read/write logic more clear.

Change-Id: I1981e90327257d37095567c62d72a103cda1da33
```
19cf72ed

Cleaning up vp9_regular_quantize_b_4x4. · 8253532c

Dmitry Kovalev authored 11 years ago

Passing scan & iscan as parameters, adding useful local variables.

Change-Id: Ia2a87906941db9557350d273669ce5c3cdb7235d

8253532c

vp9: add TileInfo · 58a0f6db

James Zern authored 11 years ago

replaces use of cur_tile_mi_(row|col)_(start|end) by VP9_COMMON, making
it less stateful and more reusable for parallel tile decoding

Change-Id: I1df09382b4567a0e5f4434825d47c79afe2399be

58a0f6db

vp9_decodframe: limit scope of private function params (2) · f0eabfd4

James Zern authored 11 years ago

replace VP9D_COMP usage with the (slightly) more targeted
VP9_COMMON/MACROBLCKD structures.

Change-Id: Ifdd9034f44d69eb94e232dd03c922de763b96a30

f0eabfd4

vp9: remove partition+entropy contexts from common · 7b9ca3ca

James Zern authored 11 years ago

these are now handled separately by the encoder and decoder

Change-Id: If9b16f7d734e992fb94a510a6d88f2690d7fb7cb

7b9ca3ca

vp9: add above/left_context to MACROBLOCKD · e571d3ba
James Zern authored 11 years ago
```
Change-Id: I75aab21c1692cbad717564cbb436578fddbc348d
```
e571d3ba
vp9: add above/left_seg_context to MACROBLOCKD · d9a317c8
James Zern authored 11 years ago
```
Change-Id: I9cbb768c5f857a096cf6c29d6755d0e5e6728435
```
d9a317c8

Oct 26, 2013

vp9 decode: defer loop filter allocation · 8f177bb0

James Zern authored 11 years ago

wait until do_loopfilter_inline is true before committing the resources

Change-Id: I01661bd40599b47362bb3fb534668471f2a9d8d7

8f177bb0

Adding fht{4x4, 8x8, 16x16} functions. · ae2f732e

Dmitry Kovalev authored 11 years ago

Adding these functions to encapsulate tx_type check. Changing TX_TYPE to
int to match the declaration in vo9_rtch.h.

Change-Id: I6f3a2df6e35595ca73b6aaa9e3909ee7bc3fd16f

ae2f732e

Oct 25, 2013

Rewrite loop_filter_info_n struct · 00dbd369

Yunqing Wang authored 11 years ago

Restructured the storing of loopfilter information. Deleted
loop_filter_info struct and reduced copying happened in every
superblock.

Tests showed a 0.5% ~ 0.8% decoder speed gain.

Change-Id: Ie6a8e46bae71dc3a3cd8c6054f5de540b8e0ef5e

00dbd369