Commits · a129905c705b5ee7b84b07fcb76c3c75c2e864c5 · Xiph.Org / aom-rav1e

Mar 14, 2016
- Fix typos in unit tests · b1a38715
  Hui Su authored 9 years ago
  
  Change-Id: Idff52b337ab2d494c0c26e0d2c71ab3ee8208691
  b1a38715
Mar 08, 2016

Implemented DST 16x16 SSE2 intrinsics optimization · 50a164a1

Yi Luo authored 9 years ago

- Implemented fdst16_sse2(), fdst16_8col() against C version: fdst16().
- Turned on 7 DST related hybrid txfm types in vp10_fht16x16_sse2().
- Replaced vp10_fht10x10_c() with vp10_fht16x16_sse2() in
  fwd_txfm_16x16().
- Added vp10_fht16x16_sse2() unit test against C version:
  vp10_fht16x16_c() (--gtest_filter=*VP10Trans16x16*).
- Unit test passed.
- Speed improvement: 2.4%, 3.2%, 3.2%, for city_cif.y4m, garden_sif.y4m,
  and mobile_cif.y4m.

Change-Id: Ib30a67ce5d5964bef143d588d0f8fa438be8901f

50a164a1

Mar 07, 2016

Added vp10_fht8x8_sse2() unit test · 6ab06212

Yi Luo authored 9 years ago

- Inherited base class TransformTestBase to derived class VP10Trans8x8HT.
- Employed RunCoeffCheck() to test vp10_fht8x8_sse2() against C reference
  function vp10_fht8x8_c().
- fdst8_sse2() related seven hybrid transform cases are covered in this
  test.
- Test passed (4 test cases w/o EXT_TX; 16 test cases with EXT_TX).

Change-Id: Id9a9b308c707164a120d9ceb2c30e572026fb1d0

6ab06212

Extend convolution functions to 128x128 for ext-partition. · 938b8dfc
Geza Lore authored 9 years ago
```
Change-Id: I7f7e26cd1d58eb38417200550c6fbf4108c9f942
```
938b8dfc

Mar 04, 2016

Added vp10_fht4x4_sse2() unit test · 267f73a1

Yi Luo authored 9 years ago

Inherited class TransformTestBase to derived class VP10Trans4x4HT.
Employed RunCoeffCheck() to test vp10_fht4x4_sse2() against
C reference vp10_fht4x4_c().
fdst4_sse2() related seven hybrid transform cases are covered
 in this test.
Wrote a header file for test base class. Some modification to
make sure the base class can be used for 8x8, 16x16, 32x32 cases.
All related tests passed.

Change-Id: I6b19a39d3ea30b657847781e78e73b829998a57a

267f73a1

Mar 03, 2016

Add 128 pixel variance and SAD functions · 697bf5be
Geza Lore authored 9 years ago
```
Change-Id: I8fde245b32c9e586683a28aa6925da0b83850b39
```
697bf5be

ANS: Switch from PDFs to CDFs. · 6bbbe316

Aℓex Converse authored 9 years ago

Make the RANS implementation operate on cumulative distribution
functions rather than individual probability distribution functions.
CDFs have shown themselves more flexible to work with.

Reduces decoding memory usage from scaling O(num_distributions *
symbol_resolution) to O(num_distributions).

No bitstream change. This is an purely implementation change.

Change-Id: I4e18d3a0a3d37a36a61487c3d778f9d088b0b374

6bbbe316

Mar 02, 2016

Adds masked variance and sad functions for wedge · 1d69ceee

Deb Mukherjee authored 9 years ago

Adds masked variance and sad functions needed for wedge
prediction modes to come.

Change-Id: I25b231bbc345e6a494316abb0a7d5cd5586a3a54

1d69ceee

Feb 26, 2016

Some refactoring and cleanups of interp filter · bab2912b

Deb Mukherjee authored 9 years ago

Includes various cosmetic changes and refactoring including
naming the sharp filters differently (since they are no longer
8-tap).

Change-Id: Ida5a19ca0daa9f6a64a6734394c685b2a4a2564a

bab2912b

Feb 25, 2016

convolve8 sse2 test · 8878fa4f

Angie Chiang authored 9 years ago

This experiment shows that when frame size is 64x64
vpx_highbd_convolve8_sse2 and vpx_convolve8_sse2's speed are similar.
However when frame size becomes 1024x1024
vpx_highbd_convolve8_sse2 is around 50% slower than vpx_convolve8_sse2
we think the bottleneck is from memory IO

VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_8_64
VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_8_64 (17 ms)
VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_16_64
VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_16_64 (42 ms)
VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_32_64
VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_32_64 (139 ms)
VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_64_64
VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_64_64 (499 ms)

VP10ConvolveTest.vpx_convolve8_sse2_speed_l_8_64
VP10ConvolveTest.vpx_convolve8_sse2_speed_l_8_64 (16 ms)
VP10ConvolveTest.vpx_convolve8_sse2_speed_l_16_64
VP10ConvolveTest.vpx_convolve8_sse2_speed_l_16_64 (40 ms)
VP10ConvolveTest.vpx_convolve8_sse2_speed_l_32_64
VP10ConvolveTest.vpx_convolve8_sse2_speed_l_32_64 (130 ms)
VP10ConvolveTest.vpx_convolve8_sse2_speed_l_64_64
VP10ConvolveTest.vpx_convolve8_sse2_speed_l_64_64 (485 ms)

VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_8_1024
VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_8_1024 (32 ms)
VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_16_1024
VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_16_1024 (61 ms)
VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_32_1024
VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_32_1024 (196 ms)
VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_64_1024

VP10ConvolveTest.vpx_highbd_convolve8_sse2_speed_64_1024 (694 ms)
VP10ConvolveTest.vpx_convolve8_sse2_speed_l_8_1024
VP10ConvolveTest.vpx_convolve8_sse2_speed_l_8_1024 (21 ms)
VP10ConvolveTest.vpx_convolve8_sse2_speed_l_16_1024
VP10ConvolveTest.vpx_convolve8_sse2_speed_l_16_1024 (44 ms)
VP10ConvolveTest.vpx_convolve8_sse2_speed_l_32_1024
VP10ConvolveTest.vpx_convolve8_sse2_speed_l_32_1024 (138 ms)
VP10ConvolveTest.vpx_convolve8_sse2_speed_l_64_1024
VP10ConvolveTest.vpx_convolve8_sse2_speed_l_64_1024 (491 ms)

Change-Id: I3131a031e0380e8eae748cfcccc6cbb961d05943

8878fa4f

Feb 24, 2016

Add test for screen content coding tools in end to end test · 827e1b3f

Hui Su authored 9 years ago

Test screen content coding tools (currently only palette) at
speed 1 and two-pass.

Change-Id: I3c467aee1cd9c366c65a3abfdccfafa0416b59b7

827e1b3f

Feb 23, 2016
- Extend vpxssim to handle more HBD combinations · eeaf8e6b
  Yaowu Xu authored 9 years ago
  
  Change-Id: I38426d946b74c9090a265d34b89e2db6693927c2
  eeaf8e6b
Feb 22, 2016

Cleanup psnr.h · 38cfc45e
Yaowu Xu authored 9 years ago
```
Change-Id: Id026e72ee655ee5bd645a89e378da0d462be367d
```
38cfc45e

Add shift stage in FASTSSIM computation · d1c5cd4a

Yaowu Xu authored 9 years ago

This commits adds a shift stage for FASTSSIM computaton when source
bit depth is different from working bit depth, to make sure metric
results are calculated in bit_depth consistent with source.

Change-Id: I997799634076ef7b00fd051710544681ed536185

d1c5cd4a

Move psnrhvs function declaration to psnr.h · 6e695da2
Yaowu Xu authored 9 years ago
```
From "ssim.h"

Change-Id: Ie53378794149ef8a844b4eb47ad4f08579de4b60
```
6e695da2

Feb 21, 2016

Extend HBDMetricTest · f6a7b17a

Yaowu Xu authored 9 years ago

This commit extends the HBDMetricTests to handle testing for metric
computation where input source depth is different from working bit
depth.

Change-Id: I5d11101cc9603a3fd09e8439816bb982a0f1b654

f6a7b17a

Feb 20, 2016

Fix 12 TAP convolution bug · 1e403064

Angie Chiang authored 9 years ago

Priviously, we do 12-tap interpolation even there is no sub pixel,
This could cause a bug becuase decoder doesn't extend border when there
is no sub pixel. In this situation, if we still do interpolation, we
will access the border extension which doesn't exist and cause a
memory error

Change-Id: I55b879722f0a10c5d13261bd9617a75c826a2418

1e403064

Feb 17, 2016
- Add tests for Highbitdepth PSNR metric computations · 9fb593d0
  Yaowu Xu authored 9 years ago
  
  Change-Id: I07324155f73bbdbe25bb7a7ccd587ebf9010ac7a
  9fb593d0
- lpf_8_test: remove unneeded function wrapper · 3ea537c0
  James Zern authored 9 years ago
  
  the count parameter has been removed from all loopfilter functions Change-Id: I87ba72006b59c65c46ca40bcb1c29171dfe0598a
  3ea537c0
- split vpx_highbd_lpf_horizontal_16 in two · 9b44d9d0
  James Zern authored 9 years ago
  
  replace with vpx_highbd_lpf_horizontal_edge_16 and vpx_highbd_lpf_horizontal_edge_8 to avoid passing a count parameter Change-Id: I551f8cec0fce57032cb2652584bb802e2248644d
  9b44d9d0
- split vpx_lpf_horizontal_16 in two · 1b519fb6
  James Zern authored 9 years ago
  
  replace with vpx_lpf_horizontal_edge_16 and vpx_lpf_horizontal_edge_8 to avoid passing a count parameter Change-Id: I848c95c02a3c6ebaa6c2bdf0983dce05cd645271
  1b519fb6
- vpx_highbd_lpf_horizontal_4: remove unused count param · e7a23d70
  James Zern authored 9 years ago
  
  Change-Id: I655a771e1b1a8753be5669ef9348a312ba6cfdbc
  e7a23d70
- vpx_highbd_lpf_horizontal_8: remove unused count param · 51718573
  James Zern authored 9 years ago
  
  Change-Id: Iaca71ea3796115d4c2d43563b4e6f3914e21f1bf
  51718573
- vpx_highbd_lpf_vertical_4: remove unused count param · 3c1019e4
  James Zern authored 9 years ago
  
  Change-Id: Ic6da723c5cf3cd8127db1f476c3e46ea134cb774
  3c1019e4
- vpx_highbd_lpf_vertical_8: remove unused count param · 72a9f06a
  James Zern authored 9 years ago
  
  Change-Id: Id16f7259897654831d31642c2d5e0bbe5e13416c
  72a9f06a
- vpx_lpf_horizontal_4: remove unused count param · b1e97c6a
  James Zern authored 9 years ago
  
  Change-Id: Iec7d8eda343991f7d7d46931dca17af23c821d11
  b1e97c6a
- vpx_lpf_horizontal_8: remove unused count param · bd5a5bb5
  James Zern authored 9 years ago
  
  Change-Id: I48741e167a7b09b7c9ad3bfc1c4b88ef1029ae46
  bd5a5bb5
Feb 16, 2016
- vpx_lpf_vertical_4: remove unused count param · 109a47b3
  James Zern authored 9 years ago
  
  Change-Id: I43a191cb3d42e51e7bca266adfa11c6239a8064c
  109a47b3
- vpx_lpf_vertical_8: remove unused count param · 37225744
  James Zern authored 9 years ago
  
  Change-Id: Ic69406da00afb0f06588e8c0deb2b043952b078c
  37225744
- lpf_8_test: add missing dspr2 tests · 47dee375
  James Zern authored 9 years ago
  
  Change-Id: I3954ff86ec1965cd6d4eec570c2d1993538d9c11
  47dee375
- lpf_8_test: add missing vpx_lpf_horizontal_4 tests · 4fec4a8e
  James Zern authored 9 years ago
  
  mmx, msa Change-Id: Ia9604adcdcc77411f383e081e01a18d232c9d992
  4fec4a8e
- lpf_8_test: add missing vpx_lpf_vertical_4 tests · c3f2c8ad
  James Zern authored 9 years ago
  
  mmx, msa Change-Id: I113ce0ec144ee673d5dcde4c03fe7670f9f4c369
  c3f2c8ad
- lpf_8_test: simplify function wrapper generation · 45a7b5eb
  James Zern authored 9 years ago
  
  Change-Id: Ie4d3e80a4e43dd4ada78d073e308e10db4ea3239
  45a7b5eb
Feb 15, 2016

Add optimized vpx_sum_squares_2d_i16 for vp10. · abd00505

Geza Lore authored 9 years ago

Using this we can eliminate large numbers of calls to predict intra,
and is also faster than most of the variance functions it replaces.
This is an equivalence transform so coding performance is unaffected.

Encoder speedup is approx 7% when var_tx, super_tx and ext_tx are all
enabled.

Change-Id: I0d4c83afc4a97a1826f3abd864bd68e41bb504fb

abd00505

Feb 12, 2016

vp9-resize: Fix an issue with external dynamic resize. · 3cbc26f3

Marco Paniconi authored 9 years ago

External dynamic resize with swapping width and height was
not handled properly.
Fix is to re-init loop-filter under certain condtions.

Modify unittest to test this case.
Without this change test will fail.

Relates to: https://bugs.chromium.org/p/webm/issues/detail?id=1140

Change-Id: I7d81ca7fe0783b3bc103a52a7b7cf073a96be26e

3cbc26f3

tests: quiet some unused parameter warnings · cffef113
James Zern authored 9 years ago
```
Change-Id: Iff8b0d77234f78bf407676891bccad92825bfcc6
```
cffef113
vp9_error_block_test: prefer EXPECT over assert() · bdad3689
James Zern authored 9 years ago
```
Change-Id: Id523448bac903999934370f7b06a5c316f11a966
```
bdad3689
vp9_encoder_parms_get_to_decoder: add missing initializers · 153ef3d8
James Zern authored 9 years ago
```
+ quiet an unused parameter warning

Change-Id: I65f69172febb4e0701d3e440b7e1fb31829cda57
```
153ef3d8

Feb 11, 2016

Enable computing PSNRHVS for hbd build · bb8ca088

Yaowu Xu authored 9 years ago

This commit adds computation of PSNRHVS for highbitdepth build, it
also adds tests to make sure the calculation of psnrhvs metric for
10 and 12 bit correct.

Change-Id: Iac8a8073d2b3e3ba5d368829d770793212fa63b6

bb8ca088

vp9-resize: Force reference masking off for external dynamic-resizing. · 34d12d11

Marco Paniconi authored 9 years ago

An issue exists with reference_masking in non-rd pickmode for spatial
scaling. It was kept off for internal dynamic resizing and svc, this
change is to keep it off also for external dynamic resizing.

Update to external resize test, and update TODO to re-enable this
at frame level when references have same scale as source.

Change-Id: If880a643572127def703ee5b2d16fd41bdbf256c

34d12d11