- 14 Jul, 2017 1 commit
-
-
Urvang Joshi authored
The memset to 0 wasn't required because the temporary variable is only written to, before being read in the next function call. Tested: ./test_libaom --gtest_filter=*SelfguidedFilterTest* Change-Id: Ie1628d43b050744ae97a8be55f551edb602b018b
-
- 12 Apr, 2017 1 commit
-
-
Rename '--enable-aom-highbitdepth' to '--enable-highbitdepth' Change-Id: I1de13c3508c30c552532993419d8ace326142ab6
-
- 20 Mar, 2017 1 commit
-
-
David Barker authored
This filter was temporarily removed due to test failures. This patch reintroduces the filter and fixes two bugs: * The test cases would occasionally segfault on x86, since the highbd filter requires its inputs to be aligned to 16 bytes. This will always be true when used on real videos, so adjust the test cases to match. * The function calc_block was incorrect for bit_depth > 8, due to passing an incorrect argument to _mm_srl_epi32(). This was the cause of the original test failures. BUG=aomedia:392 Change-Id: Ia06b76c3e6122eebadd0995fb62f32c2fcab8b3e
-
- 13 Mar, 2017 1 commit
-
-
Yaowu Xu authored
Change-Id: I035d4a3b14e6c56a55b06dea57b4a49cd161e69f
-
- 10 Mar, 2017 1 commit
-
-
David Barker authored
Patch https://aomedia-review.googlesource.com/c/8387/ introduced some bugs into the self-guided filter correctness test. One bug was fixed in https://aomedia-review.googlesource.com/c/8445/ , but another surfaced while developing https://aomedia-review.googlesource.com/c/8480/ This patch tidies up the relevant code to make it clearer, and should fix the remaining bug(s). Change-Id: I7a5957b02f0ad309aa4081c72d490d7369c8d90a
-
- 09 Mar, 2017 2 commits
-
-
David Barker authored
Performance is very similar to the lowbd path (only 4-5% slower) Change-Id: Ifdb272c3f6c0e6f41e7046cc49497c72b5a796d9
-
Yaowu Xu authored
The commit increase size of a few heap allocations to make sure later access is not out of bounds. BUG=aomedia:383 Change-Id: Iadb08faa1e55be361dd3d4adaafeb85cecf23bbb
-
- 08 Mar, 2017 2 commits
-
-
David Barker authored
By rearranging the code in restoration.c, we can allow the encoder to use the SSE4.1 version of the self-guided filter while picking the loop-restoration filter. This also helps us prepare for adding a highbitdepth SSE4.1 version of the self-guided filter. No effect on encoder output, but gives an end-to-end speedup of 1-2%. Change-Id: Id17ba4a0963ddce9f70a7cae666e212e138d5f2c
-
David Barker authored
Adjust the vectorized filter so that it can handle tile widths which are not a multiple of 4, so we do not have to fall back to the C version of the filter. Negligible speed impact for tiles with widths which are multiples of 4, and greatly improves speed on tiles with non-multiple-of-4 widths. Change-Id: Iae9d14f812c52c6f66910d27da1d8e98930df7ba
-
- 06 Mar, 2017 1 commit
-
-
David Barker authored
Add an SSE4.1 lowbd version of the self-guided filter for loop-restoration, and apply some optimizations to the C version. Approximate times per 128x128 / 256x256 tile on the machine this was developed on: Previous C: 620us / 2800us Optimized C: 500us / 2200us ( 24% / 27% faster) SSE4.1: 147us / 600us (320% / 370% faster) Change-Id: I23ff5a5482a191aeb06f9d1f767a9f036bb357fe
-