- 08 Oct, 2017 9 commits
-
-
Debargha Mukherjee authored
Change-Id: I71c07652565c0e1ca44d73f3731459949271fe45
-
Debargha Mukherjee authored
Solves some Windows build issues Change-Id: Ia903ed05285362449829a2777999cf73058f7733
-
Zoe Liu authored
This coding tool is dependent on the tool of frame_marker. This tool derives the frame sign bias directly from the frame offset. No sign bias signaling is needed. Change-Id: I3a8c77904d73caeeb1b6777fb026279fd2bbc6fb
-
Yunqing Wang authored
Add an experiment "tmp", which includes: 1. Always use larger block size while storing frame MVs and make it consistent for CB4X4 or non-CB4X4 cases. Namely, use 8x8 for 4x4 mi size and 16x16 for 8x8 mi size. 2. Allocate smaller buffer for frame MVs and save memory usage. 3. Use nearby 8x8 or 16x16 location's previous frame MVs, and make the logic simple. 4. Reduce the number of copying for frame MVs, that is very costly in decoder. Baseline decoder got 5+% speedup. Borg test on lowres set showed a +0.009% PSNR difference before/after the patch. Change-Id: I61e14e95fd35bea88f338931b4f43c44f4e4cf1f
-
Debargha Mukherjee authored
Change-Id: I16cee2064ddc80f80a21560e9d192a39033949ca
-
Debargha Mukherjee authored
Change-Id: I599f8fbdd3c19ec67d9a2118a41d735e11dd3f07
-
Zoe Liu authored
Change-Id: Ibdcb1530b9f81a2a5222e95cf5c0b7b2938509a8
-
Debargha Mukherjee authored
Change-Id: Ia6731231f860c3ca689240c777463d8b232b3901
-
Debargha Mukherjee authored
Various fixes for pvq build. Change-Id: Ideebdb072ed5786f3224e93ded5ec75a23e68dab
-
- 07 Oct, 2017 11 commits
-
-
Luc Trudeau authored
Change-Id: I13ba0dbe57297b540b78512d21a119f05a86a849
-
Luc Trudeau authored
high bit depth (_hbd) and low bit depth (_lbd) versions of the cfl functions: sum_above_row, sum_left_col, cfl_build_prediction, cfl_luma_subsampling_420 (4:4:4 will be added in subsequent commit) and cfl_alpha_dist. For cfl_alpha_dist, special care is given to scale the SSE according to the bit depth. BUG=aomedia:835 Change-Id: I5b72845100d88fb8a438efe665bcae7fe1ba50b8
-
Urvang Joshi authored
When enabled, scaling through resize and superres will occur only in the frame's width; the height will not be scaled. Macro is off by default. Change-Id: I501b2b0b2766aa4a86da5937b57c4d5aee4e34c4
-
Urvang Joshi authored
Change-Id: I27292b7cdb27cec23754a6f017c5c7c55eb38bb5
-
Debargha Mukherjee authored
ext-partition-types and supertx are incompatible Change-Id: I6c4cce16453cff13b0acbaad93dde7d089891038
-
Urvang Joshi authored
Earlier, the superres scale was in the form of: N/16, where N ranged from 8 to 16. We change this to the form: 8/D, where D ranges from 8 to 16. This helps on the decoder side, by making it possible to work on 8x8 blocks at a time. Change-Id: I6c72d4b3e8d1c830e61d4bb8d7f6337a100c3064
-
Urvang Joshi authored
cm->superres_scale_numerator is used for both keyframes and non-keyframes, and is initialized from either oxcf->superres_scale_numerator or oxcf->superres_kf_scale_numerator as appropriate. Change-Id: Ie46df576ef3830e181643ae591d836449a4bd38f
-
Rupert Swarbrick authored
The restoration tiles (rtiles) divide the upscaled frame, not the encoded one. Change-Id: I2d08fe926d694fee7064461685289d3fd1c1de0c
-
Debargha Mukherjee authored
This optimization for speed was useful only when max tx-size was 32x32. However with tx64x64 this was breaking certain assumptions causing huge drops in coding efficiency. So I am removing this optimization for now. This can be brought back latger as a speed feature. The removal of this optimzation brings back the loss when 32x64 and 64x32 transforms are used. Change-Id: I15987ea9ff53fa36a2962fe5f156c30a11e809ed
-
Joe Young authored
The SSE4 function filter_intra_edge_sse4_1() reads data slightly past the initialized part of the array. Those data are discarded later, but causes a valgrind warning. This change avoids the warning by initializing the array an extra +16 positions. BUG=aomedia:868 Change-Id: Ib610492cff91492ae379c5d62895773f8747c4bc
-
Luc Trudeau authored
To simplify high bit depth commit, the summing the top row and the left column are extracted out of cfl_dc_pred. This does not change the bitstream. Change-Id: I5c9fe91df4942f736c5af29c1d93abb3a6c8501f
-
- 06 Oct, 2017 17 commits
-
-
Jingning Han authored
Reduce the context model size for key frame modes from 30240 bits to 4500 bits, i.e., less than 1/6 of the original context model. The coding performance loss on key frame is 0.14% for lowres and noise level difference for video sequence. The loss on key frame for midres is 0.05% and noise level for whole video. The change on hdres kf coding is 0.015%. Change-Id: I9e36825e5c5ee6ba35038c3ca349ad1ad3429910
-
Debargha Mukherjee authored
When ext-partition and ncobmc-adapt-weight is on, avoid too large stack allocations. Change-Id: I8db74e45cac80c4e5dfd9e20cfc73d9978d1578e
-
Angie Chiang authored
Change-Id: I923931a9dbf828eb13670511852d55c953b479c1
-
Sebastien Alaiwan authored
This is undefined behaviour in C99 and could mislead the optimizer. This fixes the ubsan warning, and still generates optimal code (i.e an inlined 'sar' instruction). Change-Id: I36b20a6780532b8c9379b9fbfd970933d56b1bc5
-
Alexander Bokov authored
Average speed-up (lowres): low bitrates: 6.6% mid bitrates: 2.5% high bitrates: 0.0% Average PSNR loss: lowres: 0.010% midres: 0.005% Change-Id: Id34fb247e5e31f04ca324c58142e4b5ac4edacda
-
Yi Luo authored
On i7-6700: Predictor ssse3 v. C 4x4 ~1.3x 4x8 ~1.9x 8x4 ~2.3x 8x8 ~3.4x 8x16 ~4.1x 16x8 ~4.6x 16x16 ~5.2x 16x32 ~5.6x 32x16 ~4.2x 32x32 ~4.7x Change-Id: Ic12383cf9d4446361d6355eb8a480a3c7602060e
-
Sebastien Alaiwan authored
Here, we're testing CONFIG_HIGHBITDEPTH but what we really depend upon is the actual size of the coefficients. Change-Id: I33d71e4b38b4b83bb4232346f4d449f20bcf740e
-
RogerZhou authored
Change-Id: I4f158dd15e4a7a8abb27ad0ad6aed6f8f9fce4f6
-
Rupert Swarbrick authored
Since the CONFIG_EXT_INTER #if/#endif lines have been removed, it's a bit clearer what's going on here and this patch cleans up the code. Firstly, the patch pulls the cheap checks on best_mbmode.ref_frame out to the front of the block, so we needn't call gm_get_motion_vector at all for compound predictions. Next, second element of the zeromv array is never used, so we needn't compute it. Finally, the patch removes the calls to lower_mv_precision. These shouldn't be needed, but it's not exactly obvious why not so the patch adds some comments to gm_get_motion_vector to explain what's going on and adds an assertion to make sure they are true. It also adds a call to integer_mv_precision on the early return path of gm_get_motion_vector, correcting an apparent bug when CONFIG_AMVR is true. This patch shouldn't make any difference to encoder or decoder behaviour. Change-Id: I0b4a01063574d080bbf6d30187f4e1748c60939d
-
Angie Chiang authored
The performance difference is lowres: 0.02% gain midres: 0.07% gain Change-Id: I68a74462f41db3bf24573cf2a08c8b5b8aa13f5f
-
Debargha Mukherjee authored
This is an adopted experiment. Change-Id: If33f700c12fb3fd0f8368714d5e9795aad3729d8
-
Yunqing Wang authored
This patch fixed loop-restoration subpixel_8t_ssse3 function so that it works with CONFIG_PIC. Change-Id: I178efae161a2b4c8388c141c378c99149ea3a847
-
Sebastien Alaiwan authored
Change-Id: I4732dbbb71a0db9ac284a4b2ae5f10816e0e9264
-
RogerZhou authored
Change-Id: I3f30c35bcd1bc623ad0c34c4b954ff71b2fcfd00
-
Hui Su authored
Change-Id: I22f4ea3b8348816d4592645e9994b4b0ff29868c
-
Debargha Mukherjee authored
Re-enable test with intra-edge Change-Id: I189398a2426a8ca66edaf15f91286a2aff9b47b0
-
Debargha Mukherjee authored
Fixes compile failures with unpoison-partition-context and ext-partition-types. Change-Id: Ibedab7d927d6bf7329a5fc5a3050aa6272d4b254
-
- 05 Oct, 2017 3 commits
-
-
Hui Su authored
Hard code av1_ext_tx_ind[] and av1_ext_tx_inv[]. Change-Id: Ife650d41be3f85d048f5585659ad57e03f54b39b
-
Jingning Han authored
Change-Id: Ib0d699106583e7072e6287a5a780967b5059efa3
-
Thomas Davies authored
Change-Id: I0b6f56d2bf0482cfbf45525828d6daa6aea187e5
-