- 25 Jan, 2018 18 commits
-
-
Hui Su authored
Change-Id: I93702d24bf7d711b6910e2e502f9f97c661bcf6c
-
Yushin Cho authored
Initialization has been nowhere done for seg->temporal_update. Change-Id: I3ccc0e10e14a83859b683c026093b921ea6d5dbf
-
Frank Bossen authored
Can reduce decoder run time by 4 percent. Change-Id: Ibdd5bb3a18002789852f2e367b32533163a8c022
-
Jingning Han authored
Change-Id: I83ca47c1469d8e383a815058c02c4826c6282873
-
Jingning Han authored
Change-Id: If8836621586ab5090affbb8d6d7b0be3a3e4cde8
-
David Barker authored
Commit https://aomedia-review.googlesource.com/c/aom/+/40541 replaced a division in the DC intra predictor by an approximate multiply+shift sequence. Unfortunately, this approximation is able to produce out-of-range values. For example, consider 4x8 DC_PRED, with bit depth = 10. If all of the context pixels are 0x3FF (the max value), then we get: sum = 12 * 0x3FF expected_dc = (sum * 0xAB) >> 11 = 1024 = 0x400 This means that we need to insert a clip_pixel(_highbd) operation at the end of the DC prediction, to bring this value back in range. BUG=aomedia:1272 Change-Id: I9beb9ac8a4b39803865f7e23932402ecd1d6f672
-
Yunqing Wang authored
mode_context[ref_frame] is calculated in find_mv_refs_idx(), but is set to 0 in setup_ref_mv_list. Therefore, the calculation in find_mv_refs_idx() is not needed. Change-Id: I65ca06a2000278ad21c2eaa81eb12c48a7c1fcb8
-
Frank Bossen authored
This change is not normative and produces the same results as before. TPL_MV_REF data structure is about 5x smaller. Observed overall decoder run time reduction is about 4%. No observed change in encoder run time. Change-Id: Id68a492bac3bf28f48b7ceeedf85cd29981238ee
-
Tom Finegan authored
Writes PRE_OBU_SIZE_BYTES (currently 4) bytes padded unsigned LEB128 encoded integers in OBU size fields when enabled: $ cmake path/to/aom -DCONFIG_OBU=1 -DCONFIG_OBU_SIZING=1 && cmake --build . Requires CONFIG_OBU. BUG=aomedia:1125 Change-Id: I4d184ef0c8587d24e9c8c3e63237ea5003386c6a
-
Frederic Barbier authored
BUG=aomedia:1266 Change-Id: I7612e379aa7c63da56e975e95cd7266cd1f8c68d
-
Yue Chen authored
Remove all *bmc variables, which were used to record basic motion search results (no advanced masked compound) when obmc and warped motion modes were allowed to work with compound ref. Remove switchable rate that is passed in to it, since in most motion modes, we need to recalculate the cost based on motion_mode and the refined mv. This change slightly improve the rd perf. Performance change: -0.024% Change-Id: I4afe0927e97cc7e7251022957f7665ed3032079c
-
Angie Chiang authored
Instead of listing all possible stage_range, we use set_fwd_txfm_non_scale_range() to generate 2d stage_range from 1d stage_range. This will reduce the complexity of txfm table significantly. This is a lossless change. The coding performance isn't changed. The txfm config is exactly the same as it was before. Change-Id: Ibd1d9e53772bb928faaeecc98d81cbc8f38b27ed
-
Angie Chiang authored
Change-Id: I73d1d15ab678242737432064d203c476057286ed
-
Zoe Liu authored
This patch simply aggregates the checking on the counts of certain reference frames in the neighboring above and left blocks. It does not incur any coding performance change. Change-Id: I59a962ba95e7ab16731ce97371ec5709a582a0ba
-
Hui Su authored
Change-Id: I4f9d014324b35e30f25cae5fa570620249640cf6
-
Hui Su authored
Only half of it was necessary. Change-Id: I0b5fc9ae6a17f5d812e10ee903a12f23f1377d8e
-
Debargha Mukherjee authored
Temporary quick fix for broken compatibility with testing infrastructure. Change-Id: I9af93690dd107fc79a79062f4d6ea7c53c8b4798
-
Tom Finegan authored
- Stop relying on asserts for error checking. - Update callers to check for and return errors where required. Change-Id: Id6a39b14397394b85aaa9dc8b168f7a26f04919b
-
- 24 Jan, 2018 18 commits
-
-
Cheng Chen authored
Record total rate cost when computing trellis optimization. Reduce redundant rate computation in later stages. Speed impact: ~6% speed up Coding performance should not be affected. Change-Id: I9e940a2d126bb55930fcf22ea04d061eee1fc944
-
Andrey Norkin authored
Change-Id: I0fdb09499196e02709e067f690dff71146ee5114
-
Imdad Sardharwalla authored
The self-guided filter speed tests show that: - The SSE4.1 implementation of FAST SGR is ~35% faster than the corresponding implementation of SGR; - The AVX2 implementation of FAST SGR is ~28% faster than the corresponding implementation of SGR. Change-Id: Iecdc1f8cee79500084c71d06dbb02d804272aa99
-
Debargha Mukherjee authored
Adds an experiment for fast sgr computation where for the r=2 filter, computation of the A, B stats are computed for every other row and averaged in between. The motivation is to improve software performance with hopefully minimal loss. Change-Id: Ie36687826524dc18c1fbb7f6becff244187bf8da
-
David Barker authored
There is a special case with certain frame heights, where we end up with a loop restoration stripe which ends 1px above the crop border. Previously this case was handled in quite an ugly way, which also disagrees with the spec (+ isn't great for hardware). This patch changes things to match the spec. Specifically, the old method was to sometimes upscale one extra row of deblocked pixels so that we could always have a 2px "below" border for each processing stripe. The new method is to only use rows inside the crop border, and to duplicate them if necessary. BUG=aomedia:1264 Change-Id: Idf8ab510e1091dc3f5b257de60e16bca214d8dc4
-
Sean DuBois authored
BUG=aomedia:13 Change-Id: I9df343f4a6a809b09446ff1f2083c38771ab068b
-
Yaowu Xu authored
Profile 0 now supports 10 bit, therefore no longer means input_shift at 0. Change-Id: Idae429b88ee5c073ee6e939a88d569c5ffde2b0d
-
Angie Chiang authored
Move cos_bit from txfm 1d cfg to 2d cfg Each txfm stage only uses one cos_bit This is a lossless change and it speeds up encoder by 2% Change-Id: I45d398761e4729b8c4c37729571fe3765cb0c83f
-
Frederic Barbier authored
Change-Id: I6532e20c958d5bf6f6d73a6f076664e1b74ba055
-
Jingning Han authored
Skip the rate distortion search over last 2/3 reference frames for the reference motion vectors derived from non-nearest neighbors. The overall coding performance change is in the noise range - 0.05% better. Speed up the encoding process by 20%. Change-Id: I823b8ca2805ae332f4c9bc8ee255069a82db4331
-
Zoe Liu authored
In rd_pick_partition(), the first one or two blocks for the partition types HORZ_A, HORZ_B, VERT_A, and VERT_B may be already evaluated, during the evaluation of SPLIT, HORZ, and VERT. This patch saves the RD pick mode results and tries to reuse them to remove the duplicate RD mode evaluation operations. This patch should not incur any coding performance loss. Testing on a few lowres frames: when CFL is off, this patch obtains >10% encoder speedup. Change-Id: I932e233bc93873de62a88230254df44494236dde
-
Yushin Cho authored
AVX2 Code for av1_convolve_2d_sr_c() Change-Id: Id8a2192b78bbb2c6ac22da3134a7c256941985c8
-
Johann authored
These flags provided compatibility with configure but have no effect in cmake builds. Change-Id: I2dbb71d9aeaae759cc3c4a46917e3840d696328d
-
Johann authored
In-tree builds are explicitly disallowed by cmake. Any of these files showing up in the source tree should be cause for concern. BUG=aomedia:1254 Change-Id: Iae42c17cbadb6554c6a95bda14daf5ac67e352a7
-
Johann authored
At least the changes that don't conflict with 4.0.1 Change-Id: Iaa2fda027b8ab2b023d608cf5ec7b377a72b851e
-
Yaowu Xu authored
Change-Id: I243e2a3cbae5b4eebe7fbabcb9f55552e9f13bd8
-
Jingning Han authored
Make the per transform block kernel selection process unified with the rate distortion model used in preliminary mode search. This makes the txk-sel model search space same as baseline. Change-Id: I82a2d94e88a03c88154582575ced500197f8a409
-
Hui Su authored
Change-Id: Iea0e8665cdd5b9bc0fe17930add7068443765ea9
-
- 23 Jan, 2018 4 commits
-
-
Hui Su authored
It's more efficient to use av1_cost_literal() instead. Change-Id: I50727d4a4ee06492b373c2e7831c224c5eae8735
-
Hui Su authored
Change-Id: I9e16b5de0a3ae1814982660434812d417955d94f
-
Debargha Mukherjee authored
An improvement in coding efficiency for higher resolution sources. Plus having this on by default will guard against 256x256 LRU support not being inadvertently broken. Change-Id: I171b3c310eab72e27390e9ad0aa9c362f7fbb508
-
Yaowu Xu authored
This commit replaces hard coded FRAME_ID_NUMBERS_PRESENT_FLAG with error_resilient_mode, which properly reflects the intention of the experiment, i.e. "signal the complete state of the reference buffer explicitly for each frame" to deal with possible frame losses. Change-Id: I7130c110d26c6a8e1cf1266c05482b768cf352f9
-