- 28 Mar, 2016 3 commits
-
-
Alex Converse authored
* changes:
  Add pluggable bitwriters.
  Add pluggable bitreaders.
-
Hui Su authored
-
Yi Luo authored
-
- 26 Mar, 2016 4 commits
-
-
hui su authored
This patch fixes 2 issues in Palette mode:
  1. More memory is needed in PALETTE_BUFFER for the 444 video format.
  2. A merge issue caused by https://chromium-review.googlesource.com/#/c/333940/7
Change-Id: I2aedc7dfdfb6b66fbd600189ec6e1e2cc6120d40
-
Alex Converse authored
-
Alex Converse authored
No need to avoid shortcuts when all we are testing is the superframe syntax. Decreases the run time of the VP10 version of the test from 22 seconds to 3 seconds on my machine. Change-Id: If0c3551cbb8af8b803e02629e803e5f09da76cd1
-
Alex Converse authored
-
- 25 Mar, 2016 12 commits
-
-
Yi Luo authored
  - Wrote functions fidtx8_sse2() and fidtx16_sse2().
  - Turned on vp10_fht8x8_sse2()/vp10_fht16x16_sse2() for new types.
  - Updated 8x8/16x16 unit tests for accuracy/speed.
  - Running 20K times with random numbers and going through tx types from V_DCT to H_FLIPADST, SSE2 speed improvement:
      8x8: ~131%
      16x16: ~66%
Change-Id: Ibbb707e932a08fec3b1f423a7dab280a1d696c9a
-
Alex Converse authored
Failing since: 66f2f65e Merge MISC_FIXES Change-Id: I8135d6a9d74972c595f1b6294fb842e68f91e50e
-
Yue Chen authored
Skip the OBMC check when the regular inter predictor is already poor, i.e. when the RD cost of its Y residual is greater than the total RD cost of the best mode so far. Performance change compared to full RD search: +0.006% lowres, -0.056% midres. Encoding time: 1.14X baseline (was 1.42X). Change-Id: I11350f955a20e1a2331be458537a915e09fbedf3
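The early-termination rule described in this commit can be sketched as below. This is a minimal illustration in Python, not the libvpx code: the helper name `should_check_obmc` and the RD cost values are hypothetical, and only the comparison itself comes from the commit message.

```python
def should_check_obmc(regular_y_rd: int, best_rd_so_far: int) -> bool:
    """Return True if the OBMC search is still worth running.

    Hypothetical helper: OBMC is skipped when the Y-residual RD cost of
    the regular inter predictor is greater than the total RD cost of the
    best mode found so far.
    """
    return regular_y_rd <= best_rd_so_far


# Made-up RD costs, for illustration only.
best_rd_so_far = 1000
candidates = [800, 1000, 1500]

# Only candidates that pass the gate would go on to the OBMC search.
checked = [rd for rd in candidates if should_check_obmc(rd, best_rd_so_far)]
```

The gate trades a small quality change for time: per the figures above, encoding time drops from 1.42X to 1.14X of the baseline.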
-
Alex Converse authored
-
Yunqing Wang authored
-
Alex Converse authored
Change-Id: Ic9ed1f88f5550b69a45a0fdc71aae5864db7e178
-
Alex Converse authored
This will make the code change for a pure ANS experiment manageable. Change-Id: I3c72a2d8e75afa2cc8e56992ee91f4760202f4d4
-
Alex Converse authored
This will make the code change for a pure ANS experiment manageable. Change-Id: I9905f3a89f492a4346860463a72fa8c52aac4c8e
-
Hui Su authored
-
Yunqing Wang authored
After porting tile coding from VP9 to VP10, some performance degradation was seen because of differences between the VP9 and VP10 baselines. This patch disables some VP10 features while tile coding is turned on. Also, an encoder control API was added back for this use case. Change-Id: I8f736db8388408a8cc35320a2f80abb02906571c
-
hui su authored
Skip the filtered intra mode search in inter frames when DC mode is worse than the best mode so far. With ext-intra enabled, overall speed increases by 20~40%; the performance drop is 0.03% on lowres and 0.05% on midres. Change-Id: I75d2503b067cf5e46e3533b97fb01497e125baa7
-
Yi Luo authored
-
- 24 Mar, 2016 6 commits
-
-
Yi Luo authored
  - Added function fidtx4_sse2().
  - Turned on vp10_fht4x4_sse2() for these tx types.
  - Updated 4x4 unit test for speed/accuracy.
  - 4x4 Unit test passed.
  - Running 20K times with random numbers for tx type from V_DCT to H_FLIPADST, SSE2 against C, speed improves ~46%.
Change-Id: I828088b7f98dc0f5939a72e3fcd6cb0b8d8dd8bf
-
Jingning Han authored
Change-Id: Id09e1913c1ac965b78df2e67471807019c89f8ab
-
Jingning Han authored
-
Jingning Han authored
This was broken due to leakage between consecutive CLs. Change-Id: I08ba8c67a42871d9488729ed854845641aa7ca30
-
Geza Lore authored
If configured with --enable-ext-tile, the codec uses an alternative tile coding syntax in the bitstream. Changes include:
  - The maximum number of tile rows and columns is extended to 1024 each.
  - The minimum tile width/height is 64 pixels (1 superblock).
  - A tile copy mode is added where a tile directly reuses the coded data of a previous tile.
  - The meaning of the tile-columns and tile-rows codec parameters is overloaded to mean tile-width and tile-height in units of 64 pixels.
  - All tiles should now be independent, including rows within the same columns, so large scale parallel, or independent decoding is possible.
  - vpxdec also gained the options to decode only a particular tile, tile row, or tile column.
Changes without --enable-ext-tile:
  - All tiles should now be independent, including rows within the same columns, so large scale parallel, or independent decoding is possible.
  - vpxenc default tile configuration changed to use 1 tile column.
Change-Id: I0cd08ad550967ac18622dae5e98ad23d581cb33e
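To illustrate the tile-size arithmetic implied by the ext-tile parameters, here is a small Python sketch. The 64-pixel unit and the 1024 limit come from the commit message; the function name `tile_count`, the round-up behaviour for a partial last tile, and the error handling are assumptions made for illustration and may not match the encoder.

```python
SB_SIZE = 64      # tile sizes are given in units of 64 pixels (1 superblock)
MAX_TILES = 1024  # maximum tile rows or columns with --enable-ext-tile


def tile_count(frame_px: int, tile_size_sb: int) -> int:
    """Number of tiles along one frame dimension (hypothetical helper).

    frame_px:     frame width or height in pixels
    tile_size_sb: tile width or height in 64-pixel units (minimum 1)
    """
    if frame_px <= 0 or tile_size_sb < 1:
        raise ValueError("frame size must be positive and tile size >= 1")
    # Round up to whole superblocks, then to whole tiles (assumed here:
    # the last tile may be smaller than the requested size).
    frame_sb = -(-frame_px // SB_SIZE)
    count = -(-frame_sb // tile_size_sb)
    if count > MAX_TILES:
        raise ValueError("more than 1024 tiles along one dimension")
    return count


# A 1920-pixel-wide frame is 30 superblocks wide; with tile width 4
# (256 pixels) that gives 8 tile columns under the assumptions above.
columns = tile_count(1920, 4)
```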
-
Angie Chiang authored
-
- 23 Mar, 2016 12 commits
-
-
Yi Luo authored
-
Yi Luo authored
  - Use Makefile to control the build for highbd_fwd_txfm_sse4.c.
  - Fixed hybrid transform (HT) types due to recent update.
  - Added new unit test cases for highbd HT.
Change-Id: Ifd768a9b429a8c21ed40c1de8152fb5ac71e2f90
-
Jingning Han authored
This commit separates the predicted motion vector from the nearestmv motion vector in the coding process for both regular and sub8x8 block sizes. Change-Id: I703490513b0194e6669ebf719352db015facb3e1
-
Angie Chiang authored
Change-Id: Ieabe5534e5f4fb3f2d751a3cfc682208b3913715
-
Yi Luo authored
Merge "Highbd fht4x4 SSE4.1 optimization for DCT_DCT mode
  - Set up function vp10_highbd_fht4x4_sse4_1 for highbd SSE4.1 intrinsics optimization.
  - Wrote SSE4.1 functions: load_buffer_4x4(), write_buffer_4x4(), and fdct4x4_sse4_1().
  - Used logical right shift to avoid coeff memory write/read.
  - Turned on vp10_highbd_fht4x4_sse4_1 for DCT_DCT mode only.
  - Improved overall encoding performance >2.3% for a 50-frame sequence, park_joy_1080p_12.y4m, encoded with --input-bit-depth=12 and --bit-depth=12.
  - Unit test passed." into nextgenv2
-
Hui Su authored
-
Alex Converse authored
-
Alex Converse authored
-
Hui Su authored
-
Yi Luo authored
  - Set up function vp10_highbd_fht4x4_sse4_1 for highbd SSE4.1 intrinsics optimization.
  - Wrote SSE4.1 functions: load_buffer_4x4(), write_buffer_4x4(), and fdct4x4_sse4_1().
  - Used logical right shift to avoid coeff memory write/read.
  - Turned on vp10_highbd_fht4x4_sse4_1 for DCT_DCT mode only.
  - Improved overall encoding performance >2.3% for a 50-frame sequence, park_joy_1080p_12.y4m, encoded with --input-bit-depth=12 and --bit-depth=12.
  - Unit test passed.
Change-Id: Idd6dc6e472cbbf235f0ade4f66fbe859a860a004
-
Debargha Mukherjee authored
-
Debargha Mukherjee authored
Uses 15 frames instead of 30. Also only tests speed 0 for VP10. Change-Id: Icace050edd974622d83bdf843058f63bd8d3a84b
-
- 22 Mar, 2016 3 commits
-
-
Alex Converse authored
Change-Id: I32b63fca020c410cef16e93379b4e6e281ccbccd
-
Yue Chen authored
-
Julia Robson authored
This has been ported under ext_partition_types because it is due to be combined with the coding_unit_size experiment, which is already being ported under ext_partition. Change-Id: I47af869ae123ddf0aa99160dac644059d14266ee
-