- 09 Sep, 2016 1 commit
-
-
Geza Lore authored
Encoder speedup ~2% with ext-tx + rect-tx Change-Id: Id56ddf102a887de31d181bde6d8ef8c4f03da945
-
- 02 Sep, 2016 2 commits
- 01 Sep, 2016 2 commits
-
-
Yaowu Xu authored
Cherry-Picked the following commits: 0defd8f2 Changed "WebM" to "AOMedia" & "webm" to "aomedia" 54e66767 Replace "VPx" by "AVx" 5082a369 Change "Vpx" to "Avx" 7df44f17 Replace "Vp9" w/ "Av1" 967f722f Remove kVp9CodecId 828f30ce Change "Vp8" to "AOM" 030b5ffc AUTHORS regenerated 2524caee Add ref-mv experimental flag 016762be Change copyright notice to AOMedia form 81e55269 Replace vp9 w/ av1 9b94565b Add missing files fa8ca9f2 Change "vp9" to "av1" ec838b76 Convert "vp8" to "aom" 80edfa01 Change "VP9" to "AV1" d1a11fb9 Change "vp8" to "aom" 7b582513 Point to WebM test data dd1a5c8d Replace "VP8" with "AOM" ff00fc0f Change "VPX" to "AOM" 01dee0bb Change "vp10" to "av1" in source code cebe6f0c Convert "vpx" to "aom" 17b05679 rename vp10*.mk to av1_*.mk fe5f8a8a rename files vp10_* to av1_* Change-Id: I6fc3d18eb11fc171e46140c836ad5339cf6c9419
-
- 26 Aug, 2016 1 commit
-
-
Geza Lore authored
For rectangular blocks between 8x8 and 32x32, we can now code the transform size as one bigger than the largest square that fits in the block (eg, for 16x8, we can code a transform size of 16x16 rather than the previous maximum of 8x8), when this oversized transform is coded in the bitstream, the codec will use the full size rectangular transform for that block (eg 16x8 transform in the above example). Also fixes a scaling bug in 16x8/8x16 transforms. Change-Id: I62ce75f1b01c46fe2fbc727ce4abef695f4fcd43
-
- 23 Aug, 2016 1 commit
-
-
Yaowu Xu authored
This fixes two MSVC compiler warnings. Change-Id: I55ad8833676e20c2c4a55885b99a7a9293d9623f
-
- 18 Aug, 2016 1 commit
- 16 Aug, 2016 1 commit
-
-
Yaowu Xu authored
Manually cherry-picked following commits from AOMedia git repository: bb2727c2 Sort includess for "clpf.h" c297fd00 Add quantisation matrix range parameters. 0527894f Add encoder option and signaling for quant matrix control. 4106232b Turn off trellis coding for quantization matrices. 4017fca3 Modify tests to allow quantization matrices. 1c122c24 Add quant and dequant functions for new quant matrices. 95a89994 Enable CLPF f72782bb Fix a build issue 73bae509 Add quantisation matrices and selection functions 33208d20 Added support for constrained low pass filter (CLPF) Change-Id: I60fc1ee1ac40e6b9d1d00affd97547ee5d5dd6be
-
- 15 Aug, 2016 1 commit
-
-
Debargha Mukherjee authored
Adds forward, inverse transforms and scan orders. Change-Id: Iab6994f4b0ef65e660b714d111b79b1c8172d6a8
-
- 12 Aug, 2016 1 commit
-
-
Change-Id: I58a42ced5b8a4338524434ff3356850b89aa705a
-
- 21 Jul, 2016 1 commit
-
-
Debargha Mukherjee authored
Added a new expt rect-tx to be used in conjunction with ext-tx. [rect-tx is a temporary config flag and will eventually be merged into ext-tx once it works correctly with all other experiments]. Added 4x8 and 8x4 tranforms for use initially with rectangular sub8x8 y blocks as part of this experiment. There is about a -0.2% BDRATE improvement on lowres, others pending. When var-tx is on rectangular transforms are currently not used. That will be enabled in a subsequent patch. Change-Id: Iaf3f88ede2740ffe6a0ffb1ef5fc01a16cd0283a
-
- 21 Mar, 2016 1 commit
-
-
Debargha Mukherjee authored
Makes a set of 16 transforms total, adding all 1D combinations of ADST and FlipADST, and removng all DST transforms. lowres, midres both improve by about 0.1% and hdres by -0.378% in BDRATE but with fewer transforms that are also simpler. Further experiments to continue later. Change-Id: I7348a4c0e12078fdea5ae3a2d36a89a319ffcc6e
-
- 15 Mar, 2016 1 commit
-
-
Debargha Mukherjee authored
In preparation for adding more 1D variants with ADST/FlipADST/etc. BDRATE actually improves by 0.21% on lowres. Change-Id: I2fa4720c69fe001fa666119a284dfc6b17fffab2
-
- 07 Mar, 2016 1 commit
-
-
Jingning Han authored
This commit enables a hybrid 1-D/2-D transform coding scheme and the accompany entropy coding system. It currently uses hybrid 1-D/2-D DCT transform coding. It provides coding performance gains: lowres_all 0.55% hdres_all 0.43% Change-Id: I2b30dcafd21eb2bb3371f6e854cbab440a4dfa78
-
- 17 Feb, 2016 1 commit
-
-
Debargha Mukherjee authored
Adds new 32x32 masked 1-d transforms that combine 1-D length-16 DCT with length-16 identity transforms. To be continued in subsequent patches. Change-Id: I0b4f66492d44c079b3c3b531ba48a97201de1484
-
- 16 Feb, 2016 1 commit
-
-
Debargha Mukherjee authored
Removes the USE_DST2 flag that was on by default. DST2 performs slightly better that DST1 and is faster to compute. Change-Id: Ifb788f3f0a0e1995d7625230cec144b876f01206
-
- 15 Dec, 2015 1 commit
-
-
Debargha Mukherjee authored
The DST2 is implemented by input alternate sign-flip, followed by DCT, followed by output reversal. Results are roughly the same, but it should be easier to optimize the DST2. [Interestingly a mtrix multuiply implementation is about 0.1% better]. Change-Id: If9ae5fdba87767fb0e6c163a62b77ee66a8d3afc
-
- 04 Dec, 2015 1 commit
-
-
Angie Chiang authored
The range_check is not used because the bit range in fdct# is not correct. Since we are going to merge in a new version of fdct# from nextgenv2, we won't fix the incorrect bit range now. Change-Id: I54f27a6507f27bf475af302b4dbedc71c5385118
-
- 09 Nov, 2015 1 commit
-
-
Johann authored
Javan Whistling Duck release. Change-Id: If44c9ca16a8188b68759325fbacc771365cb4af8
-
- 03 Nov, 2015 2 commits
-
-
Geza Lore authored
This patch eliminates the copying of data when using FLIPADST forward transforms, by incorporating the necessary data flipping into the load_buffer_* functions of the SSE2 optimized forward transforms. The load_buffer_* functions are normally inlined, so the overhead of copying the data is removed and the overhead of flipping is minimized. Left to right flipping is still not free, as the columns need to be shuffled in registers. To preserve identity between the C and SSE2 implementations, the appropriate C implementations now also do the data flipping as part of the transform, rather than relying on the caller for flipping the input. Overall speedup is about 1.5-2% in encode on my tests. Note that these are only the forward transforms. Inverse transforms to come in a later patch. There are also a few code hygiene changes: - Fixed some indents of switch statements. - DCT_DCT transform now always use vp10_fht* functions, which dispatch to vpx_fdct* for DCT_DCT (some of them used to call vpx_fdct* directly, some of them used to call vp10_fht*). Change-Id: I93439257dc5cd104ac6129cfed45af142fb64574
-
- 15 Oct, 2015 1 commit
-
-
Jingning Han authored
Change-Id: I13c0ecff8c58a0571d9de4bc5fbbebe72533ccdb
-
- 30 Sep, 2015 1 commit
-
-
Debargha Mukherjee authored
Adds an early termination to the ext_tx search, and also implements the DST transforms more efficiently. About 4 times faster with the ext-tx experiment. There is a 0.09% drop in performance on derflr from 1.735% to 1.648%, but worth it with the speedup achieved. Change-Id: I2ede9d69c557f25e0a76cd5d701cc0e36e825c7c
-
- 29 Sep, 2015 1 commit
-
-
Yaowu Xu authored
Resolved Conflicts in the following files: configure vp10/common/idct.c vp10/encoder/dct.c vp10/encoder/encodemb.c vp10/encoder/rdopt.c Change-Id: I4cb3986b0b80de65c722ca29d53a0a57f5a94316
-
- 26 Sep, 2015 1 commit
-
-
Angie Chiang authored
comment out fdct32 remove fdct32 test Change-Id: I31c47fb435377465cd3265e39621ca50d3aae656
-
- 24 Sep, 2015 1 commit
-
-
James Zern authored
This reverts commit 8903b9fa. there is no reason for these to be global Change-Id: I66a31c06f8426aeca348ef12d9b9ab59d6d5e55d
-
- 21 Sep, 2015 1 commit
-
-
Angie Chiang authored
remove static from fdct4/8/16/32 in vp10/encoder/dct.c add prefix vp10_ to fdct4/8/16/32 add vp10/encoder/dct.h Change-Id: I644827a191c1a7761850ec0b1da705638b618c66
-
- 15 Sep, 2015 1 commit
-
-
Debargha Mukherjee authored
derflr: +1.732% (8-bit) Change-Id: I9c04c8249646ff96eacacfa1dcb0bd118c04e84a
-
- 12 Sep, 2015 1 commit
-
-
Unify the style of fdct4() fdct8() fdct16() Add fdct32() Add range_check() at each stage Add unit test at ../../test/vp10_dct_test.cc Change-Id: I13f76d9046c3ea473c82024b09a5bc8662e2c28e
-
- 10 Sep, 2015 1 commit
-
-
Debargha Mukherjee authored
Creates new hybrid transforms combining symmetric DST with ADST and DCT. Thus a total of 16 transforms are supported. derfl: +1.659% (up about 0.2%) Change-Id: Idde1cecdb59527890bf05da740099c3f6a5b9764
-
- 03 Sep, 2015 1 commit
-
-
James Zern authored
Tests fail to build. This reverts commit f78d6aa7. Change-Id: Ia220270517ded273c65a7ab965d82edb696663c9
-
- 02 Sep, 2015 1 commit
-
-
Angie Chiang authored
Unify the style of fdct4() fdct8() fdct16() Add fdct32() Add range_check() at each stage Add unit test at ../../test/vp10_dct_test.cc Change-Id: I9e912b2c5683862e65c5a21abc3e1c260cca4576
-
- 12 Aug, 2015 3 commits
-
-
Jingning Han authored
Change-Id: Ic5004f8bdc1c2b025b598e80374ee1f286ea95ee
-
Jingning Han authored
Remove the vp9_ prefix from vp10 file names. Change-Id: I513a211b286a57d6126fc1b0fbfd6405120014f1
-
Jingning Han authored
This commit folks the VP9 and VP10 codebase and makes libvpx support VP8, VP9, and VP10. Change-Id: I81782e0b809acb3c9844bee8c8ec8f4d5e8fa356
-
- 10 Aug, 2015 1 commit
-
-
Alex Converse authored
Use system_state.h in vpx_dsp and remove unneeded includes of vp9_systemdependent.h. Change-Id: I92557ec6dd5aa790160b4f31fe7967db0d7ec3c4
-
- 28 Jul, 2015 3 commits
-
-
Jingning Han authored
Clean up the forward 2D-DCT function names in vpx_dsp. Change-Id: I3117978596d198b690036e7eb05fe429caf3bc25
-
Jingning Han authored
The forward 32x32 2D-DCT functions are aligned in vpx_dsp folder. The vp9_dct.h file is not effectively used now. Change-Id: Ie7946b6fdd784b8e91496242337bc9002c75c281
-
Jingning Han authored
This completes the forward transform functions layout refactoring. Change-Id: I996fb0fb795f41e2040f7b21db985774098aedbd
-