- 04 Dec, 2015 2 commits
- 30 Nov, 2015 1 commit
-
-
Jian Zhou authored
Relocate h_predictor_4x4 from SSSE3 to SSE2 with XMM registers. Speed up by ~25% in ./test_intra_pred_speed. Change-Id: I64e14c13b482a471449be3559bfb0da45cf88d9d
-
- 25 Nov, 2015 1 commit
-
-
James Zern authored
~60-65% faster at the function level across block sizes Change-Id: Iaf8cbe95731c43fdcbf68256e44284ba51a93893
-
- 23 Nov, 2015 1 commit
-
-
Marco authored
Change-Id: Ic0ec32c1d7f7c08c9f956592dccbfd9060b1f624
-
- 20 Nov, 2015 2 commits
-
-
James Zern authored
accumulate satd in 32-bits + add unit test Change-Id: I6748183df3662ddb9d635f9641f9586f2fd38ad5
-
James Zern authored
the final sum may use up to 26 bits + add a unit test + disable the sse2 as the result will rollover; this will be fixed in a future commit Change-Id: I2a49811dfaa06abfd9fa1e1e65ed7cd68e4c97ce
-
- 19 Nov, 2015 1 commit
-
-
Jian Zhou authored
tm_predictor_4x4 is implemented with SSE2 using XMM registers. Speed up by ~25% in ./test_intra_pred_speed. Change-Id: I25074b78d476a2cb17f81cf654bdfd80df2070e0
-
- 14 Nov, 2015 1 commit
-
-
jackychen authored
Change-Id: I5a4f1f7b9de20fbfc28cb743dcd29c0eeca736f8
-
- 13 Nov, 2015 2 commits
- 10 Nov, 2015 1 commit
-
-
jackychen authored
Change-Id: I9bfa80de73847d9be88b6ce9865d7bb5fafaaa57
-
- 09 Nov, 2015 2 commits
-
-
jackychen authored
The unit test requires a longer clip which is already in the repo. Change-Id: Ic42e8d83e636fafd20d485a7f5f8422835319245
-
jackychen authored
For 1 pass CBR mode: increase waiting time after key frame before we start sampling rate control behavior for determining resize. This change need to disable one internal resize(DownUp) temporally since it requires a longer clip to do so. Change-Id: If21beda1be23f169ee541ab4dd642f718347887a
-
- 06 Nov, 2015 3 commits
-
-
James Zern authored
this helps some toolchains (vs9) resolve the type of the parameter Change-Id: I8c83b86da53b1783cd18c0f765b67ba33da91d72
-
James Zern authored
this helps some toolchains (vs9) resolve the type of the parameter Change-Id: Ic53b2ed5fbce05c5b5e633b4a4ef9ea75c55360a
-
James Zern authored
this helps some toolchains (vs9) resolve the type of the parameter Change-Id: I4acc8a844d1e55b766f66482bd6d32998174d70f
-
- 05 Nov, 2015 1 commit
-
-
James Zern authored
-l -> -sl, renamed in: be3b08da [svc] Temporal svc with two pass rate control Change-Id: I5a7b179b33d94e20e54825090659156dece928c0
-
- 03 Nov, 2015 1 commit
-
-
Marco authored
Current threshold is little too strict. Change-Id: I99ec1409d095e0c2fd3b7ab398742cabcc05700b
-
- 31 Oct, 2015 1 commit
-
-
James Zern authored
this avoids redefining vpx_codec_vp9_dx, vpx_codec_vp9_dx_algo in vp9_encoder_parms_get_to_decoder.cc Change-Id: I3b89e7a62497227ee32419f1a7d30e4c10a13c05 (cherry picked from commit ca163b85)
-
- 30 Oct, 2015 1 commit
-
-
James Zern authored
this avoids redefining vpx_codec_vp9_dx, vpx_codec_vp9_dx_algo in vp9_encoder_parms_get_to_decoder.cc Change-Id: I3b89e7a62497227ee32419f1a7d30e4c10a13c05
-
- 29 Oct, 2015 1 commit
-
-
jackychen authored
Refer to doc "vp9-test-vectors". BUG=https://code.google.com/p/webm/issues/detail?id=1086 Change-Id: I523d1f39141a3a86f113604cbdb9cd41cc2d6470
-
- 28 Oct, 2015 2 commits
-
-
Hangyu Kuang authored
These videos change resolution every 10 frames versus every 3 frames in current test sets. Change-Id: Ic33f449fc9b6d2f480825d4715b8f63e70801232
-
Hangyu Kuang authored
Change-Id: I70b1b8162a0c9b8501358ba7d32fecd1dc020ab5
-
- 22 Oct, 2015 1 commit
-
-
Ronald S. Bultje authored
Change-Id: Ic64b6928af7ae8ecc987f845b0bf0faecdacb072
-
- 21 Oct, 2015 1 commit
-
-
Geza Lore authored
A new version of vp9_highbd_error_8bit is now available which is optimized with AVX assembly. AVX itself does not buy us too much, but the non-destructive 3 operand format encoding of the 128bit SSEn integer instructions helps to eliminate move instructions. The Sandy Bridge micro-architecture cannot eliminate move instructions in the processor front end, so AVX will help on these machines. Further 2 optimizations are applied: 1. The common case of computing block error on 4x4 blocks is optimized as a special case. 2. All arithmetic is speculatively done on 32 bits only. At the end of the loop, the code detects if overflow might have happened and if so, the whole computation is re-executed using higher precision arithmetic. This case however is extremely rare in real use, so we can achieve a large net gain here. The optimizations rely on the fact that the coefficients are in the range [-(2^15-1), 2^15-1], and that the quantized coefficients always have the same sign as the input coefficients (in the worst case they are 0). These are the same assumptions that the old SSE2 assembly code for the non high bitdepth configuration relied on. The unit tests have been updated to take this constraint into consideration when generating test input data. Change-Id: I57d9888a74715e7145a5d9987d67891ef68f39b7
-
- 16 Oct, 2015 1 commit
-
-
Yaowu Xu authored
to make meaning of color_range obvious. Change-Id: I303582e448b82b3203b497e27b22601cc718dfff
-
- 09 Oct, 2015 2 commits
-
-
Alex Converse authored
single-threaded: swanky (silvermont): ~1% faster overall peppy (celeron,haswell): ~1.5% faster overall Change-Id: Ib74f014374c63c9eaf2d38191cbd8e2edcc52073
-
Alex Converse authored
Change-Id: Iccb4cdc23c1845cf9cb7d69101c9f4f43675d368
-
- 08 Oct, 2015 1 commit
-
-
Geza Lore authored
If high bit depth configuration is enabled, but encoding in profile 0, the code now falls back on optimized SSE2 assembler to compute the block errors, similar to when high bit depth is not enabled. Change-Id: I471d1494e541de61a4008f852dbc0d548856484f
-
- 07 Oct, 2015 1 commit
-
-
James Zern authored
some mingw32 configs define this. force this to be on to ensure the build succeeds Change-Id: I2cc490782b6a0736aa617e6a1457fc2bc984adbb
-
- 06 Oct, 2015 1 commit
-
-
James Zern authored
The serial decode check is too strict for tile-threaded decoding as there is no guarantee on the decode order nor which specific error will take precedence. Currently a tile-level error is not forwarded so the frame will simply be marked corrupt. Change-Id: I51cf1e39e44bedeac93746154b36a4ccb2f059b1
-
- 30 Sep, 2015 4 commits
-
-
Ronald S. Bultje authored
Change-Id: I936c2430c3c5b1e0ab5dec0a20110525e925b5e4
-
Ronald S. Bultje authored
Change-Id: I2000820e0c04de2c975d370a0cf7145330289bb2
-
James Zern authored
define NOMINMAX to allow the std:: versions to be used; min/max will be defined transitively via windows.h otherwise Change-Id: I692b03fa3e70b7a53962d3fd209498f70f712fed
-
Alex Converse authored
Change-Id: Iad73b490b171cdda5c368ada69fb8eab2a86c156
-
- 26 Sep, 2015 2 commits
-
-
Ronald S. Bultje authored
In the decoder, map this to the output variable vpx_image_t.r_w/h. This is intended as an improved version of VP9D_GET_DISPLAY_SIZE, which doesn't work with parallel frame decoding. In the encoder, map this to a codec control func (VP9E_SET_RENDER_SIZE) that takes a w/h pair argument in a int[2] (identical to VP9D_GET_DISPLAY_SIZE). Also add render_size to the encoder_param_get_to_decoder unit test. See issue 1030. Change-Id: I12124c13602d832bf4c44090db08c1009c94c7e8
-
Angie Chiang authored
comment out fdct32 remove fdct32 test Change-Id: I31c47fb435377465cd3265e39621ca50d3aae656
-
- 25 Sep, 2015 1 commit
-
-
Alex Converse authored
Fails with Icac63051bf37c7355e661837b57c257d58c764fc reverted. Change-Id: I460d7a5a74faa4daace25f911f8dc5f68e16c951
-
- 24 Sep, 2015 1 commit
-
-
James Zern authored
rename Decode[2-4] to something more precise Change-Id: I68c4f189796eb11ac1a5b7b682f24efb71708187
-