Optimize wedge partition selection.
We can optimize wedge partition selection by pre-computing the residuals of the 2 underlying predictors, and then blend these to compute the sse of the compound predictor, without actually having to compute and subtract the compound predictor. Similarly we can pre-compute a proxy array which we can use to cheaply check which mask sign would have lower sse. Details are in wedge_utils.c. Mathematically these are equivalence transformations, but due to the finite precision the encoder output will be perturbed, though on average this should make 0% difference. ext-inter gains about ~4.5% speedup. Change-Id: Ib2657c3209ae161b4090b58b4b6c392641bf2792
Showing
- test/test.mk 1 addition, 0 deletionstest/test.mk
- test/vp10_wedge_utils_test.cc 399 additions, 0 deletionstest/vp10_wedge_utils_test.cc
- vp10/common/reconinter.c 0 additions, 8 deletionsvp10/common/reconinter.c
- vp10/common/reconinter.h 0 additions, 1 deletionvp10/common/reconinter.h
- vp10/common/vp10_rtcd_defs.pl 9 additions, 0 deletionsvp10/common/vp10_rtcd_defs.pl
- vp10/encoder/rdopt.c 226 additions, 111 deletionsvp10/encoder/rdopt.c
- vp10/encoder/wedge_utils.c 135 additions, 0 deletionsvp10/encoder/wedge_utils.c
- vp10/encoder/x86/wedge_utils_sse2.c 260 additions, 0 deletionsvp10/encoder/x86/wedge_utils_sse2.c
- vp10/vp10cx.mk 4 additions, 0 deletionsvp10/vp10cx.mk
Loading
Please register or sign in to comment