Speed up h_predictor_16x16
Relocate the function from SSSE3 to SSE2, unroll the loop from 8 iterations to 4, and reduce memory accesses to left. This gives a speedup of more than 20% in ./test_intra_pred_speed.

Change-Id: Ie48229c2e32404706b722442942c84983bda74cc
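For readers unfamiliar with the predictor, here is a minimal C sketch of the idea using SSE2 intrinsics. It is not the assembly from this change: the function names are hypothetical, and the unpack-plus-pshufd broadcast is an assumption about how a byte can be replicated without the SSSE3 pshufb instruction. Each of the 4 iterations does one 32-bit load from left and writes four 16-byte rows.

```c
#include <emmintrin.h> /* SSE2 intrinsics */
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Scalar reference: every pixel in row r takes the value left[r]. */
static void h_pred_16x16_ref(uint8_t *dst, ptrdiff_t stride,
                             const uint8_t *left) {
  for (int r = 0; r < 16; ++r) memset(dst + r * stride, left[r], 16);
}

/* Hypothetical SSE2 sketch: 4 iterations, 4 rows per iteration. */
static void h_pred_16x16_sse2_sketch(uint8_t *dst, ptrdiff_t stride,
                                     const uint8_t *left) {
  for (int i = 0; i < 4; ++i) {
    int32_t four;
    memcpy(&four, left, sizeof(four)); /* one load covers 4 left pixels */
    __m128i v = _mm_cvtsi32_si128(four);
    v = _mm_unpacklo_epi8(v, v);  /* each byte duplicated: l0 l0 l1 l1 ... */
    v = _mm_unpacklo_epi16(v, v); /* each byte x4: dword k holds left[k]  */
    /* Broadcast one dword across the register, then store one row. */
    _mm_storeu_si128((__m128i *)(dst + 0 * stride), _mm_shuffle_epi32(v, 0x00));
    _mm_storeu_si128((__m128i *)(dst + 1 * stride), _mm_shuffle_epi32(v, 0x55));
    _mm_storeu_si128((__m128i *)(dst + 2 * stride), _mm_shuffle_epi32(v, 0xaa));
    _mm_storeu_si128((__m128i *)(dst + 3 * stride), _mm_shuffle_epi32(v, 0xff));
    dst += 4 * stride;
    left += 4;
  }
}
```

The sketch uses unaligned stores so it works with any dst pointer and stride; whether the real routine can assume alignment is not stated in this commit message.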
Showing 4 changed files:
- test/test_intra_pred_speed.cc: 2 additions, 2 deletions
- vpx_dsp/vpx_dsp_rtcd_defs.pl: 1 addition, 1 deletion
- vpx_dsp/x86/intrapred_sse2.asm: 24 additions, 0 deletions
- vpx_dsp/x86/intrapred_ssse3.asm: 0 additions, 18 deletions