Speed up h_predictor_8x8
Relocate the function from SSSE3 to SSE2, Unroll loop from 4 to 2, and reduce mem access to left. Speed up by >20% in ./test_intra_pred_speed. Change-Id: Ib9f1846819783b6e05e2a310c930eb844b2b4d2e
Showing
- test/test_intra_pred_speed.cc 6 additions, 5 deletionstest/test_intra_pred_speed.cc
- vpx_dsp/vpx_dsp_rtcd_defs.pl 1 addition, 1 deletionvpx_dsp/vpx_dsp_rtcd_defs.pl
- vpx_dsp/x86/intrapred_sse2.asm 23 additions, 0 deletionsvpx_dsp/x86/intrapred_sse2.asm
- vpx_dsp/x86/intrapred_ssse3.asm 0 additions, 18 deletionsvpx_dsp/x86/intrapred_ssse3.asm
Loading
Please register or sign in to comment