Port SIMD optimization for obmc blending functions to av1
SIMD optimization for 1d blending functions in obmc mode, and some code refactoring and cleanup. (ped_1080p25.y4m, 150 frame, 2000 tb) Encoding time overhead: +18.8% -> +18.1% Decoding time overhead: +21.3% -> +8.7% Change-Id: I9d856c32136e7e0e6e24ab5520ef901d7b1ee9c8
parent
863b0499
No related branches found
No related tags found
Showing
- aom_dsp/aom_dsp.mk 12 additions, 0 deletionsaom_dsp/aom_dsp.mk
- aom_dsp/aom_dsp_common.h 16 additions, 0 deletionsaom_dsp/aom_dsp_common.h
- aom_dsp/aom_dsp_rtcd_defs.pl 19 additions, 0 deletionsaom_dsp/aom_dsp_rtcd_defs.pl
- aom_dsp/blend.h 42 additions, 0 deletionsaom_dsp/blend.h
- aom_dsp/blend_a64_hmask.c 70 additions, 0 deletionsaom_dsp/blend_a64_hmask.c
- aom_dsp/blend_a64_mask.c 144 additions, 0 deletionsaom_dsp/blend_a64_mask.c
- aom_dsp/blend_a64_vmask.c 72 additions, 0 deletionsaom_dsp/blend_a64_vmask.c
- aom_dsp/x86/blend_a64_hmask_sse4.c 36 additions, 0 deletionsaom_dsp/x86/blend_a64_hmask_sse4.c
- aom_dsp/x86/blend_a64_mask_sse4.c 924 additions, 0 deletionsaom_dsp/x86/blend_a64_mask_sse4.c
- aom_dsp/x86/blend_a64_vmask_sse4.c 285 additions, 0 deletionsaom_dsp/x86/blend_a64_vmask_sse4.c
- aom_dsp/x86/blend_sse4.h 146 additions, 0 deletionsaom_dsp/x86/blend_sse4.h
- aom_dsp/x86/synonyms.h 70 additions, 0 deletionsaom_dsp/x86/synonyms.h
- av1/common/reconinter.c 52 additions, 155 deletionsav1/common/reconinter.c
- av1/common/reconinter.h 5 additions, 8 deletionsav1/common/reconinter.h
- av1/encoder/rdopt.c 3 additions, 3 deletionsav1/encoder/rdopt.c
- test/blend_a64_mask_1d_test.cc 334 additions, 0 deletionstest/blend_a64_mask_1d_test.cc
- test/blend_a64_mask_test.cc 270 additions, 0 deletionstest/blend_a64_mask_test.cc
- test/function_equivalence_test.h 62 additions, 0 deletionstest/function_equivalence_test.h
- test/test.mk 3 additions, 0 deletionstest/test.mk
Loading
Please register or sign in to comment