SSE2 high precision 32x32 forward DCT
Enable SSE2 implementation of high precision 32x32 forward DCT. The intermediate stacks are of 32-bits. The run-time goes down from 32126 cycles to 13442 cycles. Change-Id: Ib5ccafe3176c65bd6f2dbdef790bd47bbc880e56
Showing
- vp9/common/vp9_blockd.h 1 addition, 1 deletionvp9/common/vp9_blockd.h
- vp9/common/vp9_idct.h 3 additions, 0 deletionsvp9/common/vp9_idct.h
- vp9/common/vp9_rtcd_defs.sh 1 addition, 1 deletionvp9/common/vp9_rtcd_defs.sh
- vp9/encoder/vp9_rdopt.c 0 additions, 1 deletionvp9/encoder/vp9_rdopt.c
- vp9/encoder/x86/vp9_dct32x32_sse2.c 1244 additions, 1 deletionvp9/encoder/x86/vp9_dct32x32_sse2.c
- vp9/encoder/x86/vp9_dct_sse2.c 4 additions, 4 deletionsvp9/encoder/x86/vp9_dct_sse2.c
Loading
Please register or sign in to comment