Use low precision 32x32fdct for encodemb in speed1
The low precision 32x32 fdct has all the intermediate steps within 16-bit depth, hence allowing faster SSE2 implementation, at the expense of larger round-trip error. It was used in the rate-distortion optimization search loop only. Using the low precision version, in replace of the high precision one, affects the compression performance by about 0.7% (derf, stdhd) at speed 0. For speed 1, it makes derf set down by only 0.017%. Change-Id: I4e7d18fac5bea5317b91c8e7dabae143bc6b5c8b
Showing
- vp9/encoder/vp9_block.h 1 addition, 1 deletionvp9/encoder/vp9_block.h
- vp9/encoder/vp9_encodeframe.c 2 additions, 2 deletionsvp9/encoder/vp9_encodeframe.c
- vp9/encoder/vp9_encodemb.c 2 additions, 2 deletionsvp9/encoder/vp9_encodemb.c
- vp9/encoder/vp9_onyx_if.c 5 additions, 0 deletionsvp9/encoder/vp9_onyx_if.c
- vp9/encoder/vp9_onyx_int.h 1 addition, 0 deletionsvp9/encoder/vp9_onyx_int.h
Loading
Please register or sign in to comment