    Use low precision 32x32 fdct for encodemb in speed 1 · debb9c68
    Jingning Han authored
    The low precision 32x32 fdct keeps all of its intermediate steps within
    16-bit depth, which allows a faster SSE2 implementation at the
    expense of a larger round-trip error. Until now it was used only in the
    rate-distortion optimization search loop.
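
A toy sketch of the trade-off described above, not the libvpx transform itself: a single 2-point butterfly in which the low precision path scales its intermediates down by two bits so they fit in 16 bits, and pays for it with a small round-trip error. All function names here are hypothetical and exist only for illustration.

```c
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical helper: divide by 4 with rounding to nearest, written with
 * division so that negative inputs do not rely on arithmetic right shift. */
static int32_t round_div4(int32_t v) {
  return v >= 0 ? (v + 2) / 4 : -((-v + 2) / 4);
}

/* High precision path: intermediates stay in 32 bits, so the forward
 * butterfly followed by its inverse is exact. Returns the max abs error. */
static int roundtrip_error_hp(int16_t a, int16_t b) {
  int32_t s = (int32_t)a + b;   /* may need 17 bits */
  int32_t d = (int32_t)a - b;   /* may need 17 bits */
  int32_t a2 = (s + d) / 2;     /* s + d == 2a, always even */
  int32_t b2 = (s - d) / 2;     /* s - d == 2b, always even */
  int ea = abs(a2 - a), eb = abs(b2 - b);
  return ea > eb ? ea : eb;
}

/* Low precision path: intermediates are rounded down by two bits so they
 * fit in int16_t, which discards low-order information. */
static int roundtrip_error_lp(int16_t a, int16_t b) {
  int16_t sq = (int16_t)round_div4((int32_t)a + b);  /* fits in 16 bits */
  int16_t dq = (int16_t)round_div4((int32_t)a - b);  /* fits in 16 bits */
  int32_t s = (int32_t)sq * 4;  /* undo the scaling */
  int32_t d = (int32_t)dq * 4;
  int32_t a2 = (s + d) / 2;
  int32_t b2 = (s - d) / 2;
  int ea = abs(a2 - a), eb = abs(b2 - b);
  return ea > eb ? ea : eb;
}
```

In this toy the high precision path always reconstructs its inputs exactly, while the low precision path can be off by up to 2 per sample. The real 32x32 transform has many more stages, so the figures here say nothing about its actual error.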
    Using the low precision version in place of the high precision one
    affects the compression performance by about 0.7% (derf, stdhd) at
    speed 0. At speed 1, the derf set goes down by only 0.017%.
    Change-Id: I4e7d18fac5bea5317b91c8e7dabae143bc6b5c8b