• Peng Bin's avatar
    Add inv txfm2d sse2 for sizes with 4 · 18976fa5
    Peng Bin authored
    Implement av1_lowbd_inv_txfm2d_add_4x4_sse2
    Implement av1_lowbd_inv_txfm2d_add_4x8_sse2
    Implement av1_lowbd_inv_txfm2d_add_8x4_sse2
    Implement av1_lowbd_inv_txfm2d_add_4x16_sse2
    Implement av1_lowbd_inv_txfm2d_add_16x4_sse2
    A brief speed test shows that using the included SSE2 functions
    completed by this CL, for speed1 lowbitdepth encoder speeds up >9%
    and lowbitdepth decoder speeds up >25%, comparing to the highbitdepth
    implementation in the baseline.
    Change-Id: I0576a2a146c0b1a7b483c9d35c3d21d979e263cd
av1_inv_txfm_sse2.c 89.6 KB