• David Barker's avatar
    Vectorize high-precision convolve filter · 5d34e6a7
    David Barker authored
    Add SSE2 lowbd and SSSE3 highbd versions of the filters
    introduced in https://aomedia-review.googlesource.com/c/11962/ .
    
    These filters are equivalent in speed to the SSE2 implementations
    of the regular convolve filter. The average time to filter a
    64x64 block is:
    
    lowbd C: 52us
    lowbd SSE2: 5.6us
    highbd C: 53us
    highbd SSSE3: 5.8us
    
    Also add a correctness test based on the warp filter tests.
    
    Change-Id: Ia0d81100e8a414bbfc2b5f664d751cf24765299e
    5d34e6a7
aom_dsp_rtcd_defs.pl 98.2 KB