SSE2 optimizations for _6/_16 lowbd lpf functions
Includes vertical and horizontal implementations, along with fixes for
5/13 TAPs/Parallel deblocking support. Reworks the filter internals for
better reuse across different sizes. Tests are enabled.

Performance changes, SSE2 over C:
Horizontal methods: up to 3-4x
Vertical methods: up to 1.5x-2x

Change-Id: I2e36035355d8c23c1d4b0d59d0e23f598e9d0e3f