• levytamar82's avatar
    AVX2 SAD Optimization: · 0fa8b668
    levytamar82 authored
    2 functions were optimized for avx2 by using full 256 bit register
    In order to handle 32 elements in parallel instead of only 16 in parallel:
    1. vp9_sad32x32x4d
    2. vp9_sad64x64x4d
    
    The function level gain is 66% and the user level gain is ~1%.
    
    Change-Id: I4efbb3bc7d8bc03b64b6c98f5cd5c4a9dd3212cb
    0fa8b668
vp9cx.mk 4.98 KB