Skip to content
  • James Yu's avatar
    VP8 for ARMv8 by using NEON intrinsics 06 · 81ad047e
    James Yu authored
    
    
    Add idct_dequant_full_2x_neon.c
    - idct_dequant_full_2x_neon
    
    ==== Summary of apply VP8 decode patch series ====
    Benchmark on Samsung Chromebook, Cortex-A15, 1.7GHz, Dual core
    Toolchain: linaro-1.13.1-4.8-2014.01
    Compile argument: CROSS=arm-linux-gnueabihf- ../libvpx/configure
                         --target=armv7-linux-gcc --prefix=$HOME/out
                         --enable-shared --cpu=cortex-a7
    Test argument: vpxdec --summary --noblit ./tears_of_steel_1080p.webm
    
    NEON assembly   46.68 (fps)
    Apply patch 06  46.65, -0.03
    Apply patch 07  46.86, +0.21
    Apply patch 08  46.58, -0.28
    Apply patch 09  46.57, -0.01
    Apply patch 10  46.51, -0.06
    Apply patch 11  46.13, -0.38
    Apply patch 12  45.42, -0.71
    Apply patch 13  46.06, +0.64
    Apply patch 14  45.19, -0.87
    Apply patch 15  45.93, +0.74
    Apply patch 16  45.48, -0.45
    Apply patch 17  45.84, +0.36
    Apply patch 18  45.91, +0.07  <= With all NEON intrinsics patches
                     Total -0.77 fps, 1.65% performance regression
    
    Change-Id: I77bfc9eaccfb97b8d401e949ceff8795e26ca6b7
    Signed-off-by: default avatarJames Yu <james.yu@linaro.org>
    81ad047e