    Lowbd D207E/D63E/D45E intrapred x86 optimization · ae676953
    Yi Luo authored
    D207E
    Block size SSE2 vs C
    4x4        ~2.6X
    4x8        ~2.5X
    8x4        ~8.0X
    8x8        ~9.1X
    8x16       ~11.7X
    16x8       ~16.9X
    16x16      ~17.3X
    16x32      ~17.2X
    32x16      ~30.2X
    32x32      ~35.5X
    
    D63E
    Block size SSE2 vs C
    4x4        ~4.7X
    4x8        ~4.9X
    8x4        ~7.8X
    8x8        ~8.9X
    8x16       ~9.3X
    16x8       ~15.7X
    16x16      ~14.7X
    16x32      ~17.3X
    32x16      ~18.0X
    32x32      ~15.7X
    
    D45E
    Block size SSSE3 vs C
    4x4        ~1.8X
    4x8        ~2.9X
    8x4        ~6.7X
    8x8        ~6.5X
    8x16       ~7.4X
    16x8       ~24.4X
    16x16      ~21.5X
    16x32      ~24.2X
    32x16      ~25.4X
    32x32      ~25.2X
    
    Change-Id: I8215de190e2b6314272749761600e389d1ca0fdf
test_intra_pred_speed.cc 64.4 KB