-
Angie Chiang authored
The speed performance for running 20k times is as follows Notice that the vp10_highbd_fdct#x#_sse2 version is 16-bit version plus range check The rest are 32-bit version vp10_fwd_txfm2d_4x4_c (2 ms) vp10_fwd_txfm2d_8x8_c (9 ms) vp10_fwd_txfm2d_16x16_c (45 ms) vp10_fwd_txfm2d_32x32_c (233 ms) vp10_fwd_txfm2d_4x4_sse4_1 (2 ms) vp10_fwd_txfm2d_8x8_sse4_1 (3 ms) vp10_fwd_txfm2d_16x16_sse4_1 (16 ms) vp10_fwd_txfm2d_32x32_sse4_1 (80 ms) vp10_highbd_fdct4x4_c (1 ms) vp10_highbd_fdct8x8_c (3 ms) vp10_highbd_fdct16x16_c (17 ms) highbd_fdct32x32_c (160 ms) vp10_highbd_fdct4x4_sse2 (0 ms) vp10_highbd_fdct8x8_sse2 (2 ms) vp10_highbd_fdct16x16_sse2 (8 ms) highbd_fdct32x32_sse2 (105 ms) Change-Id: I24daf1e0d4d66e91e4ce61ef71cefa7b70ee90ce
25520d8d