  1. 20 May, 2013 1 commit
    • WIP: 4x4 idct/recon merge · ba48a111
      Scott LaVarnway authored
      This patch eliminates the intermediate diff buffer usage by
      combining the short idct and the add residual into one function.
      The encoder can use the same code as well.
      
      Change-Id: I296604bf73579c45105de0dd1adbcc91bcc53c22
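The idea behind the idct/recon merges can be sketched in C as follows. This is a minimal illustration under stated assumptions, not the libvpx code: `toy_inverse_1d` is a 4-point Hadamard butterfly standing in for the real idct, and the names `toy_idct4x4`, `add_residual4x4`, and `toy_idct4x4_add` are hypothetical. The two-step path writes the residual to a `diff` buffer and adds it afterwards; the fused path adds each reconstructed value straight into `dest`.

```c
#include <stdint.h>

/* Clamp a reconstructed sample to the 8-bit pixel range. */
static uint8_t clip_pixel(int v) {
  return (uint8_t)(v < 0 ? 0 : (v > 255 ? 255 : v));
}

/* Toy 4-point inverse butterfly (Hadamard); NOT the real VP9 idct. */
static void toy_inverse_1d(const int in[4], int out[4]) {
  const int a = in[0] + in[2];
  const int b = in[0] - in[2];
  const int c = in[1] - in[3];
  const int d = in[1] + in[3];
  out[0] = a + d;
  out[1] = b + c;
  out[2] = b - c;
  out[3] = a - d;
}

/* Row pass shared by both paths: transform each row of coefficients. */
static void toy_row_pass(const int16_t *input, int tmp[4][4]) {
  int r, c;
  for (r = 0; r < 4; ++r) {
    int in[4];
    for (c = 0; c < 4; ++c) in[c] = input[r * 4 + c];
    toy_inverse_1d(in, tmp[r]);
  }
}

/* Two-step path, step 1: inverse transform into an intermediate diff buffer. */
void toy_idct4x4(const int16_t *input, int16_t *diff) {
  int tmp[4][4], col_in[4], col_out[4], r, c;
  toy_row_pass(input, tmp);
  for (c = 0; c < 4; ++c) {
    for (r = 0; r < 4; ++r) col_in[r] = tmp[r][c];
    toy_inverse_1d(col_in, col_out);
    for (r = 0; r < 4; ++r) diff[r * 4 + c] = (int16_t)col_out[r];
  }
}

/* Two-step path, step 2: add the stored residual to the predictor. */
void add_residual4x4(const int16_t *diff, uint8_t *dest, int stride) {
  int r, c;
  for (r = 0; r < 4; ++r)
    for (c = 0; c < 4; ++c)
      dest[r * stride + c] = clip_pixel(dest[r * stride + c] + diff[r * 4 + c]);
}

/* Fused path: same arithmetic, but results go straight into dest,
 * so no diff buffer is written or read back. */
void toy_idct4x4_add(const int16_t *input, uint8_t *dest, int stride) {
  int tmp[4][4], col_in[4], col_out[4], r, c;
  toy_row_pass(input, tmp);
  for (c = 0; c < 4; ++c) {
    for (r = 0; r < 4; ++r) col_in[r] = tmp[r][c];
    toy_inverse_1d(col_in, col_out);
    for (r = 0; r < 4; ++r) {
      uint8_t *p = dest + r * stride + c;
      *p = clip_pixel(*p + col_out[r]);
    }
  }
}
```

Both paths produce the same pixels; the fused one only skips the store to and reload from the intermediate buffer, which is the saving the commit message describes.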
  2. 16 May, 2013 1 commit
    • WIP: 8x8 idct/recon merge · 794a7bed
      Scott LaVarnway authored
      This patch eliminates the intermediate diff buffer usage by
      combining the short idct and the add residual into one function.
      The encoder can use the same code as well.
      
      Change-Id: Iacfd57324fbe2b7beca5d7f3dcae25c976e67f45
  3. 15 May, 2013 1 commit
    • WIP: 16x16 idct/recon merge · a272ff25
      Scott LaVarnway authored
      This patch eliminates the intermediate diff buffer usage by
      combining the short idct and the add residual into one function.
      The encoder can use the same code as well.
      
      Change-Id: Iea7976b22b1927d24b8004d2a3fddae7ecca3ba1
  4. 14 May, 2013 1 commit
    • WIP: 32x32 idct/recon merge · 2cf0d4be
      Scott LaVarnway authored
      This patch eliminates the intermediate diff buffer usage by
      combining the short idct and the add residual into one function.
      The encoder can use the same code as well.
      
      Change-Id: I4ea09df0e162591e420d869b7431c2e7f89a8c1a
  5. 26 Apr, 2013 1 commit
    • Normalize more intrinsic filenames · 863601c5
      Johann authored
      vp9_dequantize_x86 has only sse2 functions.
      
      vp9_dct_sse2_intrinsics has no namespace collision and can drop
      _intrinsics.
      
      vp9_idct_mmx.h is unused.
      
      Change-Id: Ic16e31fb372a1d1e841a62ecb4189fe8f95808ec
  6. 16 Apr, 2013 1 commit
  7. 07 Mar, 2013 3 commits
    • Optimize add_constant_residual function · f2407826
      Yunqing Wang authored
      Optimized adding a constant diff to the predictor, which gave
      about a 2% decoder performance gain.
      
      Change-Id: I47db20c31428e8c4a8f16214a85cbe386a6e9303
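For reference, adding a constant diff to a predictor block can be sketched in plain C as below. This is a scalar illustration only; the name `add_constant_residual_sketch` and its parameters are hypothetical, and the optimized function in the commit is not reproduced here. The case arises when the residual is the same value for every pixel of the block, so a single value is clamped into each predictor pixel.

```c
#include <stdint.h>

/* Clamp a reconstructed sample to the 8-bit pixel range. */
static uint8_t clip_pixel(int v) {
  return (uint8_t)(v < 0 ? 0 : (v > 255 ? 255 : v));
}

/* Hypothetical helper: add one constant residual to every pixel of a
 * width x height block stored with the given stride. */
void add_constant_residual_sketch(int diff, uint8_t *dest, int stride,
                                  int width, int height) {
  int r, c;
  for (r = 0; r < height; ++r) {
    for (c = 0; c < width; ++c) {
      dest[c] = clip_pixel(dest[c] + diff);
    }
    dest += stride; /* advance to the next row of the block */
  }
}
```

Because the residual is a single value, a vectorized version has no per-pixel residual to load, which is one plausible source of the speedup.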
    • Allocate 16-byte aligned diff buffer · b339aea6
      Yunqing Wang authored
      This was done based on John's suggestion.
      
      Change-Id: I62516a513c31fe3dbea0d6cd063df79d9e819ec8
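A 16-byte aligned buffer matters because aligned SSE2 loads such as `_mm_load_si128` require a 16-byte aligned address. One way to obtain such a buffer is sketched below, assuming a C11 toolchain. It uses the standard `aligned_alloc` rather than libvpx's own allocation helpers, and the names `alloc_diff_buffer` and `is_aligned16` are hypothetical.

```c
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical helper: allocate `count` int16_t residuals on a 16-byte
 * boundary. The caller releases the buffer with free(). */
int16_t *alloc_diff_buffer(size_t count) {
  size_t bytes = count * sizeof(int16_t);
  /* aligned_alloc expects the size to be a multiple of the alignment. */
  bytes = (bytes + 15) & ~(size_t)15;
  if (bytes == 0) bytes = 16;
  return (int16_t *)aligned_alloc(16, bytes);
}

/* Returns 1 when p sits on a 16-byte boundary, 0 otherwise. */
int is_aligned16(const void *p) {
  return ((uintptr_t)p & 15) == 0;
}
```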
    • Fix issue in add_residual intrinsic function · 31623715
      Yunqing Wang authored
      Yaowu found that this function failed to compile with MSVC because
      it used _mm_storel_pi((__m64 *)(dest + 0 * stride), (__m128)p0).
      To be safe, changed back to an integer store instruction.

      Also, in some builds diff is not always 16-byte aligned, so the
      code was changed to handle that.
      
      Change-Id: I9995e5446af15dad18f3c5c0bad1ae68abef6c0d
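The store and alignment fix described above might look like the following sketch. It handles one row of eight pixels; the function name `add_residual_row8` is hypothetical and the actual libvpx function is not reproduced here. `_mm_storel_epi64` is an integer-domain 64-bit store, so no cast from `__m128i` to `__m128` is needed, and `_mm_loadu_si128` does not require `diff` to be 16-byte aligned. A scalar fallback is included (an addition for portability) so the block also builds without SSE2.

```c
#include <stdint.h>

#if defined(__SSE2__) || defined(_M_X64)
#include <emmintrin.h> /* SSE2 intrinsics */
#define SKETCH_HAVE_SSE2 1
#else
#define SKETCH_HAVE_SSE2 0
#endif

/* Add eight int16 residuals to eight uint8 predictor pixels in place. */
void add_residual_row8(const int16_t *diff, uint8_t *dest) {
#if SKETCH_HAVE_SSE2
  const __m128i zero = _mm_setzero_si128();
  /* Unaligned load: diff may not sit on a 16-byte boundary. */
  const __m128i d = _mm_loadu_si128((const __m128i *)diff);
  /* Load 8 predictor bytes and widen them to 16-bit lanes. */
  __m128i p = _mm_loadl_epi64((const __m128i *)dest);
  p = _mm_unpacklo_epi8(p, zero);
  /* Saturating 16-bit add, then pack back to bytes with 0..255 clamping. */
  p = _mm_adds_epi16(p, d);
  p = _mm_packus_epi16(p, p);
  /* Integer store of the low 64 bits; no float-vector cast involved. */
  _mm_storel_epi64((__m128i *)dest, p);
#else
  int i;
  for (i = 0; i < 8; ++i) {
    const int v = dest[i] + diff[i];
    dest[i] = (uint8_t)(v < 0 ? 0 : (v > 255 ? 255 : v));
  }
#endif
}
```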
  8. 06 Mar, 2013 1 commit
    • Optimize add_residual function · 943c6d71
      Yunqing Wang authored
      Optimized adding the diff to the predictor, which gave a 0.8%
      decoder performance gain.
      
      Change-Id: Ic920f0baa8cbd13a73fa77b7f9da83b58749f0f8