Skip to content
Snippets Groups Projects
  1. Sep 30, 2010
  2. Sep 29, 2010
  3. Sep 28, 2010
    • Fritz Koenig's avatar
      Optimizations on the loopfilters. · 0964ef0e
      Fritz Koenig authored
      - Scheduling for Atom processors
      - Combining of macros to allow for better interleaving
      - Change from multiplies to adds for main filter
      - Use of movhps/movlps to fill xmm registers without
        shifting and orring
      
      Change-Id: I0b3500a5f58abf7085253ec92d64c8a96723040b
      0964ef0e
    • Adrian Grange's avatar
      Enabled AltRef motion map creation · 47fc8f26
      Adrian Grange authored
      Enabled the first-pass encode to output the
      map of macroblock coding modes required by
      the AltRef filter.
      47fc8f26
    • Adrian Grange's avatar
    • Adrian Grange's avatar
      Made AltRef filter adaptive & added motion compensation · 1b2f8308
      Adrian Grange authored
      Modified AltRef temporal filter to adapt filter length based
      on macroblock coding modes selected during first-pass
      encode.
      
      Also added sub-pixel motion compensation to the AltRef
      filter.
      1b2f8308
    • Johann Koenig's avatar
      Merge "update gitignore" · e4d43c21
      Johann Koenig authored
      e4d43c21
    • Johann Koenig's avatar
      update gitignore · 6fa5c24a
      Johann Koenig authored
      this was excluding all .asm files when it should have just been .asm
      files in the top level directory and .asm.s files lower down. also be
      more restrictive on some other items, and run the whole thing through
      sort to keep it organized
      
      Change-Id: Ia48525033226b13098a491ce89465d0377b990c2
      6fa5c24a
    • Timothy B. Terriberry's avatar
      Add 4-tap version of 2nd-pass ARMv6 MC filter. · 18dc92fd
      Timothy B. Terriberry authored
      The existing code applied a 6-tap filter with 0's on either end.
      We're already paying the branch penalty to avoid computing the two
       extra columns needed as input to this filter.
      We might as well save time computing the filter as well.
      This reduces the inner loop from 21 instructions to 16, the number
       of loads per iteration from 4 to 1, and the number of multiplies
       from 7 to 4.
      The gain in overall decoding performance, however, is small (less
       than 1%).
      
      This change also means we now valgrind clean on ARMv6, which is
       its real purpose.
      The errors reported here were valgrind's fault (it does not detect
       that 0 times an uninitialized value is initialized), but Julian
       Seward says it would slow down valgrind considerably to make such
       checks.
      Speeding up libvpx rather, even by a small amount, seems a much
       better idea if only to enable proper valgrind checking of the
       rest of the codec.
      
      Change-Id: Ifb376ea195e086b60f61daf1097d8910c4d8ff16
      18dc92fd
  4. Sep 27, 2010
  5. Sep 24, 2010
  6. Sep 23, 2010
  7. Sep 22, 2010
    • Johann Koenig's avatar
      Remove dead code · 7fed3832
      Johann Koenig authored
      The new loopfilter was originally introduced as an experimental change.
      It's permanent now.
      
      Change-Id: I25dbedb6ceff3e9f9c04e18bb29f84c3ecb7e546
      7fed3832
  8. Sep 21, 2010
Loading