Skip to content
Snippets Groups Projects
  1. Sep 27, 2010
  2. Sep 24, 2010
  3. Sep 23, 2010
  4. Sep 22, 2010
    • Johann Koenig's avatar
      Remove dead code · 7fed3832
      Johann Koenig authored
      The new loopfilter was originally introduced as an experimental change.
      It's permanent now.
      
      Change-Id: I25dbedb6ceff3e9f9c04e18bb29f84c3ecb7e546
      7fed3832
  5. Sep 21, 2010
  6. Sep 20, 2010
  7. Sep 17, 2010
    • Johann Koenig's avatar
      reorder data to use wider instructions · 022323bf
      Johann Koenig authored
      the previous commit laid the groundwork by doing two sets of idcts
      together. this moved that further by grouping the interesting data
      (q[0], q+16[0]) together to allow using wider instructions. also
      managed to drop a few instructions by recognizing that the constant
      for sinpi8sqrt2 could be downshifted all the time which avoided a
      dowshift as well as workarounds for a function which only accepted
      signed data
      
      looks like a modest gain for performance: at qcif, went from ~180
      fps to ~183
      Change-Id: I842673f3080b8239e026cc9b50346dbccbab4adf
      022323bf
    • Yunqing Wang's avatar
      Restructure multi-threaded decoder · f857a850
      Yunqing Wang authored
      On each MB, loopfiltering is done right after MB decoding. This
      combines two loops in multi-threaded code into one, which reduces
      number of synchronizations to half.
      
      The above-row/left-col data are saved in temp buffers for
      next-row/next MB decoding.
      
      Tests on 4-core gLucid machine showed 10% decoder performance
      gain with threads=4 (tulip clip). Testing on other platforms
      isn't done yet.
      
      Change-Id: Id18ea7c1e84965dabea65d4c01ca5bc056ddeac9
      f857a850
  8. Sep 16, 2010
    • John Koleszar's avatar
      cleanup: remove unused xprintf · 9100073e
      John Koleszar authored
      These files aren't currently used, and we can get them back if we
      need them.
      
      Change-Id: I62aa3bff828e491a80c80eeb84a7c44903df29b5
      9100073e
    • John Koleszar's avatar
      Reduce size of tokenizer tables · 147b125b
      John Koleszar authored
      This patch reduces the size of the global tables maintained by the
      tokenizer to 16k from 80k-96k. See issue #177.
      
      Change-Id: If0275d5f28389af11ac83c5d929d1157cde90fbe
      147b125b
  9. Sep 15, 2010
    • Fritz Koenig's avatar
      Modify GET_GOT macro for performance. · 746439ef
      Fritz Koenig authored
      GET_GOT was producing a zero length call.  This resulted in
      pipeline flushes occuring when returing from the assembly
      functions.  Masked on out of order cores, but evident on
      Atom cores.
      
      Change-Id: I8c375af313e8a169c77adbaf956693c0cfeb5ccd
      746439ef
  10. Sep 14, 2010
    • Fritz Koenig's avatar
      Removed unnecessary pxor. · 769f2424
      Fritz Koenig authored
      There is no need to make sure that the lower byte of the
      register is 0 because the downshift by 11 overwrites that byte.
      
      Change-Id: I89cbf004b2ff532a2c68e0dc399c45a49cdad5a1
      769f2424
Loading