1. 01 Sep, 2017 1 commit
  2. 23 May, 2017 1 commit
    • David Barker's avatar
      Vectorize high-precision convolve filter · 5d34e6a7
      David Barker authored
      Add SSE2 lowbd and SSSE3 highbd versions of the filters
      introduced in https://aomedia-review.googlesource.com/c/11962/ .
      
      These filters are equivalent in speed to the SSE2 implementations
      of the regular convolve filter. The average time to filter a
      64x64 block is:
      
      lowbd C: 52us
      lowbd SSE2: 5.6us
      highbd C: 53us
      highbd SSSE3: 5.8us
      
      Also add a correctness test based on the warp filter tests.
      
      Change-Id: Ia0d81100e8a414bbfc2b5f664d751cf24765299e
      5d34e6a7
  3. 04 May, 2017 1 commit
    • David Barker's avatar
      Add SSSE3 warp filter + const-ify warp filters · d8a423c6
      David Barker authored
      The SSSE3 filter is very similar to the SSE2 filter, but
      the horizontal pass is sped up by using the 8x8->16
      multiplies added in SSSE3.
      
      Also apply const-correctness to all versions of the filter
      
      The timings of the existing filters are unchanged, and the
      lowbd SSSE3 filter is ~17% faster than the lowbd SSE2 filter.
      
      Timings per 8x8 block:
      lowbd SSE2: 320ns
      lowbd SSSE3: 273ns
      highbd SSSE3: 300ns
      
      Filter output is unchanged.
      
      Change-Id: Ifb428a33b106d900cde1b080794796c0754ae182
      d8a423c6
  4. 12 Apr, 2017 1 commit
  5. 06 Apr, 2017 2 commits
  6. 04 Apr, 2017 1 commit
  7. 28 Mar, 2017 1 commit
  8. 13 Feb, 2017 1 commit
  9. 23 Jan, 2017 1 commit
    • David Barker's avatar
      Warp filter improvements · 13797462
      David Barker authored
      * The restriction on the parameter 'delta' was too strict, so we
        loosen it (delta only ever gets multiplied by -4, ... , 4,
        whereas beta gets multiplied by -7, ..., 7)
      * Correct a comment about the border clamping
      * Fix an issue with the test case
      
      Change-Id: I30e55203455ba6e419b5a8b646151a6d1fd5cc3b
      13797462
  10. 19 Jan, 2017 1 commit