1. 24 Jan, 2018 1 commit
    • Imdad Sardharwalla's avatar
      Added SSE4.1 and AVX2 implementations of FAST SGR. · 9d234571
      Imdad Sardharwalla authored
      The self-guided filter speed tests show that:
      - The SSE4.1 implementation of FAST SGR is ~35% faster than the corresponding
        implementation of SGR;
      - The AVX2 implementation of FAST SGR is ~28% faster than the corresponding
        implementation of SGR.
      
      Change-Id: Iecdc1f8cee79500084c71d06dbb02d804272aa99
      9d234571
  2. 17 Jan, 2018 1 commit
    • Imdad Sardharwalla's avatar
      Optimise self-guided restoration SIMD functions · f32dabd2
      Imdad Sardharwalla authored
      Improvements have been made to calc_ab for both the
      SSE4.1 and AVX2 versions of the self-guided filter.
      These result in an increase in the speed of between
      3% and 5% depending on the bit depth.
      
      Change-Id: I83a12ba452fcbb61cce5066801ae213e23c609cd
      f32dabd2
  3. 12 Jan, 2018 1 commit
    • Imdad Sardharwalla's avatar
      Added AVX2 implementation of self-guided filter · c6acc531
      Imdad Sardharwalla authored
      The self-guided filter has now been implemented using
      the intrinsics for AVX2. The corresponding speed and
      correctness tests have also been added.
      
      Note: All AVX2 functions are in synonyms_avx2.h, as
      GCC produces 'ABI change' warnings if they are
      included in synonyms.h.
      
      Change-Id: I2a283a4acf8c01ee835d5edc526abc242d87ad9b
      c6acc531