1. 14 Mar, 2014 1 commit
  2. 24 Feb, 2014 1 commit
    • Erik de Castro Lopo's avatar
      Don't use intrinsics when they are slower. · cf0e42ae
      Erik de Castro Lopo authored
      More thorough en-/decoding tests show that sometimes the functions
      that use intrinsics are slower (or not really faster) than old
      plain C functions.
      
      After this patch the encoder doesn't use these new functions
      when their usefulness is questionable.
      
      Patch-from: lvqcl <lvqcl.mail@gmail.com>
      cf0e42ae
  3. 01 Feb, 2014 1 commit
  4. 31 Jan, 2014 1 commit
    • Erik de Castro Lopo's avatar
      Add a fast shift for int64 values. · 4618512d
      Erik de Castro Lopo authored
      This patch changes the code from:
      	(FLAC__int32)(xmm.m128i_i64[0] >> lp_quantization)
      into:
      	_mm_cvtsi128_si32(_mm_srli_epi64(xmm, lp_quantization));
      
      Encoding of 24-bit .wav files with 32-bit FLAC became noticeably faster.
      
      Patch-from: lvqcl <lvqcl.mail@gmail.com>
      4618512d
  5. 30 Jan, 2014 1 commit
  6. 07 Jan, 2014 1 commit
    • Erik de Castro Lopo's avatar
      libFLAC : Add asm versions for two _wide() functions. · 7e927893
      Erik de Castro Lopo authored
      GCC generates slow ia32 code for FLAC__lpc_restore_signal_wide() and
      FLAC__lpc_compute_residual_from_qlp_coefficients_wide() so 24-bit
      encoding/decoding is slower for GCC compile than for MSVS or ICC
      compile. This patch adds ia32 asm versions of these functions.
      
      Patch-from: lvqcl <lvqcl.mail@gmail.com>
      7e927893
  7. 03 Oct, 2013 1 commit
    • Erik de Castro Lopo's avatar
      Improve x86 instrinsic implementation. · ecd0acba
      Erik de Castro Lopo authored
      * Splits lpc_x86intrin.c to lpc_intrin_sse.c and lpc_intrin_sse2.c
      * Add FLAC__lpc_compute_residual_from_qlp_coefficients_intrin_sse2()
        function to lpc_intrin_sse2.c
      * Add lpc_intrin_sse41.c with two ..._wide_intrin_sse41() functions
        (useful for 24-bit en-/decoding)
      * Add precompute_partition_info_sums_intrin_sse2() / ...ssse3() and
        disables precompute_partition_info_sums_32bit_asm_ia32_().
        SSE2 version uses 4 SSE2 instructions instead of 1 SSSE3 instruction
        PABSD so it is slightly slower.
      
      Patch-from: lvqcl <lvqcl.mail@gmail.com>
      ecd0acba
  8. 25 Sep, 2013 1 commit
  9. 06 Jun, 2013 1 commit
  10. 26 May, 2013 1 commit
  11. 25 May, 2013 1 commit
  12. 05 Apr, 2013 1 commit
  13. 29 Mar, 2013 2 commits
  14. 14 Mar, 2013 1 commit
  15. 06 Mar, 2013 1 commit
  16. 22 Jun, 2012 4 commits
  17. 05 Apr, 2012 1 commit
  18. 04 Apr, 2012 1 commit
  19. 04 Feb, 2012 2 commits
  20. 07 Jan, 2009 2 commits
  21. 28 Feb, 2008 1 commit
  22. 13 Sep, 2007 1 commit
  23. 11 Sep, 2007 2 commits
  24. 29 Aug, 2007 2 commits
  25. 23 Jul, 2007 1 commit
  26. 16 Jul, 2007 1 commit
  27. 10 Jul, 2007 1 commit
  28. 07 Jul, 2007 1 commit
  29. 16 Jun, 2007 1 commit
  30. 22 Mar, 2007 1 commit
  31. 14 Mar, 2007 1 commit
  32. 13 Mar, 2007 1 commit