Commits · 5a29ca93acc1b165498e1d8938d7bdfdf89c167a · Xiph.Org / Opus

Feb 13, 2025
- Preserving 24-bit accuracy for fixed-point encoder · 82485dd2
  Jean-Marc Valin authored 9 months ago
  
  82485dd2
- Preserving 24-bit accuracy for fixed-point decoder · a4854afa
  Jean-Marc Valin authored 9 months ago
  
  Convert to 16 bits only at the very end
  a4854afa
Feb 12, 2025
- C90 fixes: Removing declarations after statement · 6f9cb7aa
  Jean-Marc Valin authored 2 months ago
  
  6f9cb7aa
Jan 27, 2025
- Fix run-time warning in NSQ_del_dec_avx2() · 140e0982
  Jean-Marc Valin authored 7 months ago
  
  140e0982
Sep 11, 2024
- fix spelling error in docs · ff6dea5e
  Ralph Giles authored 6 months ago
  
  Address the same issue in the mips code.
  ff6dea5e
- Rename `Intsafe.h` to `intsafe.h` for case-sensitive OS · 1884ec0f
  Nam Se Hyun authored 6 months ago and Tristan Matthews committed 6 months ago
  
  Signed-off-by: Tristan Matthews <tmatth@videolan.org>
  1884ec0f
Mar 12, 2024
- Fix meson AVX2 fixed-point · 84a85e9e
  Jean-Marc Valin authored 1 year ago
  
  84a85e9e
Mar 11, 2024
- Remove the use of __m128i_u entirely · fcecf997
  Jean-Marc Valin authored 1 year ago
  
  It's just an internal gcc/clang type
  fcecf997
Mar 09, 2024

MSVC doesn't have a real __m128i_u, so it would generate an aligned
store, resulting in a segfault. Adding explicit loadu/stureu
intrinsics to make sure the compiler generates unaligned load/store

824f1bec

Mar 03, 2024
- Avoid OSCE crash if weights aren't loaded · 4eca11c0
  Jean-Marc Valin authored 1 year ago
  
  4eca11c0
Mar 01, 2024
- Allow wrap-around in Neon NSQ_del_dec LCG · b6fd9aaa
  Jean-Marc Valin authored 1 year ago
  
  Matches the C code and avoids undefined behaviour
  b6fd9aaa
Feb 25, 2024
- Move all DRED encoding/decoding files to dnn/ dir · dcce2fd4
  Jean-Marc Valin authored 1 year ago
  
  dcce2fd4
Feb 23, 2024

Rework 32-bit SSE loads yet again. · 59dc75fa

Timothy B. Terriberry authored 1 year ago and

Jean-Marc Valin committed 1 year ago

The existing code in vec_avx.h produced
  warning: dereferencing type-punned pointer will break
   strict-aliasing rules
 with gcc 6.4.0.
We already had a macro to work around this within the rules of the
 C standard, but trying to use that here does not get optimized
 into a single MOVD like we were hoping.
Replacing it with memcpy() instead does get optimized correctly,
 but requires switching from a macro to an inline function in order
 to be able to declare a local variable and return a value.
We already have such an inline function in NSQ_del_dec_avx2.c, so
 hoist that out and use it everywhere, and then convert vec_avx.h
 to use it also.

59dc75fa

Feb 22, 2024

Fix build on ARMv7 · 6673e34b

Jean-Marc Valin authored 1 year ago

Fixes regression in 83368e6.
vcgez_s16() is A64-only, but vcge_s16(..., vdup_n_s16(0)) works
everywhere.

6673e34b

Bump DRED experimental version for 3e2a6b62 · cf4e3a15
Jean-Marc Valin authored 1 year ago

cf4e3a15

Add signaling for a maximum DRED quantizer. · 3e2a6b62

Timothy B. Terriberry authored 1 year ago and

Jean-Marc Valin committed 1 year ago

Since any value of dQ > 0 will cause the initial quantizer to
 degrade to the format-implied maximum (15) with a sufficient
 number of DRED frames, allow signaling a maximum smaller than 15.
This allows encoders to improve the minimum quality of long DRED
 sequences (at the expense of bitrate) without requiring a constant
 quantizer for all frames (dQ == 0).

3e2a6b62

Remove some dead code. · 950d8bf1
Timothy B. Terriberry authored 1 year ago

950d8bf1

Feb 21, 2024
- bit-exact overflow fixes in silk/arm/NSQ_del_dec_neon_intr.c · 833688e6
  Jan Buethe authored 1 year ago
  
  833688e6
Feb 20, 2024
- Add missing RESTORE_STACK in tests · ecc10d83
  Jean-Marc Valin authored 1 year ago
  
  Silences NONTHREADSAFE_PSEUDOSTACK warnings
  ecc10d83
Feb 16, 2024
- Delaying new DRED data when just out of silence · db78df8c
  Jean-Marc Valin authored 1 year ago
  
  We don't need redundancy for the first active frame since we already have the main Opus payload.
  db78df8c
- Support for extra offset · 1f53f1e0
  Jean-Marc Valin authored 1 year ago
  
  Allows us to exclude the most recent silence from DRED
  1f53f1e0
- Refactoring: store all states · 183a8202
  Jean-Marc Valin authored 1 year ago
  
  183a8202
- Chopping the oldest silence in a DRED payload · 9f36bfc9
  Jean-Marc Valin authored 1 year ago
  
  9f36bfc9
Feb 15, 2024
- Fix missing dotprod optimization · 9b1da1fb
  Jean-Marc Valin authored 1 year ago
  
  Use the neon version of silk_noise_shape_quantizer_short_prediction()
  9b1da1fb
Feb 02, 2024
- Allow wrap-around in silk_LPC_analysis_filter_avx2() · 2582ca92
  Jean-Marc Valin authored 1 year ago
  
  Matches the C version (see 4a7027b2)
  2582ca92
Feb 01, 2024
- Fix OSCE using uninitialized range coder for PLC · f20575dd
  Jean-Marc Valin authored 1 year ago
  
  f20575dd
Jan 31, 2024
- Handle the offset from the DRED frame id · 43508197
  Jean-Marc Valin authored 1 year ago
  
  43508197
- Fix c90 build · 0fed741a
  Jean-Marc Valin authored 1 year ago
  
  0fed741a
Jan 25, 2024
- Initial DRED tuning · 19dd96b3
  Jean-Marc Valin authored 1 year ago
  
  Adjust q0, qD and duration based on bitrate and loss.
  19dd96b3
Dec 20, 2023
- Merge LACE/NoLACE under OSCE framework · 7d328f5b
  Jan Buethe authored 1 year ago and Jean-Marc Valin committed 1 year ago
  
  7d328f5b
Dec 15, 2023
- use opus_(re)alloc and opus_free for dnn and DRED related functions · 12fbd811
  Michael Klingbeil authored 1 year ago
  
  12fbd811
Nov 30, 2023
- don't redefine _mm_loadu_si32 on MSVC · 8090aaca
  Michael Klingbeil authored 1 year ago
  
  8090aaca
Nov 29, 2023
- Trying to fix/update meson build · c28b0f10
  Jean-Marc Valin authored 1 year ago
  
  Still don't quite know what I'm doing
  c28b0f10
Nov 28, 2023

Oops, fix the fixed-point build · 147b7229
Jean-Marc Valin authored 1 year ago

147b7229

Fixes for ARMv7/AArch32 · df637713

Jean-Marc Valin authored 1 year ago

1) Enable asm/intrinsics even for floating-point
2) Make sure ARMv8 asimd enables EDSP/MEDIA/Neon
3) Add dotp architecture to rtcd table since AArch *can* have dotp

df637713

Nov 21, 2023
- Add rtcd for silk_inner_product_FLP() · 239d223d
  Jean-Marc Valin authored 1 year ago
  
  239d223d
- Start enabling AVX2 silk_inner_product_FLP() · b93e4a14
  Jean-Marc Valin authored 1 year ago
  
  Not yet with rtcd
  b93e4a14
- Avoids AVX2 optimizations being disabled · ed900603
  Jean-Marc Valin authored 1 year ago
  
  ed900603
Nov 20, 2023

Misc fixes on previous patch · 6f99a338
Jean-Marc Valin authored 1 year ago
```
Fixes warnings, undefined behaviour, and check-asm failure
```
6f99a338

Optimize NSQ_del_dec() for AVX2 · 735c4070

Victor Ding authored 1 year ago and

Jean-Marc Valin committed 1 year ago

The optimization is bit-exact with C function.

This optimization speeds up SILK encoder (floating point) as following:

AMD Zen:
Complexity 0-5 :      0%
Complexity 6-7 : 3 -  7%
Complexity 8-10: 8 - 15%

Intel Skylake:
Complexity 0-5 :       0%
Complexity 6-7 : 14 - 18%
Complexity 8-10: 17 - 22%

Adapted by Jean-Marc Valin

735c4070