Commits · 3e69410e29b0cc4e8a6e9712d7c980d702597d62 · Xiph.Org / Opus

Feb 10, 2024

Fix OOB read in fixed-point NEON intrinsics. · 3e69410e

Timothy B. Terriberry authored Feb 09, 2024 and

Jean-Marc Valin committed Feb 09, 2024



xcorr_kernel_neon_fixed() read one more sample from y[] in the
 main loop than it needed to allow use of vector loads, but unlike
 the native asm in celt_pitch_xcorr_arm.s, the loop condition did
 not exit early enough to prevent this from overrunning the end of
 the array.
Additionally, the tail loop _always_ read one value beyond what it
 needed.

This patch fixes the loop condition on the main loop.
Since this makes the tail section run even for lengths that are a
 multiple of 8 (e.g., on fully half the multiplies for usages like
 celt_fir() or celt_iir() with an order of 16, which is common),
 rather than try to fix the tail loop, we replace it with a
 non-looping adaptation of the native asm, which continues to use
 vector loads as much as possible for the remaining elements (and
 also does not read ahead past the end of the y[] array).

Overall slowdown of test_opus_encode on a Raspberry Pi 5 Model B
 Rev 1.0 is 0.12% vs. 0.13% for fixing the existing tail loop.

Signed-off-by: Jean-Marc Valin <jmvalin@jmvalin.ca>

3e69410e

Add check-asm for fixed-point xcorr_kernel(). · d5031251

Timothy B. Terriberry authored Feb 09, 2024 and

Jean-Marc Valin committed Feb 09, 2024



Compare the output of xcorr_kernel() against the results of
 xcorr_kernel_c() when configured with --enable-check-asm.
Currently this is only checked in fixed point, as a float check
 requires more sophisticated error analysis and may need to be
 customized for each vector implementation.

Signed-off-by: Jean-Marc Valin <jmvalin@jmvalin.ca>

d5031251

Feb 07, 2024
- Add basic testing for Deep PLC, DRED, and OSCE · 65b131ec
  Jean-Marc Valin authored Feb 06, 2024
  
  Still need more targeted tests, DRED decoding
  65b131ec
- Make opus_packet_unpad() discard extensions too · 7070dfec
  Jean-Marc Valin authored Feb 06, 2024
  
  Same for opus_multistream_packet_unpad()
  7070dfec
Feb 06, 2024
- Fix internal error on DRED · 17922c2a
  Jean-Marc Valin authored Feb 06, 2024
  
  Forgot to account for padding length bytes when DRED payload is large.
  17922c2a
- Avoid size-zero OPUS_COPY() with NULL pointer · 562587e9
  Jean-Marc Valin authored Feb 06, 2024
  
  Fails ubsan because memcpy declares args as non-null
  562587e9
Feb 02, 2024
- Allow wrap-around in silk_LPC_analysis_filter_avx2() · 2582ca92
  Jean-Marc Valin authored Feb 02, 2024
  
  Matches the C version (see 4a7027b2)
  2582ca92
- Fix log(0) on silence for fixed-point · e12c7f58
  Jean-Marc Valin authored Feb 02, 2024
  
  e12c7f58
- Add missing NULL pointer check · 0e2d56d6
  Jean-Marc Valin authored Feb 02, 2024
  
  0e2d56d6
- Fix various typos · 009d7412
  luzpaz authored Jul 21, 2023 and Jean-Marc Valin committed Feb 02, 2024
  
  Found using `codespell -q 3 -L caf,highe,inlin,nd,ordert,shft` Signed-off-by: Jean-Marc Valin <jmvalin@jmvalin.ca>
  009d7412
Feb 01, 2024
- Fix OSCE using uninitialized range coder for PLC · f20575dd
  Jean-Marc Valin authored Jan 31, 2024
  
  f20575dd
Jan 31, 2024
- Fix lossgen shared build · 53c2313c
  Jean-Marc Valin authored Jan 31, 2024
  
  53c2313c
- Avoid padding multi-frame DTX packets · 6c8acc21
  Jean-Marc Valin authored Jan 31, 2024
  
  6c8acc21
- Allow for DRED in DTX refresh packets · 648a9f24
  Jean-Marc Valin authored Jan 31, 2024
  
  648a9f24
- Handle the offset from the DRED frame id · 43508197
  Jean-Marc Valin authored Jan 31, 2024
  
  43508197
- Fix frame separator parsing · f4ee2925
  Jean-Marc Valin authored Jan 31, 2024
  
  f4ee2925
- Fix c90 build · 0fed741a
  Jean-Marc Valin authored Jan 30, 2024
  
  0fed741a
Jan 25, 2024
- Cleanup previous commits · 468a693d
  Jean-Marc Valin authored Jan 21, 2024
  
  Rename, reindent, change arg order
  468a693d
- divide max payload too · b778271d
  Jean-Marc Valin authored Dec 16, 2023 and Jean-Marc Valin committed Jan 25, 2024
  
  b778271d
- First shot at multi-frame CBR with DRED · 073bec91
  Jean-Marc Valin authored Dec 16, 2023 and Jean-Marc Valin committed Jan 25, 2024
  
  073bec91
- More activity handling to opus_encode_native_process() · fe86db66
  Jean-Marc Valin authored Dec 17, 2023 and Jean-Marc Valin committed Jan 25, 2024
  
  fe86db66
- Handle rangeFinal, delay_compensation · 452abeea
  Jean-Marc Valin authored Dec 17, 2023 and Jean-Marc Valin committed Jan 25, 2024
  
  452abeea
- Refactor multi-frame encoding to be non-recursive · fd88e223
  Jean-Marc Valin authored Dec 16, 2023 and Jean-Marc Valin committed Jan 25, 2024
  
  fd88e223
- Splitting opus_encode_native() · f44069f5
  Jean-Marc Valin authored Dec 15, 2023 and Jean-Marc Valin committed Jan 25, 2024
  
  f44069f5
- Fix Hybrid CBR with DRED and CELT->SILK redundancy · 231caa37
  Jean-Marc Valin authored Dec 19, 2023 and Jean-Marc Valin committed Jan 25, 2024
  
  Need to move the redundant frame even in CBR because the hybrid frame now gets encoded as VBR, with DRED picking up the rest. Fixes an issue introduced in 4600e775.
  231caa37
- Fix desync for CBR DRED · b63e22cf
  Jean-Marc Valin authored Dec 19, 2023 and Jean-Marc Valin committed Jan 25, 2024
  
  The encoder wouldn't reserve enough bits for CELT, causing it to not have enough bits to code the switching redundancy flag when it should have.
  b63e22cf
- More DRED tuning · 7b73c9bc
  Jean-Marc Valin authored Jan 23, 2024
  
  7b73c9bc
- Initial DRED tuning · 19dd96b3
  Jean-Marc Valin authored Jan 22, 2024
  
  Adjust q0, qD and duration based on bitrate and loss.
  19dd96b3
Jan 23, 2024
- fixes in osce python code · 7df2c67b
  Jan Buethe authored Jan 23, 2024
  
  7df2c67b
Jan 22, 2024
- switched to smaller NoLACE model · 3499d0aa
  Jan Buethe authored Jan 22, 2024
  
  3499d0aa
- bugfix in SilkFeatureNetPL · ec04a94e
  Jan Buethe authored Jan 22, 2024
  
  ec04a94e
- OSCE_MAX_RNN_UNITS now derived from osce model parameters · 5f8201c7
  Jan Buethe authored Jan 22, 2024
  
  5f8201c7
Jan 21, 2024
- Remove run-time code for old TF2 models · 6a9831a6
  Jean-Marc Valin authored Jan 18, 2024
  
  No longer needed now that PLC is trained with PyTorch stack
  6a9831a6
- Using PyTorch model (same architecture for now) · 1ddfcfd4
  Jean-Marc Valin authored Jan 17, 2024
  
  1ddfcfd4
- Improving PLC · e6992636
  Jean-Marc Valin authored Jan 15, 2024
  
  Should handle the history in a more consistent way. Slightly increase the model size and re-enable biased band loss in training.
  e6992636
Jan 20, 2024
- Updated LACE and NoLACE models to version 2 · 299e38ca
  Jan Buethe authored Dec 18, 2023
  
  299e38ca
Jan 17, 2024
- PLC export script · 4f311a1a
  Jean-Marc Valin authored Jan 17, 2024
  
  mostly untested
  4f311a1a
Jan 15, 2024
- PyTorch code for training the PLC model · 26ddfd71
  Jean-Marc Valin authored Jan 15, 2024
  
  Should match the TF2 code, but mostly untested
  26ddfd71
Dec 23, 2023
- Prevent overshoots from CELT PLC with prediction · 6ad03ae0
  Jean-Marc Valin authored Dec 22, 2023
  
  Constrains the energy prediction to something safe.
  6ad03ae0
Dec 22, 2023
- Add simulated loss to opus_demo · bd2e9a34
  Jean-Marc Valin authored Dec 21, 2023
  
  bd2e9a34