Commits · 0889e2ac15675618d48e3c3fdd4a1bbe46d4d067 · Alexander Traud / Opus

Feb 03, 2011
- Getting the right DoFs for dual stereo · 0889e2ac
  Jean-Marc Valin authored 14 years ago
  
  0889e2ac
Feb 02, 2011

Removing ancient allocation matrix · 9cc56bf0
Jean-Marc Valin authored 14 years ago

9cc56bf0

Increase caps/allocation accuracy. · ce6d0904

Timothy B. Terriberry authored 14 years ago and

Jean-Marc Valin committed 14 years ago

This stores the caps array in 32nd bits/sample instead of 1/2 bits
 scaled by LM and the channel count, which is slightly less
 accurate for the last two bands, and much more accurate for
 all the other bands.
A constant offset is subtracted to allow it to represent values
 larger than 255 in 8 bits (the range of unoffset values is
 77...304).
In addition, this replaces the last modeline in the allocation table
 with the caps array, allowing the initial interpolation to
 allocate 8 bits/sample or more, which was otherwise impossible.
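
As an illustration of the storage scheme described above, here is a minimal sketch in C. The offset value of 64 and the function names are assumptions made for this example (the message only states that a constant offset is used); any offset from 49 to 77 would map the stated range 77...304 into 8 bits.

```c
#include <assert.h>

/* Assumed offset for illustration; the commit message does not give the value. */
#define CAP_OFFSET 64

/* Pack a cap given in 1/32 bit per sample into one byte (hypothetical helper). */
unsigned char pack_cap(int cap_32nds)
{
    assert(cap_32nds >= CAP_OFFSET && cap_32nds - CAP_OFFSET <= 255);
    return (unsigned char)(cap_32nds - CAP_OFFSET);
}

/* Recover the cap in 1/32 bit per sample (hypothetical helper). */
int unpack_cap(unsigned char stored)
{
    return stored + CAP_OFFSET;
}

/* Scale a stored cap to a whole band of N samples per channel and C channels.
 * The result is in 1/8 bit units: (1/32 bit per sample) * C * N / 4. */
int band_cap_eighth_bits(unsigned char stored, int N, int C)
{
    return (unpack_cap(stored) * C * N) >> 2;
}
```

With these assumptions, a cap of 256 (8 bits/sample) on a stereo band of 8 samples per channel comes out as 1024 eighth-bits, that is, 128 bits.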

ce6d0904

Only checking for a mismatch when RESYNTH is defined · 424eb742
Jean-Marc Valin authored 14 years ago

424eb742

Feb 01, 2011

Limit mode creation to supported modes. · aa6fec66

Timothy B. Terriberry authored 14 years ago

We did no real error checking to see if a mode is supported when it
 is created.
This patch implements checks for Jean-Marc's rules:
1) A mode must have frames at least 1ms in length (no more than
    1000 per second).
2) A mode must have shorts of at most 3.33 ms (at least 300 per
    second).
It also adds error checking to dump_modes so we report the error
 instead of crashing when we fail to create a mode.
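
The two rules above could be checked along the following lines. This is only a sketch: the function name and the use of LM as the base-2 log of the number of short blocks per frame are assumptions for the example, not the patch itself.

```c
#include <assert.h>

/* Returns 1 if the mode satisfies both rules, 0 otherwise.
 * Fs: sample rate in Hz; frame_size: samples per frame;
 * LM: log2 of the number of short blocks per frame (assumed convention). */
int mode_is_supported(long Fs, int frame_size, int LM)
{
    assert(Fs > 0 && frame_size > 0 && LM >= 0);
    /* Rule 1: frames of at least 1 ms, i.e. frame_size / Fs >= 1/1000. */
    if ((long)frame_size * 1000 < Fs)
        return 0;
    /* Rule 2: shorts of at most 3.33 ms, i.e. short_size / Fs <= 1/300. */
    if ((long)(frame_size >> LM) * 300 > Fs)
        return 0;
    return 1;
}
```

For example, at 48 kHz a 960-sample frame with LM=3 has 120-sample shorts (2.5 ms) and passes, while the same frame with LM=0 fails the second rule.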

aa6fec66

Fixing the global stack -- and an overflow in collapse_mask · 7e983194
Jean-Marc Valin authored 14 years ago

7e983194

Add assertions for band size restrictions. · 2799c297

Timothy B. Terriberry authored 14 years ago

The way folding is implemented requires two restrictions:
1. The last band must be the largest (so we can use its size to
 allocate a temporary buffer to handle interleaving/TF changes).
2. No band can be larger than twice the size of the previous band
 (so that once we have enough data to start folding, we will always
 have enough data to fold).

Mode creation makes a heuristic attempt to satisfy these
 conditions, but nothing actually guarantees it.
This adds some asserts to check them during mode creation.
They currently pass for all supported custom modes.
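
A minimal sketch of the two checks, assuming the band edges are held in an array of nbEBands+1 boundaries (so band i spans eBands[i] to eBands[i+1]). The function name is hypothetical, and it returns a flag rather than asserting so that it can be exercised on failing inputs.

```c
#include <assert.h>

/* Returns 1 if both folding restrictions hold, 0 otherwise. */
int bands_ok_for_folding(const int *eBands, int nbEBands)
{
    int i;
    int last;
    assert(nbEBands >= 1);
    last = eBands[nbEBands] - eBands[nbEBands - 1];
    for (i = 0; i < nbEBands; i++) {
        int size = eBands[i + 1] - eBands[i];
        /* Restriction 1: the last band must be the largest. */
        if (size > last)
            return 0;
        /* Restriction 2: no band larger than twice the previous band. */
        if (i > 0 && size > 2 * (eBands[i] - eBands[i - 1]))
            return 0;
    }
    return 1;
}
```

Edges {0, 1, 2, 4, 8} give band sizes 1, 1, 2, 4 and pass; edges {0, 1, 4, 8} fail because a band of size 3 follows a band of size 1.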

2799c297

Don't allow empty eBands. · cb8f366a

Timothy B. Terriberry authored 14 years ago

Currently compute_ebands()'s attempts to round bands to even sizes
and enforce size constraints on consecutive bands can leave some
bands entirely empty (e.g., Fs=8000, frame_size=64, i=11).
This adds a simple post-processing loop to remove such bands.
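
Such a post-processing loop might look like the sketch below, which compacts the boundary array in place. The function name and the convention of nbEBands+1 boundaries are assumptions for the example.

```c
#include <assert.h>

/* Drop every band whose upper edge does not exceed the last kept edge.
 * eBands holds nbEBands+1 boundaries; returns the new band count. */
int remove_empty_bands(int *eBands, int nbEBands)
{
    int i;
    int j = 0;
    assert(nbEBands >= 0);
    for (i = 0; i < nbEBands; i++) {
        /* Keep the edge only if it opens a non-empty band. */
        if (eBands[i + 1] > eBands[j])
            eBands[++j] = eBands[i + 1];
    }
    return j;
}
```

Given the edges {0, 2, 2, 4, 4, 8}, the two empty bands are removed, leaving three bands with edges {0, 2, 4, 8}.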

cb8f366a

Adds a generic CELT_SET_BITRATE() ctl() API for CBR and VBR · 7bb26e13
Jean-Marc Valin authored 14 years ago

7bb26e13
Tuning the split threshold · 263e2719
Jean-Marc Valin authored 14 years ago

263e2719

Add a separate qtheta offset for two-phase stereo. · 411a84fa

Timothy B. Terriberry authored 14 years ago and

Jean-Marc Valin committed 14 years ago

9b34bd83 caused serious regressions for 240-sample frame stereo,
 because the previous qb limit was _always_ hit for two-phase
 stereo.
Two-phase stereo really does operate with a different model (for
 example, the single bit allocated to the side should really
 probably be thought of as a sign bit for qtheta, but we don't
 count it as part of qtheta's allocation).
The old code was equivalent to a separate two-phase offset of 12;
 however, Greg Maxwell's testing demonstrates that 16 performs
 best.

411a84fa

Adjust the splitting threshold. · 4499263b

Timothy B. Terriberry authored 14 years ago and

Jean-Marc Valin committed 14 years ago

Previously, we would only split a band if it was allocated more than
 32 bits.
However, the N=4 codebook can only produce about 22.5 bits, and two
 N=2 bands combined can only produce 26 bits, including 8 bits for
 qtheta, so if we wait until we allocate 32, we're guaranteed to fall
 short.
Several of the larger bands come pretty far from filling 32 bits as
 well, though their split versions will.

Greg Maxwell also suggested adding an offset to the threshold to
 account for the inefficiency of using qtheta compared to another
 VQ dimension.
This patch uses 1 bit as a placeholder, as it's a clear
 improvement, but we may adjust this later after collecting data on
 more possibilities over more files.

4499263b

Jan 31, 2011

Including static_mode* files in the distribution · 5cf41c9d
Jean-Marc Valin authored 14 years ago

5cf41c9d
Stop collapsing the background noise channels when switching to mono · a350bf52
Jean-Marc Valin authored 14 years ago

a350bf52

Don't destroy stereo history when switching to mono. · 682b6cf1

Timothy B. Terriberry authored 14 years ago

The first version of the mono decoder with stereo output collapsed
 the historic energy values stored for anti-collapse down to one
 channel (by taking the max).
This means that a subsequent switch back would continue on using
 the maximum of the two values instead of the original history,
 which would make anti-collapse produce louder noise (and
 potentially more pre-echo than otherwise).

This patch moves the max into the anti_collapse function itself,
 and does not store the values back into the source array, so the
 full stereo history is maintained if subsequent frames switch
 back.
It also fixes an encoder mismatch, which never took the max
 (assuming, apparently, that the output channel count would never
 change).
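
The idea of taking the max at the point of use, without writing it back, can be sketched as follows. The function name and the channel-major layout of the history array (nbEBands values for the first channel, then nbEBands for the second) are assumptions for illustration.

```c
#include <assert.h>

/* Return the historic energy to use for one band of output channel c.
 * prevE is const: the stored two-channel history is never modified. */
float anti_collapse_history(const float *prevE, int nbEBands, int band,
                            int c, int coded_channels)
{
    float e;
    assert(band >= 0 && band < nbEBands && (c == 0 || c == 1));
    e = prevE[c * nbEBands + band];
    if (coded_channels == 1) {
        /* Mono frame: use the max of both channels, locally only. */
        float other = prevE[(1 - c) * nbEBands + band];
        if (other > e)
            e = other;
    }
    return e;
}
```

Because the max is computed on the fly, a later stereo frame still sees the original per-channel values.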

682b6cf1

Propagate balance from compute_allocation() to quant_all_bands(). · 948d27c9

Timothy B. Terriberry authored 14 years ago

Instead of just dumping excess bits into the first band after
 allocation, use them to initialize the rebalancing loop in
 quant_all_bands().
This allows these bits to be redistributed over several bands, like
 normal.

948d27c9

Fix sample type conversion when resampling · 713d7a4c
Jean-Marc Valin authored 14 years ago

713d7a4c
No longer extracting the frame size from the mode to build the header · b35807d7
Jean-Marc Valin authored 14 years ago

b35807d7
Making the stereo encoder capable of encoding in mono · 00a98f5d
Jean-Marc Valin authored 14 years ago

00a98f5d
Making it possible for the stereo decoder to decode a mono stream · f1916a14
Jean-Marc Valin authored 14 years ago

f1916a14

Apply band caps to the band allocation table. · 89039a3f

Timothy B. Terriberry authored 14 years ago and

Jean-Marc Valin committed 14 years ago

The average caps over all values of LM and C are well below the
 target allocations of the last two modelines.
Lower them to the caps, to prevent hitting them quite so early.
This helps quality at medium-high rates, in the 180-192 kbps range.

89039a3f

More band caps updates. · b5d123a5

Timothy B. Terriberry authored 14 years ago and

Jean-Marc Valin committed 14 years ago

Use measured cross-entropy to estimate the real cost of coding
 qtheta given the allocated qb parameter, instead of the entropy of
 the PDF.
This is generally much lower, and reduces waste at high rates.
This patch also removes some intermediate rounding from this
 computation.

b5d123a5

Add generic fine-energy rebalancing. · 13bffd28
Timothy B. Terriberry authored 14 years ago and Jean-Marc Valin committed 14 years ago

This extends the previous rebalancing for fine energy in N=1 bands
 to also allocate extra fine bits for bands that go over their cap.

13bffd28
Custom and non-custom versions of the get_size() functions · 8cf29f09
Jean-Marc Valin authored 14 years ago

8cf29f09
Making sure that itheta=0 or 16384 really cuts allocation to one band · aaca4a71
Jean-Marc Valin authored 14 years ago

aaca4a71

Jan 30, 2011

Merge branch 'exp_api_change' · 665da0ba
Jean-Marc Valin authored 14 years ago

665da0ba

Use a smarter per-band bitrate cap. · c5643074

Timothy B. Terriberry authored 14 years ago and

Jean-Marc Valin committed 14 years ago

The previous "dumb cap" of (64<<LM)*(C<<BITRES) was not actually
 achievable by many (most) bands, and did not take the cost of
 coding theta for splits into account, and so was too small for some
 bands.
This patch adds code to compute a fairly accurate estimate of the
 real maximum per-band rate (an estimate only because of rounding
 effects and the fact that the bit usage for theta is variable),
 which is then truncated and stored in an 8-bit table in the mode.

This gives improved quality at all rates over 160 kbps/channel,
 prevents bits from being wasted all the way up to 255 kbps/channel
 (the maximum rate allowed, and approximately the maximum number of
 bits that can usefully be used regardless of the allocation), and
 prevents dynalloc and trim from producing enormous waste
 (eliminating the need for encoder logic to prevent this).
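
For reference, the previous "dumb cap" expression can be evaluated as in this sketch. BITRES is assumed to be 3 here (bit counts kept in 1/8 bit units); the function name is hypothetical.

```c
#include <assert.h>

/* Assumed value: bit counts in units of 1/8 bit. */
#define BITRES 3

/* The old cap, (64<<LM)*(C<<BITRES), in 1/8 bit units. */
int dumb_cap(int LM, int C)
{
    assert(LM >= 0 && C >= 1);
    return (64 << LM) * (C << BITRES);
}
```

Note that this value depends only on LM and C, not on the band, which is why it could not match what individual bands can actually use.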

c5643074

Error handling in _create() functions · d6c3d3ce
Jean-Marc Valin authored 14 years ago

d6c3d3ce

Jan 29, 2011
- Adding resampling support · 913a1742
  Jean-Marc Valin authored 14 years ago
  
  We use the MDCT as a low-pass filter.
  913a1742
- Change qb cap to prevent side-fold collapses. · 9b34bd83
  Timothy B. Terriberry authored 14 years ago and Jean-Marc Valin committed 14 years ago
  
  Previously, in a stereo split with itheta==16384, but without enough bits left over to actually code a pulse, the target band would completely collapse, because the mid gain would be zero and we don't fold the side. This changes the limit to ensure that we never set qn>1 unless we know we'll have enough bits for at least one pulse. This should eliminate the last possible whole-band collapse.
  9b34bd83
- celt_encoder_create() now defaults to Opus standard mode · c97b258c
  Jean-Marc Valin authored 14 years ago
  
  The old constructor is renamed celt_encoder_create_custom(). Same for the decoder.
  c97b258c
- Enabling the standard static mode by default · 5ad35bf3
  Jean-Marc Valin authored 14 years ago
  
  5ad35bf3
- Adding the auto-generated static modes for float and fixed · d9e4b1d7
  Jean-Marc Valin authored 14 years ago
  
  d9e4b1d7
- Using the actual degrees of freedom rather than N*C for fine offset · 17cab431
  Jean-Marc Valin authored 14 years ago
  
  17cab431
Jan 28, 2011
- Prevent VBR from shooting up to the maximum rate if set to very low target... · 420c3258
  Gregory Maxwell authored 14 years ago and Jean-Marc Valin committed 14 years ago
  
  Prevent VBR from shooting up to the maximum rate if set to very low target rates, and prevent the encoder VBR from producing 1 byte frames (which are no longer allowed).
  420c3258
- Don't rebalance bits for itheta=0 or 16384 · 09213de9
  Jean-Marc Valin authored 14 years ago
  
  09213de9
Jan 27, 2011
- Making rebalance a celt_int32 · a9285720
  Jean-Marc Valin authored 14 years ago
  
  a9285720
- Making anti-collapse a bit more conservative again · 47e905dc
  Jean-Marc Valin authored 14 years ago
  
  The energy memory can be lowered (not increased) during a transient
  47e905dc
- Changing some double constants to float · b417d839
  Jean-Marc Valin authored 14 years ago
  
  b417d839
- Adjusting post-filter coefficients to be exact in 13 bit precision. · 61f40418
  Jean-Marc Valin authored 14 years ago
  
  That way they can be exact in 16 bits once multiplied by the gain
  61f40418