  1. Jan 20, 2011
    • Fixes an irrelevant uninitialized bug · e8a373fd
      Jean-Marc Valin authored
    • Remove useless ec_dec_tell() call. · a363e395
      Timothy B. Terriberry authored, Jean-Marc Valin committed
    • Make collapse-detection bitexact. · 21af73eb
      Timothy B. Terriberry authored, Jean-Marc Valin committed
      Jean-Marc's original anti-collapse patch used a threshold on the
       content of a decoded band to determine whether or not it should
       be filled with random noise.
      Since this is highly sensitive to the accuracy of the
       implementation, it could lead to significant decoder output
       differences even if decoding error up to that point was relatively
       small.
      
      This patch detects collapsed bands from the output of the vector
       quantizer, using exact integer arithmetic.
      It makes two simplifying assumptions:
       a) If either input to haar1() is non-zero during TF resolution
           adjustments, then the output will be non-zero.
       b) If the content of a block is non-zero in any of the bands that
           are used for folding, then the folded output will be non-zero.
      Assumption b) in particular is likely to be false when SPREAD_NONE
       is used.
      It also ignores the case where mid and side are orthogonal in
       stereo_merge, but this is relatively unlikely.
      This misses just over 3% of the cases that Jean-Marc's anti-collapse
       detection strategy would catch, but does not mis-classify any (all
       detected collapses are true collapses).
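
      As a rough illustration of the idea (not the actual CELT code; the
       function name and the assumption that a band's N coefficients are
       laid out contiguously block-by-block are mine), the mask can be
       computed from the integer PVQ output alone:

        /* Sketch: y[] is the integer pulse vector for one band of N
         * coefficients split into B blocks of N0 = N/B samples each.
         * Bit k of the result is set iff block k received any pulses.
         * Pure integer ops, so every implementation agrees exactly. */
        static unsigned collapse_mask_sketch(const int *y, int N, int B)
        {
           unsigned mask = 0;
           int k, j, N0 = N/B;
           for (k = 0; k < B; k++)
              for (j = 0; j < N0; j++)
                 if (y[k*N0 + j] != 0)
                    mask |= 1u << k;
           return mask;
        }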
      
      This patch overloads the "fill" parameter to mark which blocks have
       non-zero content for folding.
      As a consequence, if a set of blocks on one side of a split has
       collapsed, _no_ folding is done: the result would be zero anyway,
       except for short blocks with SPREAD_AGGRESSIVE that are split down
       to a single block, but a) that means a lot of bits were available
       so a collapse is unlikely and b) anti-collapse can fill the block
       anyway, if it's used.
      This also means that if itheta==0 or itheta==16384, we no longer
       fold at all on that side (even with long blocks), since we'd be
       multiplying the result by zero anyway.
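
      Schematically, the folding decision then looks something like the
       sketch below (illustrative only; the function and its arguments are
       hypothetical, not the real quant_band() interface):

        /* Sketch: "fill" has one bit per block saying whether any content
         * is available to fold from.  With nothing to fold, the output is
         * silence and the returned collapse mask is 0, so anti-collapse
         * (if enabled) can refill the band later. */
        static unsigned fold_or_zero_sketch(float *x, int N, unsigned fill,
                                            const float *lowband)
        {
           int j;
           if (fill == 0 || lowband == 0) {
              for (j = 0; j < N; j++)
                 x[j] = 0;
              return 0;
           }
           /* Otherwise fold: copy the lower-band content (the real code
            * also spreads and renormalises it). */
           for (j = 0; j < N; j++)
              x[j] = lowband[j];
           return fill;
        }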
  2. Jan 09, 2011
    • Prevent busts at low bitrates. · 76469c64
      Timothy B. Terriberry authored, Jean-Marc Valin committed
      This patch makes all symbols conditional on whether or not there's
       enough space left in the buffer to code them, and eliminates much
       of the redundancy in the side information.
      
      A summary of the major changes:
      * The isTransient flag is moved up to before the coarse energy.
        If there are not enough bits to code the coarse energy, the flag
         would get forced to 0, meaning the energy values that were coded
         would get interpreted incorrectly.
        This might not be the end of the world, and I'd be willing to
         move it back given a compelling argument.
      * Coarse energy switches coding schemes when fewer than 15 bits are
         left in the packet (see the sketch after this list):
        - With at least 2 bits remaining, the change in energy is forced
           to the range [-1...1] and coded with 1 bit (for 0) or 2 bits
           (for +/-1).
        - With only 1 bit remaining, the change in energy is forced to
           the range [-1...0] and coded with one bit.
        - If there is less than 1 bit remaining, the change in energy is
           forced to -1.
          This effectively low-passes bands whose energy is consistently
           starved; this might be undesirable, but letting the default be
           zero is unstable, which is worse.
      * The tf_select flag gets moved back after the per-band tf_res
         flags again, and is now skipped entirely when none of the
         tf_res flags are set, and the default value is the same for
         either alternative.
      * dynalloc boosting is now limited so that it stops once it's given
         a band all the remaining bits in the frame, or when it hits the
         "stupid cap" of (64<<LM)*(C<<BITRES) used during allocation.
      * If dynalloc boosting has allocated all the remaining bits in the
         frame, the alloc trim parameter does not get encoded (it would
         have no effect).
      * The intensity stereo offset is now limited to the range
         [start...codedBands], and thus doesn't get coded until after
         all of the skip decisions.
        Some space is reserved for it up front, and gradually given back
         as each band is skipped.
      * The dual stereo flag is coded only if intensity>start, since
         otherwise it has no effect.
        It is now coded after the intensity flag.
      * The space reserved for the final skip flag, the intensity stereo
         offset, and the dual stereo flag is now redistributed to all
         bands equally if it is unused.
        Before, the skip flag's bit was given to the band that stopped
         skipping without it (usually a dynalloc boosted band).
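
      The coarse-energy fallback referenced above boils down to something
       like the following sketch (put_bit() is a hypothetical bit writer;
       the real code uses the range coder, but the bit costs match the
       description):

        /* Sketch: qi is the quantized change in band energy, bits_left
         * the number of whole bits still available for it. */
        static int code_energy_delta_sketch(int qi, int bits_left,
                                            void (*put_bit)(int bit))
        {
           if (bits_left >= 2) {
              /* Force qi into [-1...1]: 1 bit for 0, 2 bits for +/-1. */
              if (qi > 1) qi = 1;
              if (qi < -1) qi = -1;
              put_bit(qi != 0);
              if (qi != 0)
                 put_bit(qi < 0);
           } else if (bits_left == 1) {
              /* Force qi into [-1...0] and spend the single bit. */
              if (qi > 0) qi = 0;
              if (qi < -1) qi = -1;
              put_bit(qi != 0);
           } else {
              /* Nothing left: qi is forced to -1 and nothing is coded,
               * so the decoder can infer it. */
              qi = -1;
           }
           return qi;   /* the value the decoder will reconstruct */
        }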
      
      In order to enable simple interaction between VBR and these
       packet-size enforced limits, many of which are encountered before
       VBR is run, the maximum packet size VBR will allow is computed at
       the beginning of the encoding function, and the buffer reduced to
       that size immediately.
      Later, when it is time to make the VBR decision, the minimum packet
       size is set high enough to ensure that no decision made thus far
       will have been affected by the packet size.
      As long as this is smaller than the up-front maximum, all of the
       encoder's decisions will remain in-sync with the decoder.
      If it is larger than the up-front maximum, the packet size is kept
       at that maximum, also ensuring sync.
      The minimum used now is slightly larger than it used to be, because
       it also includes the bits added for dynalloc boosting.
      Such boosting is shut off by the encoder at low rates, and so
       should not cause any serious issues at the rates where we would
       actually run out of room before compute_allocation().
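
      In outline (the names here are illustrative, not the encoder's), the
       interaction reduces to clamping the VBR byte target between those
       two limits:

        /* Sketch: max_bytes is the up-front cap applied to the buffer
         * before anything is coded; min_bytes is large enough to cover
         * every bit already committed (including dynalloc boosts). */
        static int clamp_vbr_bytes_sketch(int target, int min_bytes,
                                          int max_bytes)
        {
           if (target < min_bytes)
              target = min_bytes;   /* don't invalidate earlier decisions */
           if (target > max_bytes)
              target = max_bytes;   /* never exceed the up-front cap */
           return target;
        }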
    • Fix Jean-Marc's sqrt(0.5) constants. · 051e044d
      Timothy B. Terriberry authored, Jean-Marc Valin committed
      There were two different ones in use, one with less precision than
       a float, and the other missing a digit in the middle.
    • d0aa9f86
      Jean-Marc Valin authored
    • Use B0 instead of B for decisions in quant_band(). · a714994b
      Timothy B. Terriberry authored, Jean-Marc Valin committed
      B contains the number of blocks _after_ splitting.
      We were using it to decide a) when to use a uniform PDF instead of a
       triangular one for theta and b) whether to bias the bit allocation
       towards the lower bins.
      Using B0 (the number of blocks before the split) instead for a)
       gives a PEAQ gain of 0.003 ODG (as high as 0.1 ODG on s02a samples
       006, 083, and 097) for 240-sample frames at 96kbps mono.
      Using B0 instead for b) gives a gain of only 0.00002.
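
      For reference, decision a) has roughly this shape (a heavily
       simplified sketch; the real quant_band() logic also has stereo
       special cases):

        /* Sketch: use the pre-split block count B0, not the post-split
         * count B, to pick the PDF for the theta parameter. */
        static int use_uniform_theta_pdf_sketch(int stereo, int B0)
        {
           /* Multi-block (transient) bands keep the uniform PDF; a
            * single long block can use the cheaper triangular PDF. */
           return stereo || B0 > 1;
        }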
  3. Jan 08, 2011
    • Fix rounding in bits2pulses search. · 1cb32aa0
      Timothy B. Terriberry authored, Jean-Marc Valin committed
      The mid = (lo+hi)>>1 line in the binary search would allow hi to drop
       down to the same value as lo, meaning the rounding after the search
       would be choosing between the same two values.
      This patch changes it to (lo+hi+1)>>1.
      This will allow lo to increase up to the value hi, but only in the
       case that we can't possibly allocate enough pulses to meet the
       target number of bits (in which case the rounding doesn't matter).
      To pay for the extra add, this moves the +1 in the comparison to bits
       to the other side, which can then be taken outside the loop.
      The compiler can't normally do this because it might cause overflow
       which would change the results.
      
      This rarely mattered, but it gives a 0.01 PEAQ improvement on
       12-byte, 120-sample frames.
      It also makes the search process describable with a simple
       algorithm, rather than relying on this particular optimized
       implementation.
      I.e., the binary search loop can now be replaced with
        for(lo=0;lo+1<cache[0]&&cache[lo+1]<bits;lo++);
        hi=lo+1;
       and it will give equivalent results.
      This was not true before.
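
      For illustration, a self-contained version of that simple algorithm,
       with a simplified rounding step (the real bits2pulses() performs the
       search as a fixed-iteration binary search; the rounding shown here
       is simplified):

        /* Sketch: cache[0] is the largest pulse count for the band and
         * cache[1..cache[0]] the corresponding bit costs, which do not
         * decrease with the index. */
        static int bits2pulses_sketch(const unsigned char *cache, int bits)
        {
           int lo, hi;
           for (lo = 0; lo + 1 < cache[0] && cache[lo+1] < bits; lo++);
           hi = lo + 1;
           /* Round to whichever candidate's cost is closer to the target
            * (taking the cost of zero pulses to be zero bits here). */
           if (bits - (lo == 0 ? 0 : cache[lo]) <= cache[hi] - bits)
              return lo;
           return hi;
        }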
    • Changes to ec_dec_cdf() to support 8-bit tables. · 845dfa19
      Timothy B. Terriberry authored, Jean-Marc Valin committed
      This renames ec_dec_cdf() to ec_dec_icdf(), and changes the
       functionality to use an "inverse" CDF table, where
       icdf[i]=ft-cdf[i+1].
      The first entry is omitted entirely.
      It also adds a corresponding ec_enc_icdf() to the encoder, which uses
       the same table.
      One could use ec_encode_bin() by converting the values in the tables
       back to normal CDF values, but the icdf[] table already has them in
       the form ec_encode_bin() wants to use them, so there's no reason to
       translate them and then translate them back.
      
      This is done primarily to allow SILK to use the range coder with
       8-bit probability tables containing cumulative frequencies that
       span the full range 0...256.
      With an 8-bit table, the final 256 of a normal CDF becomes 0 in the
       "inverse" CDF.
      It's the 0 at the start of a normal CDF which would become 256, but
       this is the value we omit, as it already has to be special-cased in
       the encoder, and is not used at all in the decoder.
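
      A tiny standalone example of the conversion (the helper below is
       hypothetical, not part of the codec, and assumes every symbol has a
       non-zero frequency so each entry fits in 8 bits):

        #include <stdio.h>

        /* cdf[] has nsyms+1 entries running from 0 up to ft = cdf[nsyms];
         * icdf[i] = ft - cdf[i+1], so the leading 0 is dropped and the
         * trailing ft becomes the final 0. */
        static void cdf_to_icdf(const unsigned *cdf, unsigned char *icdf,
                                int nsyms)
        {
           int i;
           unsigned ft = cdf[nsyms];
           for (i = 0; i < nsyms; i++)
              icdf[i] = (unsigned char)(ft - cdf[i+1]);
        }

        int main(void)
        {
           static const unsigned cdf[] = {0, 10, 25, 256};   /* ft = 256 */
           unsigned char icdf[3];
           cdf_to_icdf(cdf, icdf, 3);
           printf("%d %d %d\n", icdf[0], icdf[1], icdf[2]);  /* 246 231 0 */
           return 0;
        }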