Jean-Marc Valin
d9fb8a6651
Optimizing divisions with a signed numerator
2014-01-30 11:16:24 -05:00
Jean-Marc Valin
5123175017
Speeding up ec_tell_frac()
2014-01-29 10:18:06 -05:00
Jean-Marc Valin
c543ec1bd3
Speeding up extract_collapse_mask() slightly
2014-01-23 00:02:12 -05:00
Jean-Marc Valin
2e6876c07a
Using SSAT in SIG2WORD16() on ARMv6
2014-01-22 19:22:05 -05:00
Jean-Marc Valin
9b3a3ad7ff
Unrolled version of the comb filter for ARM (eliminates MOVs)
2014-01-21 16:23:24 -05:00
Jean-Marc Valin
6218cba258
Making decode_pulses() compute the L2-norm on the fly
2014-01-21 15:17:01 -05:00
Jean-Marc Valin
17b197837f
Speed up the comb filter on ARM by using MAC16_32_Q16()
2014-01-21 11:07:57 -05:00
Jean-Marc Valin
29354ff6e0
Save more integer divisions on ARM when we know the operands are positive
2014-01-21 10:39:33 -05:00
Jean-Marc Valin
ce1173c77f
Fixes use of uninitialized values in dynalloc_analysis()
2014-01-20 18:46:33 -05:00
Jean-Marc Valin
ec5d01cbe4
Using a table on ARM for unsigned division by small (<=256) integers.
...
Saves 0.6% for 64 kb/s and 1.8% for 128 kb/s when decoding on arm7tdmi.
2014-01-20 16:32:16 -05:00
Jean-Marc Valin
379af35fd4
Minor cleanup in dynalloc_analysis()
2014-01-19 01:42:04 -05:00
Jean-Marc Valin
05548fa1c1
Adds a median filter to make dynalloc_analysis() more conservative
...
This should prevent extreme dynalloc behaviour in cases where some
bands are heavily attenuated.
2014-01-19 01:32:53 -05:00
Gregory Maxwell
a65db56e54
Fix declaration after statement in fixed point.
2014-01-08 11:48:38 -08:00
Jean-Marc Valin
e775090427
pseudostack instrumentation (off by default)
2014-01-07 21:32:41 -05:00
Jean-Marc Valin
9134e96cb2
Fixes SMALL_FOOTPRINT for float
2014-01-07 17:50:46 -05:00
Jean-Marc Valin
e17ca25617
Don't allocate pulses on the stack when calling the SILK PLC.
...
Also minor C89 fix for the previous commit
2014-01-07 15:27:02 -05:00
Jean-Marc Valin
b63e7110cb
Moves CELT PLC pitch search to a separate function to reduce peak stack
2014-01-07 15:02:43 -05:00
Jean-Marc Valin
5f807c176f
Adds SMALL_FOOTPRINT hack to the CELT PLC too
2014-01-07 04:48:42 -05:00
Jean-Marc Valin
9d1b6fef2a
Moves deemphasis() call out of celt_decode_lost() to reduce peak stack
2014-01-07 04:32:41 -05:00
Jean-Marc Valin
ad8371d172
Cleaning up leftovers of "freq" in celt_decode_with_ec()
2014-01-06 17:45:57 -05:00
Jean-Marc Valin
4d07b1357e
Reduces the decoder stack use by removing the pcm_silk buffer in fixed-point
...
We only keep when concealing less than 10ms with SILK.
2014-01-06 17:43:20 -05:00
Jean-Marc Valin
14ca4ed682
Moves the remains of compute_inv_mdcts() to celt_synthesis()
2014-01-06 09:31:09 -05:00
Jean-Marc Valin
32454dcadc
Hack that makes the SMALL_FOOTPRINT CELT decoder use only 4.25 kB of stack.
2014-01-06 09:11:52 -05:00
Jean-Marc Valin
bdc7b93358
Reduces decoder stack usage by only storing one channel of denormalized MDCT
2014-01-06 08:58:38 -05:00
Jean-Marc Valin
4a6744a446
Some cleaning up of the synthesis code.
2014-01-05 21:40:02 -05:00
Jean-Marc Valin
ed01a596dc
Making exp_rotation1() use MAC16_16(), which saves a few cycles on ARM
2014-01-04 21:06:24 -05:00
Jean-Marc Valin
ccec752a38
Silences unused parameter warning
2014-01-04 00:49:46 -05:00
Jean-Marc Valin
ef0eac497f
Moving the radix-2 to expose trivial twiddle factors
2014-01-03 23:55:52 -05:00
Jean-Marc Valin
c8f62a4aed
Improving the accuracy of the fixed-point radix-3 and radix-5
2013-12-31 00:00:37 -05:00
Jean-Marc Valin
e1f846208e
Minor cleanup -- nothing to see here
2013-12-29 18:45:49 -05:00
Jean-Marc Valin
05291fd6a6
Fixed-point: slight accuracy improvement in the comb filter
2013-12-29 18:31:17 -05:00
Jean-Marc Valin
30f52cbe2d
Remove a SAVE_STACK that was pasted accidentally in the previous commit
2013-12-29 16:21:06 -05:00
Jean-Marc Valin
e1dc1e2238
Unifying scaling of fixed-point and float FFT
2013-12-29 13:34:17 -05:00
Jean-Marc Valin
dbb96ab5cc
Fixes C89 issue
2013-12-29 00:09:06 -05:00
Jean-Marc Valin
4c1a90a847
Getting rid of some negations
...
Since we're doing two rotations, we can invert the sign on both.
Also adding a few comments for optimizing the FFT.
2013-12-28 23:14:26 -05:00
Jean-Marc Valin
cc344fb8ff
Slightly improving the accuracy of the fixed-point MDCT downscale
...
Also simplifying the code
2013-12-28 19:10:44 -05:00
Jean-Marc Valin
e0c00e27d8
Commit 99968ab
was causing us to allocate too much stack in the MDCT
2013-12-27 03:16:34 -05:00
Jean-Marc Valin
e43a0abe0a
Removes the separate 1/8N rotation in the (I)MDCT and unmerges the MDCT sizes
...
Undoes commits f7547a4e
and 72513f3c
2013-12-27 00:10:54 -05:00
Jean-Marc Valin
a5e3c8a6a6
Inverse MDCT no longer requires any scratch space
2013-12-23 02:26:03 -05:00
Jean-Marc Valin
e2bcb3fe9b
Reverse the ordering of the FFT stage to optimize a degenerate radix-4 case.
...
This also happens to increase the accuracy since it appears that the new
ordering is optimal (at least for 20 ms frames), whereas the previous ordering
was pessimal.
2013-12-22 02:17:24 -05:00
Jean-Marc Valin
c8f4e1608a
Merges the FFT scaling with the MDCT pre-rotate
2013-12-21 16:30:49 -05:00
Jean-Marc Valin
153def2884
Getting rid of the inverse FFT entirely
...
IMDCT now uses the forward FFT.
2013-12-21 15:45:17 -05:00
Jean-Marc Valin
99968abba8
Moving bitrev step to forward MDCT too
2013-12-21 14:29:41 -05:00
Jean-Marc Valin
bc13bbaad7
Applying the forward FFT gain up-front for fixed-point too
...
This makes us lose a bit of precision in the first steps, but our gain is more
precise because it's only rounded once. Overall, SNR is slightly improved.
2013-12-21 02:33:22 -05:00
Jean-Marc Valin
2e26b82ec2
Moves the bitrev step to the IMDCT pre-rotation
2013-12-20 23:13:29 -05:00
Jean-Marc Valin
306d7f5a30
fixed-point: slight (but free) accuracy improvement in compute_band_energies()
...
Also moves the VSHR32() condition outside the loop just in case some compilers
don't optimize that properly.
2013-12-16 01:08:21 -05:00
Jean-Marc Valin
e0f26246b0
fixed-point: adds rounding to some shifts to eliminate bias
...
This reduces the peak decoding error by removing small (inaudible) spikes in
the error at the frame boundaries. These were due to the frequency-domain bias
ending up as a small pulse in the middle of the IMDCT overlap. None of this
was ever audible, but fixing it is still cleaner.
2013-12-14 11:07:13 -05:00
Jean-Marc Valin
4a168eb343
Remove useless code in alloc_trim_analysis()
2013-12-11 01:34:06 -05:00
Jean-Marc Valin
5752d659fd
Minor fixed-point accuracy improvements that were completely free
2013-12-11 00:21:38 -05:00
Jean-Marc Valin
91f8010108
Removing indirections
2013-12-10 22:09:33 -05:00