opus/celt
Timothy B. Terriberry 972a34ec2c Add ARMv4/ARMv5E macros.
Original patch by Aurélien Zanelli <aurelien.zanelli@parrot.com>:
 http://lists.xiph.org/pipermail/opus/2013-May/002078.html

Revised version:
- Add autconf detection (ported from libtheora).
- Rename ARM5E to ARMv5E (an ARM5 is not the same thing as ARMv5!).
- Use actual macros so they can still be selectively overridden.
- Split out ARMv4 parts and add a few more ARMv4 macros.
- Label blocks to make them easy to find in generated assembly.
- Fix MULT16_32_Q15() so we can pass make check.
  The MDCT test passes in values larger than 2**30 for b.
  The new version should be just as fast (or faster, since it's
   easier to merge the shift with following instructions), and
   there's no appreciable impact on accuracy (FFT/MDCT SNR actually
   goes up in most cases).
- Fix register constraints.
  We were using early-clobber flags in a bunch of places that
   didn't need them, and commutative-pair flags in a bunch of
   places that weren't actually commutative.
  This was Jean-Marc's fault (the original code came from Speex).
- Simplify silk_CLZ16().
- Port over iFFT C_MULC asm by Andree Buschmann
   <AndreeBuschmann@t-online.de> from Rockbox.
- Speed up the C_MULC asm by using LDRD, allowing more flexible
   addressing, re-ordering instructions to avoid some stalls,
   allowing more flexible register allocation, and getting things
   out of the inline asm block so the compiler can schedule them
   better.
- Add C_MUL and C_MUL4 asm for the FFT to the encoder based, on the
   new C_MULC.

In total, this patch gives a 22.3% speed-up on test_opus_encoder on
 a 600 MHz Cortex A8 using gcc 4.2.1,
When restricted to ARMv4 optimizations, it gives a 9.6% speed-up
 on the same processor/compiler.
On the conformance test vectors:
 Average mono quality is 97.0583 %
 Average stereo quality is 97.775 %
2013-05-19 19:12:51 -07:00
..
dump_modes s/FOUNDATION/COPYRIGHT OWNER/ in CELT code and "glue code" 2012-04-20 16:41:42 -04:00
tests Fixes forward MDCT for overlaps that aren't a multiple of 4 2012-12-03 15:40:01 -05:00
_kiss_fft_guts.h Add ARMv4/ARMv5E macros. 2013-05-19 19:12:51 -07:00
arch.h Add ARMv4/ARMv5E macros. 2013-05-19 19:12:51 -07:00
bands.c Fixes a seed issue introduced in 5367dac 2013-01-03 21:15:16 -05:00
bands.h Documentation fixes 2013-03-07 18:17:52 -05:00
celt.c Fully automate version updating 2013-05-11 02:05:24 +09:30
celt.h Merge branch 'surround' 2013-05-17 14:31:36 -04:00
celt.vcxproj Change Visual Studio files to use PACKAGE_VERSION. 2013-05-10 11:27:13 -07:00
celt.vcxproj.filters Add the new source files to the MSVC projects. 2012-11-09 11:58:26 -08:00
celt_decoder.c Fixes fixed-point PLC issue reported in trac ticket #1954 2013-05-18 02:18:42 -04:00
celt_encoder.c celt_maxabs16() now returns an opus_val32 to avoid problems with -32768 2013-05-18 23:52:18 -04:00
celt_lpc.c s/FOUNDATION/COPYRIGHT OWNER/ in CELT code and "glue code" 2012-04-20 16:41:42 -04:00
celt_lpc.h s/FOUNDATION/COPYRIGHT OWNER/ in CELT code and "glue code" 2012-04-20 16:41:42 -04:00
cwrs.c Fix an indentation issue. 2012-10-17 11:12:55 -07:00
cwrs.h s/FOUNDATION/COPYRIGHT OWNER/ in CELT code and "glue code" 2012-04-20 16:41:42 -04:00
ecintrin.h Guard _BitScanReverse on MSVC so that MSVC 6 doesn't break. 2012-11-27 13:07:45 -05:00
entcode.c Guard _BitScanReverse on MSVC so that MSVC 6 doesn't break. 2012-11-27 13:07:45 -05:00
entcode.h s/FOUNDATION/COPYRIGHT OWNER/ in CELT code and "glue code" 2012-04-20 16:41:42 -04:00
entdec.c s/FOUNDATION/COPYRIGHT OWNER/ in CELT code and "glue code" 2012-04-20 16:41:42 -04:00
entdec.h s/FOUNDATION/COPYRIGHT OWNER/ in CELT code and "glue code" 2012-04-20 16:41:42 -04:00
entenc.c s/FOUNDATION/COPYRIGHT OWNER/ in CELT code and "glue code" 2012-04-20 16:41:42 -04:00
entenc.h s/FOUNDATION/COPYRIGHT OWNER/ in CELT code and "glue code" 2012-04-20 16:41:42 -04:00
fixed_armv4.h Add ARMv4/ARMv5E macros. 2013-05-19 19:12:51 -07:00
fixed_armv5e.h Add ARMv4/ARMv5E macros. 2013-05-19 19:12:51 -07:00
fixed_c5x.h celt_maxabs16() now returns an opus_val32 to avoid problems with -32768 2013-05-18 23:52:18 -04:00
fixed_c6x.h celt_maxabs16() now returns an opus_val32 to avoid problems with -32768 2013-05-18 23:52:18 -04:00
fixed_debug.h Replace long long in celt/ with opus_int64. 2012-08-21 10:45:58 -04:00
fixed_generic.h Add OPUS_{GET|SET}_GAIN CTLs for adjusting output gain. 2012-07-11 00:04:24 -04:00
float_cast.h Guard _MSC_VER tests, remove FLOAT2INT16 when DISABLE_FLOAT_API. 2012-07-26 14:26:43 -04:00
kiss_fft.c Convert some double constants to float. 2012-07-17 17:42:00 -04:00
kiss_fft.h Misc changes to address Robert Sparks' comments 2012-04-20 10:26:08 -04:00
laplace.c Merge commit '390c89225d' 2012-04-24 13:39:22 -04:00
laplace.h s/FOUNDATION/COPYRIGHT OWNER/ in CELT code and "glue code" 2012-04-20 16:41:42 -04:00
mathops.c Fixes an overflow in the fixed-point celt_sqrt() for large values. 2012-11-29 16:51:50 -05:00
mathops.h celt_maxabs16() now returns an opus_val32 to avoid problems with -32768 2013-05-18 23:52:18 -04:00
mdct.c Do imdct post-rotate and deshuffle in-place. 2012-12-03 16:13:51 -05:00
mdct.h Replace the remaining instances of restrict with OPUS_RESTRICT. 2012-07-20 18:02:55 -04:00
mfrngcod.h s/FOUNDATION/COPYRIGHT OWNER/ in CELT code and "glue code" 2012-04-20 16:41:42 -04:00
modes.c s/FOUNDATION/COPYRIGHT OWNER/ in CELT code and "glue code" 2012-04-20 16:41:42 -04:00
modes.h s/FOUNDATION/COPYRIGHT OWNER/ in CELT code and "glue code" 2012-04-20 16:41:42 -04:00
opus_custom_demo.c Merge commit '390c89225d' 2012-04-24 13:39:22 -04:00
os_support.h renames the libcelt/ directory to celt/ 2011-09-13 18:21:18 -07:00
pitch.c celt_maxabs16() now returns an opus_val32 to avoid problems with -32768 2013-05-18 23:52:18 -04:00
pitch.h Replace C99 restrict keyword with OPUS_RESTRICT. 2012-07-18 12:12:35 -04:00
quant_bands.c Surround: Better LFE handling 2013-05-05 02:31:38 -04:00
quant_bands.h Surround: Better LFE handling 2013-05-05 02:31:38 -04:00
rate.c skip bands with no energy 2012-12-21 11:52:03 -05:00
rate.h skip bands with no energy 2012-12-21 11:52:03 -05:00
stack_alloc.h Adds explicit valgrind checks when ENABLE_VALGRIND is defined 2012-12-05 00:50:53 -05:00
static_modes_fixed.h Convert tabs to spaces in the opus and celt code. 2011-10-26 20:24:49 -07:00
static_modes_float.h Convert tabs to spaces in the opus and celt code. 2011-10-26 20:24:49 -07:00
vq.c Replace C99 restrict keyword with OPUS_RESTRICT. 2012-07-18 12:12:35 -04:00
vq.h Documentation fixes 2013-03-07 18:17:52 -05:00