Commit graph

82 commits

Author SHA1 Message Date
Jean-Marc Valin
7f7b2a1c66
Smaller version of fargan
800k parameters, 600 MFLOPS, with a receptive field of 3 feature vectors
2023-11-16 02:06:14 -05:00
Jean-Marc Valin
b0620c0bf9
Using sparse GRUs in DRED decoder
Saves ~270 kB of weights in the decoder
2023-11-15 04:08:50 -05:00
Jean-Marc Valin
77594bf158
Dumping RDOVAE stats from XML 2023-11-08 17:32:43 -05:00
Jean-Marc Valin
222662dac8
DRED: quantize scale and dead zone to 8 bits 2023-11-07 18:10:50 -05:00
Jan Buethe
4e104555e9
added weight export script for LACE/NoLACE 2023-11-07 15:12:12 +01:00
Jan Buethe
8af5c6b4a1
added transposed 1d convolutions to wexchange 2023-11-07 11:54:22 +01:00
Jean-Marc Valin
0ab0640d4a
Split stats in two and remove useless dimensions 2023-11-07 00:07:14 -05:00
Jan Buethe
2386a60ec6
updated moc to match results in ietf118 presentation 2023-11-06 17:50:48 +01:00
Jean-Marc Valin
544b3e576c
DRED: quantize r and p0 parameters with 8 bits
Only code non-degenerate symbols, which makes the encoder faster
2023-11-06 03:16:43 -05:00
Jan Buethe
da60266f6e
updated moc method 2023-11-02 16:52:50 +01:00
Jean-Marc Valin
ddd5669e79
Pitch and fargan model updates
Removing one of the 2d conv layers for pitch estimation reduces
complexity without noticeable degradation. FARGAN model has more
adversarial training.
Also, no need for the double precision in the low-pass filter.
2023-10-28 23:33:47 -04:00
Jean-Marc Valin
4598fe5409
Quantizing pitchdnn and rdovae weights 2023-10-20 12:54:13 -04:00
Jan Buethe
290be25b98
added 16kHz version of opus_compare in python 2023-10-20 14:24:27 +02:00
Jan Buethe
1accd2472e
finalized quantization option in export_rdovae_weights.py 2023-10-20 14:14:31 +02:00
Jean-Marc Valin
d720955d61
Marking RDOVAE layers to quantize 2023-10-19 16:06:52 -04:00
Jan Buethe
60ac1c6c99
prepared quantization implementation for DRED 2023-10-19 21:54:39 +02:00
Jan Buethe
2192e85b91
restructured osce readme 2023-10-19 21:45:45 +02:00
Jan Buethe
055c683018
added LACE/NoLACE checkpoint URL 2023-10-19 21:34:13 +02:00
Jean-Marc Valin
35cb8d7f66
C implementation of FARGAN 2023-10-14 16:42:33 -04:00
Jean-Marc Valin
9e76a7bfb8
update fargan to match version 45 2023-10-10 00:51:57 -04:00
Jan Buethe
0563d71b25
updated osce readme 2023-10-07 18:52:38 +02:00
Jean-Marc Valin
f0ec990dba
Switching to neural pitch estimator
Remove old pitch estimator and retrain all models
2023-10-06 03:14:56 -04:00
Jean-Marc Valin
27663d3641
Using a DenseNet for DRED 2023-10-02 01:43:44 -04:00
Jean-Marc Valin
8e8edf71bd
Remove unneeded (I think) tanh at the end 2023-10-01 21:34:58 -04:00
Jean-Marc Valin
33adba02c7
First version of pitch DNN C code
Totally untested -- most likely doesn't work
2023-10-01 03:59:17 -04:00
Jean-Marc Valin
f3b86f9414
Fix model saving 2023-09-30 02:48:26 -04:00
Jan Buethe
0459a572f5
updated PitchDNN export script 2023-09-29 15:34:59 +02:00
Jan Buethe
ce28695844
refactoring and cleanup 2023-09-29 15:31:45 +02:00
Jan Buethe
4901445490
fixed type in error message 2023-09-29 14:34:11 +02:00
Jan Buethe
c5c214df1b
added rudimentary support for dumping nn.Conv2d layers 2023-09-29 14:25:26 +02:00
Jean-Marc Valin
9a7bb764d4
No features skip needed to align pitch features 2023-09-27 13:01:21 -04:00
Jean-Marc Valin
a6b4fe375a
Script to compute the groundtruth data using CREPE 2023-09-27 13:00:12 -04:00
Jan Buethe
ae0a140c40
updated default values in adv_train_fargan.py 2023-09-26 21:42:01 +02:00
Jan Buethe
41a4c9515d
changed checkpoint format 2023-09-26 12:12:47 -04:00
Jean-Marc Valin
733a095ba2
Adapting to new data format/model 2023-09-26 12:12:47 -04:00
Krishna Subramani
f38b4a317f
Python code for neural pitch 2023-09-26 12:12:47 -04:00
Jean-Marc Valin
c4b83ae62d
RDOVAE model update 2023-09-26 12:11:25 -04:00
Jan Buethe
00580a63aa
bugfix 2023-09-22 11:39:22 +02:00
Jan Buethe
aad74fdfd6
pulse-related bugfix 2023-09-22 11:18:41 +02:00
Jan Buethe
0a92bc5eaa
more lavoce stuff 2023-09-21 15:01:11 +02:00
Jean-Marc Valin
b88644b9c7
Quantizing initial state with rdovae too
More efficient than PVQ
2023-09-20 18:04:08 -04:00
Jan Buethe
2ec31cc5cc
added FARGAN adversarial training script 2023-09-20 19:06:10 +02:00
Jan Buethe
82f48d368b
removed trailing whitespace in fargan
Signed-off-by: Jan Buethe <jbuethe@amazon.de>
2023-09-13 16:57:28 +02:00
Jan Buethe
e7beaec3fb
integrated JM's FFT ada conv
Signed-off-by: Jan Buethe <jbuethe@amazon.de>
2023-09-13 16:31:29 +02:00
Jean-Marc Valin
b24c7b433a
Remove --has-gain and --passthrough-size args
Couldn't be used anymore
2023-09-12 22:58:34 -04:00
Jean-Marc Valin
2f8b36d691
Add conditioning interpolation, fwconv layer 2023-09-12 22:50:48 -04:00
Jean-Marc Valin
72c5ea4129
Only use one frame of pre-loading 2023-09-12 22:50:48 -04:00
Jean-Marc Valin
108b75c4b1
Randomly double the training sequence length
Helps with stability with little cost in training speed
2023-09-12 22:50:48 -04:00
Jean-Marc Valin
d54b9fb49a
Adds skip connections 2023-09-12 22:50:47 -04:00
Jean-Marc Valin
fb570ed8bb
5-tap pitch predictor 2023-09-12 22:50:47 -04:00