Commit graph

6 commits

Author SHA1 Message Date
Jean-Marc Valin
d61f7e00f8 Fix missing transpose in the sparity code
CuDNNGRU and GRU don't use the same weight format
2021-06-25 13:43:37 -04:00
Jean-Marc Valin
c7ba313a67 Adding extra constraint to avoid saturation for SSE/AVX2
When implementing using SSSE3 or AVX2, our dot products can saturate
if two adjacent weights sum to more than 127.
2021-06-18 17:39:35 -04:00
Jean-Marc Valin
237245f815 Support for multi-GPU training
Not sure why CuDNNGRU doesn't get used by default, but we need
to explicitly use it to get things to run fast.
2021-06-18 13:20:43 -04:00
Jean-Marc Valin
1657bae024 WIP: Adding a constraint 2021-01-16 02:11:19 -05:00
Jean-Marc Valin
cc28518699 wip 8x4 sparseness 2021-01-16 02:11:19 -05:00
Jean-Marc Valin
90fec91b12 Convert training code to Tensorflow 2 2020-08-19 14:27:07 -04:00