- Jul 10, 2021
-
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
Avoids having to compute a sigmoid
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
It no longer overwrites its input vector
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
The 4* is now stored in the table to avoid computing it in the loop
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
Saves on the MDense/softmax computation since we only need to compute 8 values instead of 256.
-
- Jun 30, 2021
-
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-
- Jun 29, 2021
-
-
Jean-Marc Valin authored
Using rational function approximation for tanh() and sigmoid.
-
- Jun 26, 2021
-
-
Jean-Marc Valin authored
-
- Jun 25, 2021
-
-
Jean-Marc Valin authored
CuDNNGRU and GRU don't use the same weight format
-
- Jun 24, 2021
-
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-
- Jun 21, 2021
-
-
Jean-Marc Valin authored
This isn't necessary since valid exponents can't flip the sign bit
-
- Jun 18, 2021
-
-
Jean-Marc Valin authored
When implementing using SSSE3 or AVX2, our dot products can saturate if two adjacent weights sum to more than 127.
-
Jean-Marc Valin authored
Not sure why CuDNNGRU doesn't get used by default, but we need to explicitly use it to get things to run fast.
-
- Feb 01, 2021
-
-
Jean-Marc Valin authored
-
- Jan 18, 2021
-
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
was increased too much in 713d53e8a
-
- Jan 16, 2021
-
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-
Jean-Marc Valin authored
-