Skip to content
GitLab
Explore
Sign in
Register
Primary navigation
Search or go to…
Project
Opus
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Wiki
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Container Registry
Model registry
Operate
Environments
Monitor
Incidents
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Terms and privacy
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
Xiph.Org
Opus
Commits
153e43aa
Commit
153e43aa
authored
3 years ago
by
Jean-Marc Valin
Browse files
Options
Downloads
Patches
Plain Diff
update instructions
parent
7fdca7f0
No related branches found
No related tags found
No related merge requests found
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
dnn/README.md
+4
-5
4 additions, 5 deletions
dnn/README.md
with
4 additions
and
5 deletions
dnn/README.md
+
4
−
5
View file @
153e43aa
...
...
@@ -65,13 +65,13 @@ This codebase is also meant for research and it is possible to train new models.
1.
Generate training data:
```
./dump_data -train input.s16 features.f32 data.
u8
./dump_data -train input.s16 features.f32 data.
s16
```
where the first file contains 16 kHz 16-bit raw PCM audio (no header) and the other files are output files. This program makes several passes over the data with different filters to generate a large amount of training data.
1.
Now that you have your files, train with:
```
./training_tf2/train_lpcnet.py features.f32 data.
u8
model_name
./training_tf2/train_lpcnet.py features.f32 data.
s16
model_name
```
and it will generate an h5 file for each iteration, with model
\_
name as prefix. If it stops with a
"Failed to allocate RNN reserve space" message try specifying a smaller --batch-size for train
\_
lpcnet.py.
...
...
@@ -79,14 +79,13 @@ This codebase is also meant for research and it is possible to train new models.
1.
You can synthesise speech with Python and your GPU card (very slow):
```
./dump_data -test test_input.s16 test_features.f32
./training_tf2/test_lpcnet.py test_features.f32 test.s16
./training_tf2/test_lpcnet.py
lpcnet_model_name.h5
test_features.f32 test.s16
```
Note the .h5 is hard coded in test_lpcnet.py, modify for your .h5 file.
1.
Or with C on a CPU (C inference is much faster):
First extract the model files nnet_data.h and nnet_data.c
```
./dump_lpcnet.py lpcnet
15_384_10_G16_64
.h5
./dump_lpcnet.py lpcnet
_model_name
.h5
```
and move the generated nnet_data.
*
files to the src/ directory.
Then you just need to rebuild the software and use lpcnet_demo as explained above.
...
...
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment