WaveRNN | WaveRNN Vocoder + TTS | Speech library
kandi X-RAY | WaveRNN Summary
kandi X-RAY | WaveRNN Summary
Pytorch implementation of Deepmind's WaveRNN model from Efficient Neural Audio Synthesis.
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
- Generate the wave function
- Fold a tensor with overlap
- Sample from discretized mixture
- Decode the value of the mu - law function
- Performs training loop
- Save a checkpoint
- Helper function for training data
- Discretized mixture logistic loss
- Compute the logarithm of x
- Save an attention matrix
- Cleans up whitespace
- Transliterate transliteration
- Processes a WAV file
- Clean English text
- Reconstruct a waveform from a given mel file
- Reads text from CSV file
- Loads the vocab dataset
- Parse the pronunciation file
- Forward computation
- Restore a checkpoint
- Performs forward computation
- Create a simple table from a list of tuples
- Load TTS dataset
- Generate a welspectrogram from a file
- Collate the vocab
- Convert text to sequence
WaveRNN Key Features
WaveRNN Examples and Code Snippets
CUDA_VISIBLE_DEVICES=0 python vocoder_train.py -g --syn_dir datasets/vctk/synthesizer vctk datasets/vctk
Community Discussions
Trending Discussions on WaveRNN
QUESTION
I call same model on same input twice in a row and I don't get the same result, this model have nn.GRU
layers so I suspect that it have some internal state that should be release before second run?
How to reset RNN hidden state to make it the same as if model was initially loaded?
UPDATE:
Some context:
I'm trying to run model from here:
https://github.com/erogol/WaveRNN/blob/master/models/wavernn.py#L93
I'm calling generate
:
https://github.com/erogol/WaveRNN/blob/master/models/wavernn.py#L148
Here it's actually have some code using random generator in pytorch:
https://github.com/erogol/WaveRNN/blob/master/models/wavernn.py#L200
https://github.com/erogol/WaveRNN/blob/master/utils/distribution.py#L110
https://github.com/erogol/WaveRNN/blob/master/utils/distribution.py#L129
I have placed (I'm running code on CPU):
...ANSWER
Answered 2019-May-18 at 00:09I believe this may be highly related to Random Seeding. To ensure reproducible results (as stated by them) you have to seed torch
as in this:
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install WaveRNN
Python >= 3.6
Pytorch 1 with CUDA
If you want to use TTS functionality immediately you can simply use:. This will generate everything in the default sentences.txt file and output to a new 'quick_start' folder where you can playback the wav files and take a look at the attention plots.
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page