Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

To aux or not to aux? #332

Closed
macarbonneau opened this issue Jan 6, 2020 · 1 comment
Closed

To aux or not to aux? #332

macarbonneau opened this issue Jan 6, 2020 · 1 comment

Comments

@macarbonneau
Copy link

Hello!
First of all thank you for the repo. I tried to train my own universal WavRNN, but cannot get it to generate quality samples. I used the config file that is provided here: https://github.com/erogol/WaveRNN

However, here #221 (comment) I can download a trained model and peek at the configuration.

In the latter case, the aux_net and upsamling net from Fatchord is used. In the former case, these augmentations are not used.

Here is my question: Is it possible to train a universal WavRNN without the upsampling and auxiliary networks?

@erogol
Copy link
Contributor

erogol commented Jan 7, 2020

In my case, it was not possible to remove any of these. If you remove, the quality degrades much. One option for upsampling net is to estimate its output's mean value and just deterministically upsample reaching the same mean in inference time.

All these comments regarding LJSpeech dataset. I did not try any of these with a multi speaker dataset.

@erogol erogol closed this as completed Jan 7, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants