Status log #9
Comments
Thank you for your great work!
Sure. Python 2, TF 1.3, Linux.
Thanks!
Any plan to add "Joint Representation of Characters and Phonemes", as described in Deep Voice 3 (Section 3.2)? Also, as my experiments,
Do you have the synthesized speech files somewhere?
Hello Kyubyong, we pulled your code and tested it with the LJ Speech data. We found that the synthesized wav files have nothing to do with the content of "test_sents.txt". Do you have any guidance for us?
22 Nov. 2017. I have completed the first draft. I've tested the current hyperparameters only on the Nick dataset, which is 8 hours long, and not on LJ, which is 24 hours long. The results were not good, but not terrible. Since I had no success with the same hyperparameters as the original paper, I changed some of them. Among the changes are the application of dilation and the use of positional embedding instead of positional encoding. The attention plot of the last layer looks somewhat monotonic, but not clearly so. I think the key signal that the network works is, of course, the attention plots.
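
As a rough illustration of what "monotonic" means for an attention plot, the sketch below scores an attention matrix by how often its peak moves forward (or stays put) from one decoder step to the next. This is only a minimal sketch: the function name `monotonic_fraction` and the assumed layout `(decoder_steps, encoder_steps)` are hypothetical and not taken from this repository, and the toy matrices stand in for real attention weights.

```python
import numpy as np


def monotonic_fraction(attention):
    """Fraction of consecutive decoder steps whose attention peak does not move backwards.

    attention: 2-D array, assumed shape (decoder_steps, encoder_steps).
    """
    attention = np.asarray(attention, dtype=float)
    if attention.ndim != 2 or attention.shape[0] < 2:
        raise ValueError("expected a 2-D array with at least two decoder steps")
    # Most-attended encoder position for each decoder step.
    peaks = attention.argmax(axis=1)
    # Movement of the peak between consecutive decoder steps.
    steps = np.diff(peaks)
    return float(np.mean(steps >= 0))


# Toy inputs (not real model output): a clean diagonal and a row-shuffled one.
diagonal = np.eye(5)
shuffled = np.eye(5)[[0, 3, 1, 4, 2]]

print(monotonic_fraction(diagonal))
print(monotonic_fraction(shuffled))
```

A score near 1 suggests a roughly monotonic alignment, while a noticeably lower score suggests the peak jumps back and forth. It only looks at the argmax per step, so a diffuse but forward-moving attention map would still score well; inspecting the plot itself remains the more reliable check.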