-
Notifications
You must be signed in to change notification settings - Fork 152
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to add pitch changes to a sentence #148
Comments
Pitch can be adjusted with the following Intonation features.
The voice pitch varies less when you reduce the stddev. This works with most of the voices, with consistency on the pitch being sung. (for me, Shouldn't be too difficult adding adjusting this feature with ssml https://github.com/MycroftAI/mimic/blob/master/src/synth/cst_ssml.c. What might be more difficult is mapping these numbers to something easier to use as a musician. ~~~( Edit: Google and Cepstral both define their standards with relative semitones.
|
Pitch is supported since #168 was merged Your specific example is a little more complicated What you want to first do is some pre-processing (perhaps with lexconvert.py?) to split your words into syllables. And you'd also need to convert the pitch name to frequency (using a table like this). <prosody pitch='349.23Hz' range='0'><phoneme ph="D EY">Dai</phoneme>
<prosody pitch='293.66' range='0'><phoneme ph='S IY'>sy</phoneme> Then you also want #190 to be supported for duration |
|
On June 24, 2020 10:26:26 AM EDT, shyam3089 ***@***.***> wrote:
>
>
> Pitch is supported since #168 was merged
>
> Your specific example is a little more complicated
>
> What you want to first do is some pre-processing (perhaps with
lexconvert.py?) to split your words into syllables. And you'd also need
to convert the pitch name to frequency (using [a table like
this](https://pages.mtu.edu/~suits/notefreqs.html)).
>
> ```
> <prosody pitch='349.23Hz' range='0'><phoneme ph="D EY">Dai</phoneme>
> <prosody pitch='293.66' range='0'><phoneme ph='S IY'>sy</phoneme>
> ```
>
> Then you also want #190 to be supported for duration
In which directory I can find `lexconvert.py`?
I installed mimic in /home/pi/mimic1 in a Raspberry Pi
It's an external project. Homepage is https://ssb22.user.srcf.net/gradint/lexconvert.html
(You can `wget https://ssb22.user.srcf.net/gradint/lexconvert.py`)
|
I'm looking to switch from Festival to a derivative of flite and looking for something with equivalent functionality to the PITCH markup, e.g.
DURATION BEATS="1.0,1.0" PITCH NOTE="F4,D4" Daisy PITCH DURATION
DURATION BEATS="1.0,1.0" PITCH NOTE="Bb3,F3" Daisy PITCH DURATION
[edit: stripped some XML characters]
from the Festival examples. I see that flite has support for local volume and rate changes via SSML attributes on the PROSODY tag. Is the pitch attribute on the road map? Or even possible?
I see Sinsy, also based on flite, has the ability to change pitch however it does it through markup via labels which seems something entirely different to the PROSODY tags.
The text was updated successfully, but these errors were encountered: