Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug]: SSML bookmark tag returning junk value with a specifc voice #828

Open
branaway opened this issue Jun 5, 2024 · 6 comments
Open
Assignees
Labels
bug Something isn't working in review Acknowledged and being looked at now service side issue This is a service side issue, not in SDK text-to-speech

Comments

@branaway
Copy link

branaway commented Jun 5, 2024

What happened?

for

<speak version='1.0' xmlns="http://www.w3.org/2001/10/synthesis" xmlns:mstts="https://www.w3.org/2001/mstts" xml:lang='en-US'>
        <voice name='zh-CN-XiaoxiaoMultilingualNeural'>
anything <bookmark mark="hello"/>  anything
        </voice>
      </speak>

The bookmark event returned weird characters: 배萓罵

Other models worked fine, such as: zh-CN-XiaoyuMultilingualNeural.

How does the bookmark have anything to do with the voice....

Version

1.36.0 (Latest)

What browser/platform are you seeing the problem on?

Safari

Relevant log output

No response

@branaway branaway added the bug Something isn't working label Jun 5, 2024
@branaway
Copy link
Author

branaway commented Jun 5, 2024

I was using the East Asia region, FYI

@BrianMouncer
Copy link
Contributor

@branaway do you see this happen with the none multilingual voices like zh-CN-XiaorouNeural?

@yulin-li Do you know of any open issue around book mark events an specific voices, other than the inherint limitation of the OpenAI whisper voice? I would expect any of our neural voices to work, but I'm not sure about the multilingual voices.

@branaway
Copy link
Author

branaway commented Jun 5, 2024

zh-CN-XiaorouNeural

no issue with that voice

@BrianMouncer
Copy link
Contributor

BrianMouncer commented Jun 5, 2024

zh-CN-XiaorouNeural

no issue with that voice

ok, poor data sample size, but that supports the idea that the multilingual voices may have an issue with the bookmark label.
I will follow up with our text to speech team, and see if they are aware of this.

In the meantime, you might try changing the bookmark label from hello, to either a number or some Chinese text that the multilingual engine would be less likely to mis-identify the source language of...

or

@BrianMouncer BrianMouncer added text-to-speech in review Acknowledged and being looked at now labels Jun 5, 2024
@branaway
Copy link
Author

branaway commented Jun 5, 2024

tried all various bookmarks in all kinds of symbols, languages. Did not work.

@yulin-li yulin-li added the service side issue This is a service side issue, not in SDK label Jun 5, 2024
@ForrestGumb
Copy link

I can reproduce this issue. Will investigate.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working in review Acknowledged and being looked at now service side issue This is a service side issue, not in SDK text-to-speech
Projects
None yet
Development

No branches or pull requests

5 participants