2
votes

I am very new to learn cognitive services of Text-to-Speech (TTS) of Microsoft Azure. I successfully able to convert the given text into an audio file by using TTS services of Azure.It works fine when I'm having a single voice element in my SSML XML document. The example of working SSML is;

<speak version="1.0" xml:lang="en-US">
  <voice xml:lang="en-US" xml:gender="Male" name="en-US-Jessa24kRUS"> 
       Hello, this is my sample text to convert into audio? 
  </voice>
</speak>

But, when I'm having multiple voice tags(on gender base), then it causes an error. The SSML of it is:

<speak version="1.0" xml:lang="en-US">
  <voice xml:lang="en-US" xml:gender="Male" name="en-US-Guy24kRUS"> What’s your name? </voice>
  <voice xml:lang="en-US" xml:gender="Female" name="en-US-Jessa24kRUS"> My name is Cindy Smith. Do you know John Silver?</voice>
  <voice xml:lang="en-US" xml:gender="Male" name="en-US-Guy24kRUS"> John and I are old friends. </voice>
  <voice xml:lang="en-US" xml:gender="Female" name="en-US-Jessa24kRUS"> John just joined our company as a salesperson. </voice>
  <voice xml:lang="en-US" xml:gender="Male" name="en-US-Guy24kRUS"> That’s good news. John has been a salesperson for chemical products for many years. </voice>
  <voice xml:lang="en-US" xml:gender="Female" name="en-US-Jessa24kRUS"> I head he really likes his new job.</voice>
</speak>

And the error is:

Response status code does not indicate success: 400 (SSML must contain a maximum of 5 voice elements. Actual 6.).

It'll be a great help for me if someone explain that why its limiting me to five voice tags, while there's no limitation mentioned in documentation.

1
Documentation is outdates sometimes. - Nikolay Shmyrev
I really got stuck on it and its really very insane. I didn't able to get any solution yet from any portal and my deadlines are near. :-( - Arsman Ahmad
My condolences to you - Nikolay Shmyrev
D'you have any idea, how can we contact Microsoft regarding to an issue!! - Arsman Ahmad
I don't think you will be able to do something at Microsoft. You'd better split your dialog on chunks and then glue them together. Or try other apis like amazon polly. - Nikolay Shmyrev

1 Answers

1
votes

This is a known settings due to latency. We've been aware of and working on removing this limitation. Hope we could complete the fix and deployment in this week, if things go smoothly, we may complete earlier.