Microsoft Cognitive Speech - Text To Speech - Questions - Community

Gilvan Gomes

Brazil

Asked Jun 2017 — Edited Jun 2017

Microsoft Cognitive Speech - Text To Speech

Hello y'all, I've created a C# project and I'm already able to do basically the same as the Bing Speech Recognition plugin does, but I also need to make my EZ-B (JD) "speaks" in another language. I've installed a Microsoft voice, but that's a pretty bad one, and the Azure platform offers two very nice voices to be used.

Right now I'm stuck with how I send the voice received from Azure to the EZ-B. Azure offers a variety of audio formats.

Has anyone tried this before? I've gone through some of the tutorials in the SDK but couldn't find one that does something like that.

Thanks! Gilvan

Jump to end

Upgrade to ARC Pro

Get access to the latest features and updates before they're released. You'll have everything that's needed to unleash your robot's potential!

Compare Pro Features View Subscription Plans

Mickey666Maus

Germany

#1 Jun 2017

HTML5 Speech Synthesis works pretty good and has a variety of voices to offer...

https://stackoverflow.com/questions/25336428/html5-speech-synthesis?rq=1

Mickey666Maus

Germany

#2 Jun 2017

This would be the full documentation

https://dvcs.w3.org/hg/speech-api/raw-file/tip/speechapi.html

Gilvan Gomes

Brazil

#3 Jun 2017

Thanks @Mickey666Maus, that sounds great. But my question is on how to send the audio received to the EZ-B. I guess I'd have the same problem there.

ptp

PRO

USA

#4 Jun 2017

@Gilvan,

Check the UniversalBot code http://synthiam.com/Products/ARC

Browsing the code you have information needed to send the sound data to EZB.

I've done that before, but i can't find the code.

ptp

PRO

USA

#5 Jun 2017

This is related to the Microsoft Cognitive API, maybe DJ is working or will be working to support that feature too.

@DJ ?

ptp

PRO

USA

#6 Jun 2017

@Mickey666Maus,

The api you mentioned

javascript example code:


var speech = new SpeechSynthesisUtterance('Ola Brazil!');
speech.lang = 'pt-BR';
window.speechSynthesis.speak(speech);

is only supported within the browser (not all) although chrome handles pretty well.

some complains: http://ejb.github.io/2015/06/07/html5-speech-synthesis-api.html

even if you manage to launch the chrome engine (v8) like ARC does with blocky editor, you don't have a way to extract the voice sound.

https://stackoverflow.com/questions/21905583/record-html5-speechsynthesisutterance-generated-speech-to-file

Still a neat idea for the web...

Mickey666Maus

Germany

#7 Jun 2017

@ptp the api works quiet well...and to me it is kind of a good work around to get different voices and languages going. But you are totally right, it is only working from within a browser, so I was kind of pointing to the wrong direction I guess!

This is a working example of a web based client! Which can also send data over to ARC, but cannot be called from within ARC!

http://www.downtown-tattoo.de/robotics/test123.html

Mickey666Maus

Germany

#8 Jun 2017

To connect to the ARCs server you would just have to make an XMLHttpRequest(); to call eg a ControlCommand() in ARC...

But the limitation is clearly that ARC cannot send data to the browser I guess? At least I did not find a solution on this!

Gilvan Gomes

Microsoft Cognitive Speech - Text To Speech

Upgrade to ARC Pro

Products

Community

Support

About