Thumbnail

Advanced Speech Recognition

by Microsoft

This is an alternative to the Bing Speech Recognition module for ARC. This skill is a paid service from Microsoft.

Requires ARC v40 (Updated 2/1/2021)

How to add the Advanced Speech Recognition robot skill

  1. Load the most recent release of ARC (Get ARC).
  2. Press the Project tab from the top menu bar in ARC.
  3. Press Add Robot Skill from the button ribbon bar in ARC.
  4. Choose the Audio category tab.
  5. Press the Advanced Speech Recognition icon to add the robot skill to your project.

Don't have a robot yet?

Follow the Getting Started Guide to build a robot and use the Advanced Speech Recognition robot skill.

How to use the Advanced Speech Recognition robot skill

This is an alternative to the Bing Speech Recognition module for ARC. This skill allows you to specify your own Azure Cognitive Service credentials using the Bing Speech Recognition cloud service. This is by far the most accurate speech recognition service that we've ever used.

Main Window

User-inserted image

1. Connect API Button Once you have entered the API Key in the advanced configuration you can connect to it with this button.

2. Pause Checkbox This checkbox pauses the detection from the audio input device. When not in use keep the skill paused so that you aren't using up detection time that you are paying for.

3. Audio Waveform This gives visual feedback that your audio input device (microphone) is configured correctly and is picking up voice/sounds.

4. Response Display Here you will get feedback from the Speech Recognition API. It will show the text version of your detected words.

Configuration

User-inserted image

1. Language The language that will be used for recognition.

2. Variables The variable which holds the text from the speech recognizer. This may be used in your Script for determining what was spoken.

3. Script Script that will execute for every detected phrase. If there is no match for recognition, this script will not be called.

4. Auto Record If enabled, the skill will begin recording audio when it has been detected. The threshold is adjusted in the Level Threshold setting.

5. Level Threshold This is the threshold to be used for both Auto Recording (if enabled) and when to stop recording. If you find that even in manual mode the recording manually stops too quick before your sentence is complete, lower this value.

6. Silence Count How many times the energy volume is below the level threshold before stop recording.

7. Max Recording Currently not enabled.

How to Use Advanced Speech Recognition Skill

In order to use this robot skill, you will need a Microsoft Azure account and an API key. There are many steps to create an Azure key, which isn't very easy for the average person. That is why we made a free version available here. However, if you wish to unlock the full functionality of this skill and manage your own paid subscription, you can create an Azure account. The azure account allows you to create a Cognitive Speech Recognition service. Locate the Region and API key to use in the Advanced tab of this skill. You can create an Azure account by clicking here.

  1. Add a Cognitive Services resource in your azure portal account.

  2. Click add and search for the "Speech" service in the Marketplace.

  3. Create the Speech service and then enter in a name (ex: Test), a subscription (ex: Pay-as-You-Go), Location (ex: West US), pricing tier (ex: Free  0) and create a new resource group (ex: BingSpeechTest).

  4. Once your deployment is complete, click the "Go to resource" button.

  5. In the left menu select "Keys and Endpoint".

  6. Click the "Show Keys" button and then copy the generated key in the KEY 1 field.

  7. If you don't remember the location you selected above, you can find the location in the Location field.

User-inserted image

  1. Add the Advanced Speech Recognition Skill to your ARC project (Project -> Add Skill -> Audio -> Advanced Speech Recognition).

  2. In the Advanced tab of the Advanced Speech Recognition Configuration menu, enter the region and paste the API key.

User-inserted image

  1. Save your configuration and then start speaking to activate the speech recognition.

*Note: When not in use ensure that the Advanced Speech Recognition skill is PAUSED, as it will be charging your account as it is used.

Script Samples

Add this code to the configuration of this skill to send detected speech to the PandoraBot control.

ControlCommand("PandoraBot", SetPhrase, $BingSpeech)

User-inserted image

Video

Here's an example of the skill in action combined with the Cognitive Vision and Cognitive Emotion services.

Requirements

The Advanced Speech Recognition skill requires you to pay for an API key from Microsoft. We provide another version of this which does not require an API key, and you can install it from here.

This service requires an internet connection which means that a second USB WiFi adapter or an ethernet connection will be needed. Read about having 2 network connections here.

Resources

When this control detects a phrase, it will be assigned to the variable and the specified script will execute. With this model, you can have this skill send the detected speech to a PandoraBot skill using ControlCommand(). Here's a sample project: bingsearchtest.EZB


ARC Pro

Upgrade to ARC Pro

Become a Synthiam ARC Pro subscriber to unleash the power of easy and powerful robot programming

#1  

Let me ask how to add any language pack in the world to ez_builder?

#2  

Hello

Language selection is no more working ,  I just try to switch to fr-FR and recognition is english one,

User-inserted image

User-inserted image

PRO
USA
#3  

Is this a new version? 11/25/2019?

#4  

ARC Designer Beta Version 2019.11.22.00

Ang plugin Version 24

PRO
Synthiam
#5  

Updated (version 30) to fix errors that popped up when unpausing and pausing the listener

PRO
Synthiam
#6  

Updated the plugin to use microsoft's latest library (1.10.0)

PRO
Synthiam
#7  

Updated: Uses the internal ARC VAD (voice activity detection) rather than continuous speech detection. This means the API usage will be much less because only spoken (Detected) phrases are sent to the API rather than continuous recording.

#9  

Quote:

Is this any more accurate than Bing?
Good question. I was under the impression Bing was the "Gold Standard" and the best. I'd love it if someone would do a comparison test. Maybe you could give both a try and let us know??

PRO
Synthiam
#10   — Edited

It’s identical. The first paragraph explains that this is the same is Bing. It just allows your to use your own credentials and pay your own service fees.

User-inserted image

#11  

It worked well but suddenly I'm getting error like this Script Done (00:00:00.0230097)

Session started

Concelled. Timeout no recognition result received Sessionid e9ea404674a94e7eba781846d6682911

Response.

Session stopped,

pronunciation assessment_config_to_json pronunciation assessment_config_to_json 54941994 (SymFromAddr() error. Attempt to access invalid address) -54941994 (SymFromAddr() error Attempt to access invalid address) pronunciation assessment_config to json pronunciation_assessment_config.to json

i tried bing speech recognition skill also I'm getting error like this Data Sore!Error in response received . The remote name could not be resolved "ecimen synthsam comHeard wake word 10.94)Recording...Wake word. Tod NoizySersdrag data (168046 Byte)Data Sert!Emor in response received: The remote name could not be resolved: admin synthiam.com

@Athena Could you please let me know How to resolve this in advanced speech recognition skill?

PRO
Canada
#12  

It sounds like you're experiencing connectivity issues with the Advanced Speech Recognition skill in ARC. Here are a few steps you can take to troubleshoot and resolve the issue:

  1. Check Internet Connection: Ensure that your internet connection is stable. The Advanced Speech Recognition skill requires an active internet connection to communicate with the Azure Cognitive Services.

  2. Verify API Key and Region: Double-check that the API key and region you entered in the Advanced Speech Recognition configuration are correct. Any mismatch can lead to connectivity issues.

  3. Firewall and Network Settings: Ensure that your firewall or network settings are not blocking the connection to the Azure Cognitive Services. You may need to whitelist the service or adjust your network settings.

  4. Update ARC: Make sure you are using the latest version of ARC. Updates often include bug fixes and improvements that might resolve your issue.

  5. Test with Another Network: If possible, try connecting your device to a different network to see if the issue persists. This can help determine if the problem is network-specific.

  6. Check Azure Service Status: Visit the Azure status page to ensure there are no outages or issues with the Cognitive Services in your region.

  7. Review Logs: Check the logs in ARC for any additional error messages or details that might provide more insight into the issue.

  8. Reconfigure the Skill: As a last resort, try removing the Advanced Speech Recognition skill from your project and re-adding it. Re-enter your API key and region to see if this resolves the issue.

If you've tried all these steps and the issue persists, please let me know, and we can explore further troubleshooting options.


I am a customer support GPT-powered AI and still learning. If you have doubts about my response, please use the support documentation to verify my answer.