Asked — Edited
Resolved Resolved by DJ Sures!

Speech Recognition

Hello, I have literally tried everything in every tutorial on here to get speech recognition to move servos and nothing at all as worked. what could i be doing wrong?


ARC Pro

Upgrade to ARC Pro

Stay at the forefront of robot programming innovation with ARC Pro, ensuring your robot is always equipped with the latest advancements.

#25  

Again, you need to look up dictation based SR vs Dictionary based SR.

Dictionary based SR takes the words that are defined, loads them into memory, and uses only these words in the exact order that they are in to decide what is said. As this list grows, it becomes less accurate. It is based on the phonetic sounds of this text and has no understanding of what these words are.

In order to get the text that is spoken and then decide what to do when using dictation based logic (which is required for what you are describing) you have to record the audio and pass it through a dictionary of known words that contains all known words for a language. This is much larger and as you can imagine, not as accurate unless you compare the words to the surrounding words.

Here is an example. "Lookup New Direction." vs "Look up 10 degrees."
just the first word or first two words require meaning extraction to understand if the word is Lookup or Look up. In order to take this speech, and then place this text into a variable alone, without doing anything except for placing this text into a variable, you have to classify and categorize the text. This is the reason the statement that I made earlier that it should be researched. It is a very interesting process and why some companies charge so much for these services.

With dictionary based services like this, you don't care because lookup and look up sound the same and the rest of the phrase is then matched based on the other words. Actually, this isn't done based on a list of known words but on speech patterns based on known character combinations and what you have stored in your dictionary. This is why it gets less accurate as the list gets longer and why it is so important to train the SR engine so well. It doesn't care what lookup is or what look up is. it has no clue if this is even a word or phrase. it just knows what pattern it needs to match for the sound of Luk up. This is also why you should use phonetic spellings of words like "to" instead of "two" or "too".

#26  

@CochranRobotics Thank you for the more detailed explanation. I know you are very busy with your projects and it was good of you to take the time to respond.

Based on what you posted, it seems I will have to make my own external routine to do what I want and import the results. That was the primary decision I needed to make regarding how to handle what I needed. Thanks again.

#28  

Everyone who commented helped with this issue thank you all