
jbrawley9906
USA

Hello, I have literally tried everything in every tutorial on here to get speech recognition to move servos, and nothing at all has worked. What could I be doing wrong?
Again, you need to look up dictation-based SR vs. dictionary-based SR.
Dictionary-based SR takes the words and phrases that you have defined, loads them into memory, and uses only those phrases, in the exact word order you entered, to decide what was said. As this list grows, it becomes less accurate. It is based on the phonetic sounds of the text and has no understanding of what the words mean.
To get the spoken text and then decide what to do with it using dictation-based logic (which is required for what you are describing), you have to record the audio and pass it through a dictionary that contains all known words for a language. That dictionary is much larger and, as you can imagine, recognition is not as accurate unless each word is compared to the words surrounding it.
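To make the dictionary-based idea concrete, here is a minimal sketch in Python. It is not how ARC or any particular SR engine is implemented. The phrase list, servo names, and positions are made up for illustration, and text similarity from the standard library's difflib stands in for the acoustic pattern matching a real engine does. The point is only that the recognizer can return nothing but a phrase that is already in the list.

```python
import difflib

# Hypothetical phrase list: each stored phrase maps to a (servo, position) pair.
# These names and numbers are assumptions for illustration only.
COMMANDS = {
    "look up": ("head_tilt", 120),
    "look down": ("head_tilt", 60),
    "turn left": ("head_pan", 30),
    "turn right": ("head_pan", 150),
}

def match_command(heard, cutoff=0.6):
    """Return the action for the closest stored phrase, or None if nothing is close."""
    candidates = difflib.get_close_matches(
        heard.lower(), list(COMMANDS), n=1, cutoff=cutoff
    )
    return COMMANDS[candidates[0]] if candidates else None

print(match_command("luk up"))   # close enough to the stored phrase "look up"
print(match_command("xyz"))      # nothing in the list is close, so no action
```

Anything outside the list is simply rejected, and every phrase you add gives each input one more candidate to be confused with.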
Here is an example. "Lookup New Direction." vs "Look up 10 degrees."
Just the first word or two requires meaning extraction to decide whether the speaker said "Lookup" or "Look up". To take this speech and do nothing more than place the resulting text into a variable, you still have to classify and categorize the text. This is why I said earlier that the topic should be researched. It is a very interesting process, and it is why some companies charge so much for these services.
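As a rough illustration of the extra step dictation requires, the sketch below takes text that a dictation engine has already produced and tries to work out what was meant. The function name, the action labels, and the two patterns are all hypothetical. A real system would use far more context than a pair of regular expressions.

```python
import re

def interpret(transcript):
    """Guess the intent of a dictated sentence (illustrative rules only)."""
    text = transcript.lower().strip().rstrip(".")

    # "look up 10 degrees" -> a movement command with a numeric argument
    m = re.fullmatch(r"look ?up (\d+) degrees?", text)
    if m:
        return ("move_servo", int(m.group(1)))

    # "lookup new direction" -> a search request with a free-text argument
    m = re.fullmatch(r"look ?up (.+)", text)
    if m:
        return ("search", m.group(1))

    return ("unknown", text)

print(interpret("Look up 10 degrees."))
print(interpret("Lookup New Direction."))
```

Both sentences start with the same sounds, so the decision has to come from the words that follow, which is the classification work described above.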
With dictionary-based services like this, you don't care, because "lookup" and "look up" sound the same and the rest of the phrase is then matched based on the other words. Actually, the matching isn't done against a list of known words but against speech patterns derived from known character combinations and what you have stored in your dictionary. This is why it gets less accurate as the list gets longer and why it is so important to train the SR engine well. It doesn't care what "lookup" is or what "look up" is. It has no clue whether this is even a word or a phrase. It just knows what pattern it needs to match for the sound of "luk up". This is also why you should use phonetic spellings of words like "to" instead of "two" or "too".
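The "same sound, same pattern" point can be sketched with a toy normalizer. This is an assumption-heavy stand-in: a real engine compares acoustic patterns, not text, and the small homophone table below is invented for the example. It only shows why spellings that sound alike end up as a single entry.

```python
# Hypothetical homophone table: spellings that sound alike collapse to one form.
HOMOPHONES = {"two": "to", "too": "to", "lookup": "look up"}

def sound_key(phrase):
    """Reduce a phrase to a crude 'how it sounds' key (text-based stand-in)."""
    words = phrase.lower().replace(".", "").split()
    return " ".join(HOMOPHONES.get(w, w) for w in words)

print(sound_key("Lookup") == sound_key("look up"))          # True
print(sound_key("turn two") == sound_key("turn too"))       # True
print(sound_key("look up") == sound_key("look down"))       # False
```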
@CochranRobotics Thank you for the more detailed explanation. I know you are very busy with your projects and it was good of you to take the time to respond.
Based on what you posted, it seems I will have to make my own external routine to do what I want and import the results. That was the primary decision I needed to make regarding how to handle what I needed. Thanks again.
@wbs, if you want to try what speech recognition is like in dictation mode, use the PandoraBot control: https://synthiam.com/Tutorials/Help.aspx?id=189
See if you get the recognition accuracy that you desire before creating a plugin.
Everyone who commented helped with this issue. Thank you all.