While I have been setting up some new speech recognition commands, it got me thinking. Has anyone successfully integrated sound recognition with EZ-Builder? I found a sound recognition API while doing a Google see how on the subject, and as I don't have much experience with API's (non in fact) I was curious if anyone is using something like this with your projects. I figure this could be something useful to have, recognising a door bell, a phone ringing, dog barking, music recognition ect, the possibilities are many.
Another thing that could be interesting is voice (not speech) recognition, where a robot could recognise different users by their individual voices.
Any thoughts guys?
Upgrade to ARC Pro
Unleash your creativity with the power of easy robot programming using Synthiam ARC Pro
The best one out is dragon. If you purchase the professional version, you can include the dll for speech recognition in an application for your computer. It wasn't feeble to ask people to spend that kind of money to use my app. The other option is the developer sdk. It is $5000.00.
As far as recognizing sounds, you would need to do some analysis on the recorded sound and then make a decision as to what a that sound is. You are talking about learning and making a lot of small decisions based on past knowledge that add up to a final decision. This is machine learning, which can be done and is something that I have been looking at with ez-ai. I am far from getting there though.
There are api's out that will recognize a song. The information above is how they do this. Basically, this is a pretty advanced computer programming topic. As I said, I am researching deep learning but am a long way away from understanding how to do anything with it.
Thanks for the reply. Very interesting. Funnily enough when I was reading about this I thought of your EZ-AI. I don't know if you mis-understood about speech, but I wasn't talking about speech recognition, but rather voice recognition where patten matching is used to determine different peoples voices. If indeed that was what you meant, then I didn't know Dragon did that.
It sounds like sound recognition would be something difficault to implement, but certainly possible though.
What is the api that you found? I can look at this, but it would require your bot to be constantly recording it's audio either to memory or disk. When the sound was detected, it would do something like get the last few seconds of the recording and send it to the api to have that sound recognized.
Only doing simple searches from my phone though. I am traveling right now, so can't really sit down and dig for other options.
I doubt there are any free voice biometrics APIs yet since the companies that are selling services built on it are making boatloads of money, but maybe Google will release what they did as an open API.
Both Java of course...