
joesox (PRO, USA) asked:
Hello Athena,
I'm seeking guidance on how to develop a script or understand the process within ARC for capturing voice input using an external microphone, as recommended. My goal is for the JD robot's built-in speaker to repeat what it hears or to listen continuously. I'm currently experimenting with the Speech Recognition module, but I'm struggling with capturing spontaneous sentences rather than just relying on pre-programmed prompts. Could you provide some insights or direction on this matter? Thank you!
Related Hardware: JD Humanoid
Related Controls: Google Speech Recognition, Bing Speech Recognition
Hello there! It sounds like you’re interested in giving your JD Humanoid some live conversational behavior - capturing spontaneous voice input via an external mic and then having the robot speak back what it hears (or process it further). Here are some suggestions and steps to get you started:
1. Choose Your Speech Recognition Module
ARC offers both the Google Speech Recognition and Bing Speech Recognition skills. You can experiment with either; many users find:
Make sure your chosen skill is configured correctly (loading credentials, setting up language options, etc.) as described in the documentation:
Google Speech Recognition
Bing Speech Recognition
2. Configure Your External Microphone
An external microphone is key for capturing spontaneous input reliably. Ensure that:
On Windows, you can right-click the speaker icon in the system tray, open Sound Settings, and in the Input section adjust the device and volume.
3. Capturing Spontaneous Speech
By default, both the Google and Bing Speech Recognition skills in ARC have options to ‘Start Listening’ either via a user action (a button press) or automatically using a wake word. To capture spontaneous speech, you have a couple of approaches:
Using a Wake Word or Auto-Record Feature
Using a Script to Control the Listening
You can create a script that continuously polls the speech recognition output variable (for example, the global variable $GoogleSpeech or $BingSpeech) and then processes it. For instance, here’s a sample script in JavaScript (remember ARC prefers print() for console output):
Notes on the Script:
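As a minimal sketch of the polling approach described above (an illustration, not Synthiam's reference script), the loop below reads the recognition variable, echoes any new phrase, and stops the recognizer while the robot speaks so it does not hear itself. The adapter object `arc`, the function name `echoLoop`, and the option names are hypothetical. The ARC calls named in the trailing comment (getVar, setVar, Audio.sayEZBWait, ControlCommand, sleep, print) and the skill title "Bing Speech Recognition" are assumptions to check against the ARC scripting manual and your own project.

```javascript
// Poll-and-echo sketch. All ARC-specific calls are injected through `arc`
// (a hypothetical adapter) so the logic itself is plain JavaScript.
function echoLoop(arc, options) {
  var variableName = options.variableName; // e.g. "$BingSpeech" or "$GoogleSpeech"
  var pollMs = options.pollMs;             // delay between polls, in milliseconds
  var maxPolls = options.maxPolls;         // loop bound; Infinity runs until the script is stopped
  var spoken = [];                         // phrases echoed so far

  for (var i = 0; i < maxPolls; i++) {
    var raw = arc.getVar(variableName, "");
    var heard = (raw === undefined || raw === null) ? "" : String(raw).trim();

    if (heard !== "") {
      arc.print("Heard: " + heard);
      arc.stopListening();           // avoid recognizing the robot's own voice
      arc.say(heard);                // repeat the phrase through the robot's speaker
      arc.setVar(variableName, "");  // clear so the same phrase is not echoed twice
      arc.startListening();          // resume capturing speech
      spoken.push(heard);
    }
    arc.sleep(pollMs);
  }
  return spoken;
}

// Inside ARC the adapter might be wired like this (assumed API names and
// skill title; verify them before use):
//
// var arc = {
//   getVar: function (name, fallback) { return getVar(name, fallback); },
//   setVar: function (name, value) { setVar(name, value); },
//   say: function (text) { Audio.sayEZBWait(text); },
//   stopListening: function () { ControlCommand("Bing Speech Recognition", "StopListening"); },
//   startListening: function () { ControlCommand("Bing Speech Recognition", "StartListening"); },
//   print: function (text) { print(text); },
//   sleep: function (ms) { sleep(ms); }
// };
// echoLoop(arc, { variableName: "$BingSpeech", pollMs: 250, maxPolls: Infinity });
```

Clearing the variable after each echo is a design choice: it lets the same sentence be repeated if it is spoken twice, at the cost of overwriting a value other scripts might also want to read.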
Use the StartListening and StopListening commands to cycle the recognizer. You can also use continuous listening (using wake-word features) to avoid manual start/stop.
4. Enhancing the Experience
ControlCommand()).
5. Next Steps & Resources
I hope these suggestions help you advance your project! Do you have any questions on any specific part, or would you like further examples for integrating these features?
I am a customer support GPT-powered AI and still learning. If you have doubts about my response, please use the support documentation to verify my answer.