Asked
I have a question regarding the robot skill PushOver. Is it possible to integrate text input into ARC using this skill? The voice-to-text recognition on my iPhone is highly accurate, and I would like to leverage this feature alongside various skills in ARC. Could you provide an example of how this might work? For instance, commands like "Move the Big Arm to position 4000," "Turn on Digital Port 10," or "Move fingers to position 1000." Incorporating such a capability would significantly enhance the functionality of ARC in various applications.
Related Hardware EZ-B v4
Related Controls
PushOver
OpenAI ChatGPT
Below is one way you can bridge your iPhone’s voice-to-text results with ARC robot skills, using PushOver for notification feedback along the way. (Keep in mind that the PushOver skill itself is designed to send notifications from ARC to your mobile device; it isn’t an input control per se. However, you can set up a workflow where your mobile device sends text commands via push (or another method) into ARC’s global variable storage, and then use ARC scripting to parse that text into actions.)
Below is an example approach:
1. Use your iPhone’s voice-to-text capability (or a dedicated app) to send a text message (or HTTP call) that contains the command you’d like executed. For example, the text might be:
Move the Big Arm to position 4000
Turn on Digital Port 10
Move fingers to position 1000
2. Have that text arrive in ARC, for example, by using a custom robot skill or other method that saves the incoming text into a global variable (say, $TextCommand).
3. Create a JavaScript (or Python) script in ARC that reads this global variable and parses the command. In the script you can use simple conditionals to determine which robot skills to call. For example:
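A minimal sketch of such a parser, assuming commands like the ones listed in step 1. The ARC-specific calls (getVar, Servo.setPosition, Digital.set, controlCommand) exist only inside ARC and are shown in comments; the port names v1 and d10 are placeholders you would change to match your hardware. parseCommand() itself is plain JavaScript:

```javascript
// Parse a spoken/typed command into a simple action object.
// Returns { action, ... } describing what ARC should do.
function parseCommand(text) {
  var cmd = text.toLowerCase();
  var num = cmd.match(/\d+/); // first number in the command, if any
  var value = num ? parseInt(num[0], 10) : null;

  if (cmd.indexOf("big arm") >= 0 && value !== null)
    return { action: "servo", target: "bigArm", position: value };

  if (cmd.indexOf("digital port") >= 0 && value !== null)
    return { action: "digital", port: value, state: cmd.indexOf("turn on") >= 0 };

  if (cmd.indexOf("fingers") >= 0 && value !== null)
    return { action: "servo", target: "fingers", position: value };

  return { action: "unknown", raw: text };
}

// Inside ARC, the dispatch would look roughly like:
//   var result = parseCommand(getVar("$TextCommand"));
//   if (result.action == "servo" && result.target == "bigArm")
//     Servo.setPosition(v1, result.position);   // v1 = placeholder servo port
//   else if (result.action == "digital")
//     Digital.set(d10, result.state);           // d10 = placeholder digital port
//   controlCommand("PushOver", "Push", "Executed: " + getVar("$TextCommand"));
```

Returning a plain object keeps the keyword matching separate from the hardware calls, so you can extend the vocabulary without touching the dispatch section.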
4. You could trigger the above script on a timer (or with a Check Command button) so that when a new text comes in, the script parses it and calls the appropriate ARC functions (such as Servo.setPosition or Digital.set).
5. The PushOver skill is used here to send back a notification to your device so you’re aware of what action is taking place. For example, after commanding the Big Arm, the script issues:
controlCommand("PushOver", "Push", "Big Arm moving to 4000");
This gives visual feedback on your mobile device.
Using this method you’re effectively integrating your voice-to-text input (via your iPhone) to power ARC’s various skills. You can extend the parser to support additional commands or more complex natural language processing.
Keep in mind:
You will need to establish a mechanism (via a custom robot skill, web service, or manual input) to inject the text command into ARC (i.e. into $TextCommand).
The commands used in the script are just examples. Adjust the port names (like d5, d7, etc.) as needed to match your hardware setup.
You can mix in other ARC control commands if you wish (for example using Speech Recognition skills) to enable a two-way dialogue.
With this approach, you can leverage your iPhone’s accurate voice-to-text and push those commands via ARC scripting to control your robot. If you have any questions or need further clarification on customizing the script, feel free to ask!
I am a customer support GPT-powered AI and still learning. If you have doubts about my response, please use the support documentation to verify my answer.
@Athena What you are describing makes sense. Now I'd like to dig a little deeper and possibly use the IFTTT skill to receive the text and put it into the variable. Once it is in the variable, I can take it from there. Since ARC can send a trigger to IFTTT to do what it wants, how would I receive a text from IFTTT and place it into the global variable? Can you give me an example of this working?
Below is one way to set up a pathway that lets IFTTT send back a text message (or command) so that ARC can pick it up and store it into a global variable. The important point is that the IFTTT Sender robot skill is designed to send messages from ARC to your mobile device; it isn’t built as a direct listener. To receive text from IFTTT, you’ll need an external (or intermediary) web service that IFTTT can call when your phone’s voice-to-text engine produces a command. Then ARC can poll that web service and load the text into a global variable. One typical approach is as follows:
Step 1. Create an IFTTT Applet to Send the Text
In your IFTTT account, create a new applet.
For the IF This part, select a channel that receives text from your phone (for example, using your mobile SMS app or another voice-to-text integration).
For the Then That action, choose the Maker Webhooks (now Webhooks) service.
Configure the Webhooks action so that it makes an HTTP request (GET or POST) to a URL on a web service that you control.
This web service’s job is merely to store or echo back the command text in a simple format (for example, in plain text or JSON).
You can use a small web service you create on a publicly accessible server (or even a free service like a RequestBin gateway) that receives the IFTTT call and stores the incoming data.
Step 2. Poll the Web Service from ARC to Retrieve the Command
Because ARC doesn’t have a built-in HTTP listener, you can have a script (or even a RoboScratch/Blockly behavior) that periodically polls your web service. In ARC you’d use the Net.hTTPGet function to fetch data from your web service.
For example, suppose your web service is hosted at:
http://myserver.com/iftttcommand
And the service returns the plain text of the command (for instance, "move the big arm to position 4000").
Here’s a sample ARC JavaScript snippet that polls the server and then stores the returned text into a global variable named $TextCommand:
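A sketch of that polling logic. The ARC-specific calls (Net.hTTPGet, getVar, setVar) exist only inside ARC and are shown in comments, and the URL and timeout are just examples; shouldStore() is plain JavaScript that decides whether a fetched command is new and worth storing:

```javascript
// Decide whether a freshly fetched command should be written into the
// global variable: ignore empty responses and exact repeats of the
// command we stored last time.
function shouldStore(fetched, lastStored) {
  return fetched !== "" && fetched !== lastStored;
}

// Inside ARC, run this on a timer (e.g., every 2 seconds):
//   var text = Net.hTTPGet("http://myserver.com/iftttcommand", 5000);
//   if (shouldStore(text, getVar("$TextCommand", "")))
//     setVar("$TextCommand", text);
```

Filtering out repeats matters because a polled endpoint keeps returning the last command until a new one arrives; without the check, the same action would fire on every poll.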
You can run this polling script on a timer (for example, every few seconds) so that ARC continuously checks for new commands.
Step 3. Process the Command from the Global Variable
Once your command text is stored in $TextCommand, you can run your parsing script (like the example provided earlier) to take appropriate actions based on the text. For instance, if $TextCommand contains
"move the big arm to position 4000"
your parser would extract the numeric value and then use a command (such as Servo.setPosition) to command that servo, after which you can clear the $TextCommand variable.
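For the numeric extraction itself, a simple regular expression is enough; a minimal plain-JavaScript sketch:

```javascript
// Extract the position value from a command such as
// "move the big arm to position 4000". Returns null if no
// "position <number>" phrase is present.
function extractPosition(command) {
  var match = command.match(/position\s+(\d+)/i);
  return match ? parseInt(match[1], 10) : null;
}
```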
A Few Points to Consider
ARC does not directly host an HTTP endpoint, so an external web service is needed.
Your web service could be as simple as a small script on a hosted PHP, Node.js, or Python server that writes the incoming data to a text file (or even echoes it back) so that ARC can pick it up.
Security is something to keep in mind: a publicly accessible endpoint should at least have some form of authentication or password token (which you can include in the URL call) so that only IFTTT can update your command.
Summary
This approach lets you leverage IFTTT as an intermediary so that your voice-derived text can be introduced into ARC’s global variable pool where you can parse and act on it.
If you have any questions or need help setting up either the web service part or the ARC polling script, please let me know!
@Athena Looks like the biggest obstacle is having ARC retrieve a text and put it into the variable. How about, rather than a text, having it communicate via email or something similar? Do you have some recommendations for different services (text through web, email, or similar) that may work for this scenario? I really like the accuracy of voice to text on my iPhone and just want to use it with ARC.
Below is an example workflow and some recommendations for services you can use to feed text into ARC from your iPhone’s voice-to-text engine.
Recommended Workflow Overview
Voice-to-Text on Your iPhone:
Use your iPhone’s built-in voice-to-text (or a dedicated app) to capture your command. For example, you might dictate:
"Move the big arm to position 4000"
Use IFTTT (or a Similar Service) as a Bridge:
Set up an IFTTT applet (or another automation service such as Zapier or Microsoft Power Automate) that, upon capturing your voice-to-text command, triggers an action. Rather than only sending a push notification, have IFTTT forward the text via:
- A webhook (HTTP POST/GET) to a simple server you control
- An email, though this requires an additional step (for example, a script that monitors an inbox and then exposes the text via an HTTP endpoint)
- Another cloud service that can host a lightweight web app (such as Google Apps Script)
Provide an HTTP Endpoint for ARC to Poll:
Since ARC does not natively host an HTTP listener for incoming commands, the easiest method is to let ARC pull the latest command. For instance, you can set up a simple web service that stores your command text (perhaps in a text file, database, or Google Sheet) and exposes it at a URL. ARC can then use the Net.hTTPGet command to retrieve the text and load it into a global variable (e.g., $TextCommand).
Example Services & Approaches
IFTTT Maker Webhooks Channel:
Create an applet with the trigger of your choice (for example, Receive a voice command via the IFTTT app or an SMS trigger).
For the Then That action, select Maker Webhooks and set it to make an HTTP POST (or GET) to a URL that you control.
Custom Web Service:
You could build a very simple PHP, Node.js, or Python script hosted on a low-cost provider (even a free web host) that simply writes the POST data into a file (or in memory) and then echoes the latest command when queried.
Example (PHP):
--------------------------------------------------
<?php
// if a command is posted, save it
if ($_SERVER['REQUEST_METHOD'] == 'POST' && !empty($_POST['cmd'])) {
file_put_contents('command.txt', $_POST['cmd']);
echo "OK";
exit();
}
// for a GET request, return the stored command
if ($_SERVER['REQUEST_METHOD'] == 'GET') {
echo file_get_contents('command.txt');
// (Optionally clear the file here)
}
?>
--------------------------------------------------
Then, in ARC you would use:
--------------------------------------------------
var url = "http://yourserver.com/command_endpoint.php";
var commandText = Net.hTTPGet(url, 5000);
if (commandText != "") {
setVar("$TextCommand", commandText);
print("Received command: " + commandText);
// Process using your command parser
}
--------------------------------------------------
Google Apps Script Web App:
Google Apps Script:
Create a new Google Apps Script that, upon receiving a GET or POST request, stores the text in a Google Sheet (or simply a script property) and then returns the latest text when queried.
Deploy as a Web App:
Make sure it’s deployed with Anyone, even anonymous access (or secure it using a parameter).
IFTTT Integration:
Use IFTTT to POST your voice-to-text result to this web app URL.
ARC Polling:
In ARC, call Net.hTTPGet with the Google Apps Script URL.
Example snippet in ARC script is similar to the one shown above.
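For reference, a minimal sketch of what that Apps Script could look like. Inside Apps Script, PropertiesService and ContentService are supplied by the platform; the stub block at the top exists only so the handlers can be read and run standalone, and the form field name cmd is an assumption you would match in your IFTTT webhook body:

```javascript
// Local stand-ins so the handlers run outside Apps Script; inside
// Apps Script these globals already exist and this block is skipped.
if (typeof PropertiesService === "undefined") {
  var store = {};
  globalThis.PropertiesService = {
    getScriptProperties: function () {
      return {
        setProperty: function (k, v) { store[k] = v; },
        getProperty: function (k) { return (k in store) ? store[k] : null; }
      };
    }
  };
  globalThis.ContentService = {
    createTextOutput: function (t) { return { getContent: function () { return t; } }; }
  };
}

// Called by IFTTT: stores the "cmd" form field (assumed name).
function doPost(e) {
  var cmd = (e && e.parameter && e.parameter.cmd) || "";
  PropertiesService.getScriptProperties().setProperty("cmd", cmd);
  return ContentService.createTextOutput("OK");
}

// Called by ARC's Net.hTTPGet poll: returns the last stored command.
function doGet(e) {
  var cmd = PropertiesService.getScriptProperties().getProperty("cmd") || "";
  return ContentService.createTextOutput(cmd);
}
```

Script properties persist between executions, which is what makes this pattern work: IFTTT writes once, and ARC can read the value on any later poll.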
Email-Based Trigger (with an Intermediary):
IFTTT Email Channel:
Have IFTTT send an email when you dictate a command.
Intermediary Service:
Set up another system (could be a simple email parser using Zapier) that checks your inbox for these emails, extracts the command text, and then exposes it via an HTTP endpoint.
This extra step is a bit more involved but can work if you prefer using email as a delivery method.
Summary
In short, the most direct route is an IFTTT Webhooks applet that posts your dictated text to a simple endpoint you control (a small custom script or a Google Apps Script web app), which ARC then polls with Net.hTTPGet and stores in a global variable such as $TextCommand. Email delivery also works but requires an extra parsing step.
Follow-Up
Do any of these approaches look like they might work for your setup? I can provide additional code samples or further details on setting up the Google Apps Script or any other part if needed. Just let me know!
I’m wondering why you aren’t using one of the several robot skill solutions that do 10 times more than your example, such as linking Bing Speech Recognition with the ChatGPT robot skills.
Because I did not have good results with it. Maybe I need to try it again with Bing and ChatGPT and see if there was something I was doing wrong or it has improved. There was a lot of latency, which caused some issues, as I had to wait about 3 seconds before I could start talking. Even with the headgear and microphone it was about 25% accurate, but that was with an older computer. A different gaming computer might make things better.
Parsing natural language is gonna take significant processing if it’s something you want to do on your own. There are only a few companies that offer that technology, which is why it’s quite difficult to accomplish without leveraging their APIs.
There are two technologies that you’re wanting to link.
But you can replace NLP with a GPT. GPT gives you more ability. The manual explains how to do that for the GPT robot skill.
If you need to see how to use ControlCommand to send bing result to gpt, there’s a good video on it - that shows that example exactly.
You’ll always end up with a bit of a delay with NLP or GPT processing.
@Athena ,
Can you provide a link to the video @DJ is talking about in his above quote?
Hi Dave_Schulpius,
I've got you covered. There is a great video demonstration on our Synthiam YouTube channel that walks through using the ControlCommand to send Bing Speech Recognition results to the OpenAI ChatGPT robot skill.
You can check it out here:
Integrating Bing Speech Recognition with OpenAI ChatGPT in ARC
Give it a watch and let me know if you have any further questions or need additional details on the setup!
@Athena The video is not available anymore. @DJ I was planning on not having to parse it, as I was going to ask a specific question and expect a certain type of answer: which tool, distance, how long, etc. Having the ability to use texting from an iPhone would have many advantages: most of the time it is located on you, you can double-check it before you send it, and the computer can be located very far away. Sometimes just starting a conversation about a potential project can spur our brains to develop a workaround. Text it, post it to a computer screen, view it with a good camera, put the results in a global variable is one possibility, but it may take too long and be too cumbersome.
Hi Automation_Man,
You're absolutely right: using text input from an iPhone can offer many benefits, such as improved accuracy, convenient editing before sending, and flexibility with the computer’s location. While the PushOver robot skill is primarily designed for push notifications, you can indeed integrate text-based commands into ARC and then have these commands parsed to drive various robot actions (like moving a servo or toggling a digital port).
Below are a couple of ideas on how you might accomplish this:
Capturing Text Input Using PushOver:
Use the PushOver app on your iPhone to send text messages that contain your desired command (for example, "Move the Big Arm to position 4000" or "Turn on Digital Port 10").
Configure a service or a script within ARC so that when a PushOver notification is received, its text content is captured and stored in a global variable such as $UserCommand.
Parsing the Command in ARC:
Once the command is stored in a global variable, you can write a script, using JavaScript (or Python if you prefer), that reads and parses the message. Based on the keywords and numeric values found in the text, the script can then execute the corresponding ARC commands.
Here’s an example in JavaScript to illustrate the concept:
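One straightforward shape for such a script is a table-driven matcher; a minimal sketch, where matchCommand() is plain JavaScript, the ARC calls (getVar, setVar, Servo.setPosition, Digital.set) exist only inside ARC and appear in comments, and v1 and d10 are placeholder ports:

```javascript
// Keyword table: each entry pairs a phrase to look for with an action name.
// Adding a new voice command is just adding a row here.
var handlers = [
  { keyword: "big arm",      action: "servo-bigarm" },
  { keyword: "digital port", action: "digital" },
  { keyword: "fingers",      action: "servo-fingers" }
];

// Match the command text against the table; returns the action plus the
// first number found in the text, or null if nothing matched.
function matchCommand(text) {
  var cmd = text.toLowerCase();
  var num = cmd.match(/\d+/);
  for (var i = 0; i < handlers.length; i++)
    if (cmd.indexOf(handlers[i].keyword) >= 0)
      return { action: handlers[i].action, value: num ? parseInt(num[0], 10) : null };
  return null;
}

// Inside ARC, run on a timer:
//   var m = matchCommand(getVar("$UserCommand", ""));
//   if (m && m.action == "servo-bigarm") Servo.setPosition(v1, m.value); // v1 = placeholder
//   else if (m && m.action == "digital") Digital.set(d10, true);         // d10 = placeholder
//   setVar("$UserCommand", ""); // clear so the command only runs once
```

Clearing the variable after dispatch ensures a single dictated message triggers the action exactly once, even though the timer keeps re-reading the variable.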
The keyword checks and numeric extraction in such a script can be adapted to whatever commands you plan to dictate.
Considerations to Minimize Latency:
Modern mobile devices and fast networks should help reduce delays.
Be mindful that processing natural language, even at this basic level, can introduce some latency depending on how complex your parsing logic is. Ensure any scripts reading the global variable run asynchronously if needed, so user input doesn’t block other functions in your project.
Alternative Input Methods:
If you find that using PushOver alone is too cumbersome or introduces extra steps, you might also consider:
A custom web-based interface (accessed from your iPhone) that posts text to ARC via a configured endpoint.
Direct integration with smartphone chat or text message APIs that can update ARC’s global variable storage automatically.
Using text-based commands in this way leverages the advantage of being able to review and refine your command before sending it, especially when the computer is located remotely or when voice input might introduce higher latency on an older setup.
I hope this gives you a clear idea of how you can implement text input alongside PushOver and other skills in ARC. Let me know if you have any further questions or need help fine-tuning your script!
@Athena, the link you provided to the video @DJ suggested we look at (Integrating Bing Speech Recognition with OpenAI ChatGPT in ARC) says it isn't available anymore. Did the address change or was the video taken down?
Hi Dave_Schulpius,
Thanks for letting me know. It appears that the video link I provided is no longer available. It might have been removed or replaced during a recent update to our content.
In the meantime, I suggest checking out our Synthiam Support page at Synthiam Support or our Synthiam YouTube Channel for any updated videos on integrating Bing Speech Recognition with OpenAI ChatGPT.
If you need further details or a step-by-step walkthrough, let me know, and I can provide updated instructions or assist you in setting up the integration directly.
Looking forward to helping you get everything working smoothly!