Openai Text To Speech

by OpenAI

Use the OpenAI models for converting text to speech. It includes nine built-in voices that can be used.

Requires ARC v4 (Updated 3/18/2025)

How to add the Openai Text To Speech robot skill

Load the most recent release of ARC (Get ARC).
Press the Project tab from the top menu bar in ARC.
Press Add Robot Skill from the button ribbon bar in ARC.
Choose the Audio category tab.
Press the Openai Text To Speech icon to add the robot skill to your project.

Don't have a robot yet?

Follow the Getting Started Guide to build a robot and use the Openai Text To Speech robot skill.

How to use the Openai Text To Speech robot skill

Jump to Control Commands list
Jump to comments
Jump to end

The OpenAI Text-to-Speech robot skill for Synthiam ARC allows your robot to communicate naturally by converting written text into audible speech using advanced OpenAI models. Simply send any text you want the robot to vocalize using the ControlCommand(), and the robot will produce clear, expressive speech in real-time.

One of the unique aspects of this robot skill is that it leverages OpenAI's sophisticated AI-driven speech synthesis. This means that each time you request speech - even when requesting the exact same text - the resulting audio will never sound identical. Variations in tone, pacing, and inflection make the robot's voice dynamic, realistic, and engaging.

Benefits of Text-to-Speech

Enhanced Interaction: Creates a more immersive and interactive experience for users, making robot-human interactions more intuitive.
Accessibility: Assists users who may have visual impairments or prefer auditory communication.
Efficiency: Automates announcements, instructions, and feedback, saving time and effort.
Personalization: Multiple voices allow customization to suit specific applications or contexts, from professional environments to casual interactions.

Available Voices

This robot skill includes nine distinct OpenAI-generated voices, each with unique characteristics:

Alloy: Clear, neutral, and versatile-ideal for general-purpose announcements and technical explanations.
Echo: Soft and reflective, suited for storytelling and calm interactions.
Fable: Expressive and animated, perfect for creative narratives and entertainment.
Onyx: Deep, authoritative voice, well-suited for commands, alerts, or serious contexts.
Nova: Friendly and inviting, great for customer service and casual conversation.
Shimmer: Bright and enthusiastic, effective for lively engagements and educational content.
Ember: Warm and comforting, excellent for reassuring interactions or empathetic communication.
Luna: Calm and soothing, ideal for relaxation, guided activities, or therapeutic applications.
Zephyr: Light and airy, providing a gentle and approachable tone suitable for general dialogue.

How to Use

Simply invoke the ControlCommand() with your desired text and voice selection to enable your robot to speak dynamically. Each command produces uniquely synthesized audio, creating engaging interactions every time. View the bottom of this document or the Cheat Sheet in ARC to see available control commands for this robot skill.

Configuration

The configuration screen has 2 tabs and several options.

Settings Tab

Voice Model: This is the voice that can be configured. There are nine different voices, each with a unique sound and quality.

Speak out of EZB: If this option is checked, the audio will be spoken out of the EZB. Otherwise, it will speak out of the PC Speaker
Replace Audio Script Commands: Checking this option will use this robot skill as the default speech synthesis for ARC scripting. This means using the Audio.say(), and other speech scripting commands will use this robot skill rather than the built-in speech synthesis.

Start Speaking Script: This script will execute as soon as the text begins to speak. You can use it to move a robot's mouth or turn on an LED or something when the robot is speaking.

Text Variable: This is the variable that holds the text that is currently being spoken.

API Settings Tab

API Key: Configure the Open AI API Key that you can obtain from the open ai website.

Example

ControlCommand("OpenAI Text To Speech", "speak", "The text to speak as a string");

Enjoy bringing dynamic and expressive speech capabilities to your robotics projects with the OpenAI Text-to-Speech Robot Skill.

Control Commands for the Openai Text To Speech robot skill

There are Control Commands available for this robot skill which allows the skill to be controlled programmatically from scripts or other robot skills. These commands enable you to automate actions, respond to sensor inputs, and integrate the robot skill with other systems or custom interfaces. If you're new to the concept of Control Commands, we have a comprehensive manual available here that explains how to use them, provides examples to get you started and make the most of this powerful feature.

Control Command Manual

Speak Text Asynchronously

Speaks the provided text asynchronously in the background.

Syntax:

controlCommand("OpenAI Text To Speech", "speak", "The text to speak as a string");

Example:

controlCommand("OpenAI Text To Speech", "speak", "Hello, world!");

Speak Text and Wait for Completion

Speaks the provided text and waits until the speech completes.

Syntax:

controlCommand("OpenAI Text To Speech", "speakWait", "The text to speak as a string");

Example:

controlCommand("OpenAI Text To Speech", "speakWait", "Hello, world!");

Similar Skills

Upgrade to ARC Pro

Experience the transformation – subscribe to Synthiam ARC Pro and watch your robot evolve into a marvel of innovation and intelligence.

Compare Pro Features View Subscription Plans

Synthiam Support

Canada

#1 Mar 12

v1 of this robot skill has been added to the robot skill store.

Synthiam Support

Canada

#2 Mar 12

v3 has been updated to fix a minor bug where not all voices were being displayed in the configuration window.

Synthiam Support

Canada

#3 Mar 18 — Edited Mar 19

v4 updated to use the new audio manager in ARC that allows this robot skill to replace the Audio.say() etc script commands. This means you don't need to use the control commands to make this robot skill speak. Simply continue using the Audio.say() and other script commands.

Openai Text To Speech

How to add the Openai Text To Speech robot skill

Don't have a robot yet?

How to use the Openai Text To Speech robot skill

Benefits of Text-to-Speech

Available Voices

How to Use

Configuration

Settings Tab

API Settings Tab

Example

Control Commands for the Openai Text To Speech robot skill

Speak Text Asynchronously

Speak Text and Wait for Completion

Similar Skills

Audiotoolbox Plugin

Advanced Speech Synthesis

Advanced Speech Recognition

Upgrade to ARC Pro

Products

Community

Support

About