Asked — Edited

The Speech Is Cut Off Using Bing Speech And CHATGPT And Iotiny

So I ask the BingSpeech "What is your name?" it sends question to CHATGPT which then provides it response in the form of the $OpenAIResponse "My name is XR1..........." but what I get out of the speaker is "is XR1......" or "name is XR1........" Why would it be cut off like that?

@Nink I have not tried Watson speech to text and text to speech skills for this.  I just can't understand how it can be dropping part of the output in the audio.  The EZBSAY command is sending the complete response but parts are missing in the audio output. To me this sounds like a software issue talking to the hardware or a firmware issue in the hardware. I know wifi can cause delays but if the packets got sent and received the audio should play everything not parts of it.


Related Hardware EZ-B IoTiny

ARC Pro

Upgrade to ARC Pro

Don't limit your robot's potential – subscribe to ARC Pro and transform it into a dynamic, intelligent machine.

#1  

I had similar problems. At first I tested all the skills, in the end there was no sound. Deleted everything. I installed again only two skills Bing and Chat. Didn't work again. Installed several times in different order. In the end, a miracle happened and everything worked. Apparently it is important to have a clean place, a clean cache, the correct sequence of installing skills.

Then when the sound was working I started adding servos, mouth and eye movement. In this case, there were delays, sometimes rattling of the voice, loss of letters. This means that the performance of the computer is not enough. Closed all programs and windows on the computer. It began to work better. Perhaps the problem is in the performance of the computer. Voice and video consume a lot of resources.

PRO
Canada
#2  

Oh sorry misunderstood. I don’t use sayEZB command. I use on board SBC and speaker .  The speaker on EZB is pretty bad, maybe If you don’t have an on board PC you can run a Bluetooth speaker connected to external PC and that may work better.

#3  

I attached a 1.5 speaker to the IOTiny and the sound is load and clear for speaking not sure I would want to play music through it. I just wish someone who had knows more about this hardware and skill could explain where the issue is so I might be able to address it.

PRO
Canada
#4  

Hey @athena how is the audio transmitted from the PC to the EZB. Does it use TCP or UDP, what compression, Audio codec or file format is used?   Is any of this configurable?

PRO
Canada
#5  

Hello! The audio is transmitted from the PC to the EZB using TCP/IP protocol. The audio is compressed using the Opus codec, which is a high-quality, low-latency audio codec. The file format used is WAV. Currently, the configuration options for audio transmission are limited within ARC. However, you can adjust the audio quality settings in the ARC software. If you need further assistance with audio configuration, I recommend checking out the support section on the Synthiam website for relevant articles and tutorials.


I am a customer support GPT-powered AI and still learning. If you have doubts about my response, please use the support documentation to verify my answer.
PRO
Canada
#6   — Edited

So sounds like the entire file is transmitted to the IOTiny and data is not being lost due to transmission, compression or file format so this kinda points to a hardware performance issue either on the PC end like @FANTOMAS said or on the IOTiny end. Perhaps interupts are occurring when controlling other devices cutting off speech.

to rule out PC performance I guess you could try local speaker.

Hey @jeremie any issues with sayEZB and speech on IOTiny?

#7  

Hey @athena

I ask the BingSpeech "What is your name?" it sends question to CHATGPT which then provides it response in the form of the $OpenAIResponse "My name is XR1..........." but what I get out of the speaker is "is XR1......" or "name is XR1........" Why would it be cut off like that?

PRO
Canada
#8  

Hello @Redzone! It seems like you're experiencing some issues with the speech output from the Bing Speech Recognition skill. There could be a few reasons why the response is being cut off.

One possibility is that there might be a delay or interruption in the communication between the Bing Speech Recognition skill and the speech output device. This could be due to network latency or other factors affecting the transmission of data.

Another possibility is that there might be limitations in the speech output capabilities of the device you are using. Some devices may have limitations on the length of the speech output or the quality of the audio playback.

To troubleshoot this issue, you can try the following steps:

  1. Check the network connection between the device running ARC and the speech output device to ensure a stable connection.
  2. Test the speech output with different phrases of varying lengths to see if the issue persists consistently or only occurs with specific phrases.
  3. If possible, try using a different speech output device or method to see if the issue is specific to the current setup.

If the issue persists, it may be helpful to reach out to the Synthiam support team for further assistance. They can provide more specific guidance based on your setup and help troubleshoot any potential hardware or software issues.


I am a customer support GPT-powered AI and still learning. If you have doubts about my response, please use the support documentation to verify my answer.