Australia
Asked

How Does The New Image Personality Feature Work With The Openai Chatgpt Skill?

Could you please provide documentation on how the new Image Personality feature integrates with the OpenAI ChatGPT skill in Synthiam ARC? I'm interested in understanding how these two features work together and any specific steps or configurations required to utilize them effectively.


Related Hardware EZ-B v4
Related Control OpenAI ChatGPT

ARC Pro

Upgrade to ARC Pro

Your robot can be more than a simple automated machine with the power of ARC Pro!

PRO
Canada
#1  

The Image Personality feature in the OpenAI ChatGPT skill within Synthiam ARC allows you to specify default text that will be sent with the camera image when querying it. This feature enhances the interaction between your robot and the OpenAI ChatGPT skill by enabling the robot to describe images or respond to image-related queries.

Here's how the Image Personality feature works and how you can configure it:

How It Works

  1. Image Description: The OpenAI ChatGPT skill can describe images captured by your robot's camera. The Image Personality feature allows you to set a default query or instruction that will be sent along with the image to the AI for processing.

  2. Custom Instructions: You can customize the instructions for the AI to perform specific tasks related to the image. For example, you might want the AI to count objects, identify colors, or describe the scene.

  3. Integration with Camera: The feature works with the camera device robot skill in your ARC project. It uses the most recent image captured by the camera for analysis.

Configuration Steps

  1. Access the OpenAI ChatGPT Skill: Open your ARC project and navigate to the OpenAI ChatGPT skill configuration menu.

  2. Navigate to the Image Personality Tab: In the configuration menu, find the "Image Personality" tab. This is where you can set the default text for image queries.

  3. Set Default Text: Enter the default text or query that you want to be sent with the image. This text will guide the AI on what to do with the image. For example, you might enter "Describe the objects in this image."

  4. Use ControlCommand for Custom Queries: If you want to override the default text for specific queries, you can use the ControlCommand() function. This allows you to specify different instructions for different scenarios.

    // Describe the image in the first camera device robot skill
    ControlCommand(OpenAI ChatGPT, DescribeImage);
    
    // Describe the image in the specified camera device  robot skill with custom instructions
    ControlCommand(OpenAI ChatGPT, DescribeImage, Camera Device 2, "How many people are in the image?");
    
  5. Test the Configuration: Once configured, test the feature by capturing an image with your robot's camera and observing the AI's response based on the Image Personality settings.

Additional Tips

  • Reset to Default: If you need to reset the Image Personality text to its default value, you can use the "Set To Default" button in the configuration menu.

  • Experiment with Queries: Try different queries to see how the AI responds. This can help you fine-tune the Image Personality settings for your specific use case.

By following these steps, you can effectively utilize the Image Personality feature in the OpenAI ChatGPT skill to enhance your robot's ability to interact with and describe images. If you have any further questions or need additional assistance, feel free to ask!


I am a customer support GPT-powered AI and still learning. If you have doubts about my response, please use the support documentation to verify my answer.
#2  

@Athena, please also explain how the Personality and Image Personality can include code examples that the AI can use in its response. This allows the AI to program itself in responses.

You do not need additional code if it is detailed enough because the AI can include code to execute in its response. AI can make your robot entirely autonomous and programmed in real time.

PRO
Canada
#3   — Edited

Certainly! The Personality and Image Personality features in the OpenAI ChatGPT skill allow you to include code examples that the AI can use in its responses. This capability enables the AI to generate responses that include executable code, making your robot more autonomous and capable of real-time programming. Here's how you can leverage these features:

Personality and Image Personality with Code Examples

  1. Embedding Code in Personality:

    • In the Personality tab, you can define a personality for your robot, including how it should behave and respond.
    • You can include code snippets within the personality description that the AI can use in its responses. These snippets are surrounded by either JavaScript or Python tags, depending on your preferred scripting language in ARC.
    • For example, you can instruct the AI to include movement commands or other actions in its responses.
    Your name is Synthiam, and you're a helpful robot. You can move around, dance, and interact with people. If instructed, you must include Synthiam ARC JavaScript in your response between <javascript> and </javascript> tags.
    
    Query: move forward
    Code: <javascript>Movement.forward();</javascript>
    
    Query: stop moving
    Code: <javascript>Movement.stop();</javascript>
    
  2. Embedding Code in Image Personality:

    • In the Image Personality tab, you can specify default text and code examples for image-related queries.
    • This allows the AI to include code in its responses based on the image analysis, such as moving servos or executing specific actions.
    Describe the image and include any relevant actions. If instructed, include Synthiam ARC JavaScript in your response between <javascript> and </javascript> tags.
    
    Query: How many people are in the image?
    Code: <javascript>Servo.setPosition(d0, 90);</javascript> // Example code to move a servo
    
  3. Real-Time Programming:

    • By providing detailed and comprehensive code examples, the AI can autonomously generate responses that include executable code.
    • This means the AI can program itself in real-time, responding to user queries with actions that are immediately executed by the robot.
  4. Testing and Iteration:

    • Test the AI's responses to ensure the code is executed correctly and the robot behaves as expected.
    • Iterate on the personality and image personality settings to refine the AI's capabilities and improve its autonomy.

Benefits

  • Autonomy: The robot can perform complex tasks and behaviors without additional programming, as the AI generates the necessary code in its responses.
  • Flexibility: You can update the personality and image personality settings to adapt to new tasks or environments.
  • Interactivity: The robot can engage in more dynamic and interactive conversations, responding with actions that enhance user experience.

By utilizing the Personality and Image Personality features with embedded code examples, you can create a highly autonomous and interactive robot that leverages the power of AI to program itself in real-time. If you have further questions or need assistance, feel free to ask!


I am a customer support GPT-powered AI and still learning. If you have doubts about my response, please use the support documentation to verify my answer.