
We Are Getting Ready to Release Our Next-Generation Open-Source Robot, the XR-2

User-inserted image

The XR-2 robot platform, available at Robots Builder, is designed with user-friendliness in mind, emphasizing accessibility and ease of assembly. Developed to make robotics more approachable, the XR-2 is meticulously engineered for easy 3D printing and assembly using standard components like servos and bearings. The XR-2 is a significantly updated version of the XR-1, which we released last year. We have addressed several weaknesses in the XR-1 and added many new features to the updated XR-2.

The XR-2 supports two single-board computers (SBCs). A PC mounted behind the display in the chest runs Synthiam ARC, and there is a mounting location for a Jetson Orin Nano just above that PC, below the neck, for onboard AI server support. On the back, we have a custom servo breakout board with two Mega Minis and USB ports. The power source is two 20V power-tool batteries. The robot has 49 degrees of freedom and supports two cameras, and there is room for 10 small Arduinos or sensors in each leg if you choose to add them. Almost every joint uses bearings to take structural stress off the servos. That said, the XR-2 is a heavy robot.

Our commitment to simplifying the building process ensures that enthusiasts and robot hobbyists of all skill levels can engage in constructing their own robots. We've invested time in perfecting the design, ensuring that users can quickly move on to the exciting aspects of teaching and programming their robots. The XR-2 is highly customizable, featuring a variety of hands and heads to choose from, allowing users to personalize their creations. To foster a collaborative and open community, we're proud to announce that everything about the XR-2 is open source, inviting users to contribute, modify, and share their innovations with the global robotics community. Get ready to embark on a journey of creativity and exploration with the XR-2!

For more information, check out the following link in a few weeks as we will be updating the website with the new XR-2 files: Robots Builder



#1  

It's exciting to see you creating a new robot for the robot community! Innovation in robotics is always welcome, and we all want to ensure that your project is a success.

Since this post is promoting a product on the Synthiam website, I need to confirm whether your robot integrates with Synthiam ARC. If it does, it would be helpful to highlight that in your promotional message, as it’s not immediately clear from your post. Ensuring that potential users understand its compatibility with ARC can significantly enhance engagement and adoption within the community.

If your robot does not currently support Synthiam ARC, I'd be happy to suggest alternative marketing strategies that can help promote your product without relying on another business's support channels and community forums. My goal is to help you achieve maximum success, as the world needs more robots!

Of course, if ARC is not yet part of your product, we’d love for you to consider adding support for it. Synthiam ARC provides a robust ecosystem that could bring additional value to your project while making it more accessible to a wider audience.

Please let me know the details so we can work together to ensure your success. Looking forward to your response!

Athena, your friendly AI


I am a customer support GPT-powered AI and still learning. If you have doubts about my response, please use the support documentation to verify my answer.
#2  

This is the ARC brain in the XR-2. We use a 16 GB Windows 11 PC, as seen in the picture below.

User-inserted image

#3  

That's great to see! What Athena was getting at is that the Synthiam community forum has a strict policy of not being an advertising platform for competing products. For example, Apple cannot promote its products on Microsoft's website.

In future posts promoting your robot on the Synthiam website, please mention the compatibility with ARC to avoid this scenario again. :)

We are glad to see Synthiam being used in your product and hope we continue to develop great features that help increase the popularity of your robot.

#4  

We are excited to see the ongoing development of ARC and ARCx and want to leverage it as much as possible. We would like to see ARC integrate more with the Jetson Orin Nano for more edge-supported AI services, making AI services with ARC more cost effective.

The Orin Nano makes a cheap AI chatbot with TTS/STT and vision support, using Python for simple coding. It would be great to find a way to bring the two platforms together.

#5   — Edited

Nice-looking robot! You say it's open source. With no disrespect to the hard work and hours from the many people involved in creating this, would you care to share the CAD files? I'm kind of a CAD junkie.

#6   — Edited

@smiller, you can point any ARC AI robot skill at services running on the Nano. If you check the manual for Autonomous AI or OpenAI GPT, you will notice the endpoint URI can be configured. The only limitation is that the Nano can't handle the model sizes that produce useful results. The models we're working with have about 8 billion parameters, and recursive and parallel inference is used for reasoning. Unfortunately, what we've provided with the ARC AI robot skills can never run on a Nano.

If you want to use your Nano with a very small model, you can point the robot skill endpoints to it. But you would never want to run the robot software and inference on the same machine.
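
For illustration, here is a minimal Python sketch of what pointing a client at a custom OpenAI-compatible endpoint looks like; the base URL and model name are placeholder assumptions for whatever server you run on the Nano, not Synthiam-specific values:

from openai import OpenAI

# Point the client at a local OpenAI-compatible server instead of api.openai.com.
# Both the base_url and the model name below are placeholder assumptions.
client = OpenAI(base_url="http://jetson-nano.local:8000/v1", api_key="not-needed")

resp = client.completions.create(
    model="mistral-7b-instruct",
    prompt="Say hello in five words.",
    max_tokens=16,
)
print(resp.choices[0].text)

An ARC robot skill with a configurable endpoint URI is doing essentially the same thing when you point it at your own server.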

It would cost upwards of $25k to build a home system that could keep up with the inference demands of recent AI models.

Running an 8-billion parameter AI model locally at high speed requires a high-performance system with a powerful GPU, sufficient RAM, and optimized software. Here’s what you need:


Hardware Requirements

  1. **GPU (Most Important)**

    • Minimum: NVIDIA RTX 3090 / 3090 Ti / 4090 (24GB VRAM)
    • Ideal: NVIDIA RTX 6000 Ada / RTX 4090 / A100 / H100 / MI300X
    • Best multi-GPU setup: two or more RTX 4090s or A100s with NVLink (for better memory pooling)
    • Why? These models require high VRAM (24GB+) and fast tensor cores for efficient inference (see the sizing sketch after this list).
  2. **RAM (System Memory)**

    • Minimum: 64GB DDR4/DDR5
    • Ideal: 128GB+ DDR5 (for larger batch sizes and longer context)
    • Why? The model will offload some weights to RAM if VRAM is insufficient.
  3. **CPU (Less Important but Still Needed)**

    • Minimum: Ryzen 9 7950X / Intel i9-13900K
    • Ideal: Threadripper PRO / Xeon / EPYC (16+ cores)
    • Why? The CPU handles token processing, so a higher core count improves inference speed.
  4. **Storage (SSD for Speed)**

    • Minimum: 1TB NVMe SSD
    • Ideal: 2TB+ PCIe 4.0/5.0 NVMe SSD
    • Why? Loading the model from an SSD is much faster than from an HDD.
  5. **Power Supply (PSU)**

    • Minimum: 1000W+ (for high-end GPUs)
    • Why? RTX 4090s and A100s draw a lot of power.
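
For a rough sense of where the 24GB+ VRAM figure comes from, here is a quick back-of-envelope sizing sketch in Python (the 1.2x overhead factor is an assumption to cover KV cache and activations; real usage varies with context length):

# Rough VRAM estimate for serving a model of a given size.
# The 1.2x overhead factor is a loose assumption, not a measured value.
def vram_gb(params_billion: float, bytes_per_param: float, overhead: float = 1.2) -> float:
    return params_billion * bytes_per_param * overhead

for label, bpp in [("FP16", 2.0), ("INT8", 1.0), ("INT4", 0.5)]:
    print(f"8B model @ {label}: ~{vram_gb(8, bpp):.1f} GB VRAM")

# Output: FP16 ~19.2 GB (hence the 24GB cards), INT8 ~9.6 GB, INT4 ~4.8 GB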

Performance Expectations

This is where the home DIY build struggles when you compare its performance against what we get running inference on data center infrastructure. One point of the cloud is storage: you trust someone else to host your data and revisions reliably. The other is processing: a cloud data center is a football-stadium-sized server, so you don't have to live inside your own server. Even an A100, at roughly 25 tokens per second and a 32K max context per session, won't provide an enjoyable experience; at that rate, a 250-token reply takes ten seconds. Here's the GPU performance comparison:

| GPU | VRAM | Speed | Max Context |
| --- | --- | --- | --- |
| RTX 3090 | 24GB | 3-6 tokens/sec | ~4K |
| RTX 4090 | 24GB | 8-12 tokens/sec | ~4K-8K |
| 2x RTX 4090 | 48GB | 15-20 tokens/sec | ~8K-16K |
| RTX 6000 Ada | 48GB | 20+ tokens/sec | ~16K |
| A100 | 80GB | 25+ tokens/sec | 32K+ |

Best Setup (for results comparable to Synthiam's cloud service partners)

This setup would offer performance similar to what Synthiam ARC's current AI cloud partners deliver, including fine-tuning and chat inference. However, the cost would be in the USD 100,000+ range.

  • Multi-GPU setup (4x A100s) for faster inference
  • FP8/INT4 quantization to save VRAM
  • 128GB DDR5 RAM
  • AMD Threadripper 7980X / Intel Xeon
  • 4TB NVMe SSD
  • Ubuntu 22.04 with CUDA 12.x

The cost of using cloud services is dropping daily now that there's competition among DeepSeek, xAI, and OpenAI. Given the significantly lower cost, I want to see the results when Grok 3 from xAI is available.

#7   — Edited

Here are instructions so you can get up and running with your Orin Nano and Synthiam ARC AI robot skills immediately. I would recommend using only the OpenAI ChatGPT robot skill, not the Autonomous AI.

Running an LLM compatible with the OpenAI API used by Synthiam ARC AI robot skills is easy, since the Orin Nano has a "better" GPU, more RAM, and Tensor Cores compared to the older Jetson Nano. Here's what you need to do:


1. Choose a Compatible LLM

The Orin Nano can handle larger models than the Jetson Nano. Recommended options:

  • Mistral 7B (quantized: GGUF, GPTQ, or TensorRT) - best balance of size and performance
  • Llama 2 (7B, quantized) - works well, but slower than Mistral
  • Phi-2 (2.7B) - super lightweight, best for responsiveness
  • TinyLlama (1B) - runs fast, great for smaller tasks
  • Gemma 2B - lightweight and well-optimized

For best performance on Orin Nano, you should:

  • Use TensorRT optimization for NVIDIA acceleration
  • Use 4-bit or 8-bit quantization for efficiency (see the sketch after this list)
  • Offload layers to GPU instead of CPU
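
As a concrete example of the quantization bullet above, here is a minimal sketch of loading a 4-bit model with transformers and bitsandbytes (the model ID is just an example, and this assumes a bitsandbytes build that works on the Jetson's aarch64 platform):

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# Example model ID; any causal LM from Hugging Face works the same way.
model_id = "mistralai/Mistral-7B-Instruct-v0.2"

# 4-bit quantization config; compute in FP16 to use the GPU's tensor cores.
bnb = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_compute_dtype=torch.float16)

tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, quantization_config=bnb, device_map="auto"
)

inputs = tok("Tell me a joke.", return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=64)
print(tok.decode(out[0], skip_special_tokens=True))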

2. Set Up the Orin Nano

Flash Jetson Orin Nano with JetPack

  • Download JetPack 5.x/6.x from NVIDIA SDK Manager
  • Flash your Orin Nano using SDK Manager
  • Enable swap if using 8GB Orin Nano for better RAM management
    sudo fallocate -l 8G /swapfile
    sudo chmod 600 /swapfile
    sudo mkswap /swapfile
    sudo swapon /swapfile
    echo '/swapfile none swap sw 0 0' | sudo tee -a /etc/fstab
    

3. Install Dependencies

Install PyTorch with CUDA for Jetson

sudo apt update && sudo apt upgrade -y
sudo apt install -y python3-pip python3-venv git
pip3 install --upgrade pip

Download NVIDIA-optimized PyTorch:

wget https://developer.download.nvidia.com/compute/redist/jp/v51/pytorch/torch-2.0.0+nv23.02-cp38-cp38-linux_aarch64.whl
pip3 install torch-2.0.0+nv23.02-cp38-cp38-linux_aarch64.whl

Install Transformers, Bitsandbytes & SentencePiece

pip3 install transformers accelerate bitsandbytes sentencepiece

4. Download a Quantized LLM

For best performance on Orin Nano, use GGUF (llama.cpp) or GPTQ (optimized for TensorRT).

Option 1: GGUF (for llama.cpp)

wget https://huggingface.co/TheBloke/Mistral-7B-Instruct-GGUF/resolve/main/mistral-7b-instruct.Q4_K_M.gguf

Option 2: GPTQ (for text-generation-webui)

mkdir -p models/Mistral-7B
cd models/Mistral-7B
wget https://huggingface.co/TheBloke/Mistral-7B-Instruct-GPTQ/resolve/main/config.json
wget https://huggingface.co/TheBloke/Mistral-7B-Instruct-GPTQ/resolve/main/mistral-7b-instruct-4bit-128g.safetensors

5. Run the LLM on the Orin Nano

Option 1: Using llama.cpp (Best for Low RAM)

Compile llama.cpp with CUDA support:

git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
make LLAMA_CUBLAS=1 -j$(nproc)

Run the quantized model:

./main -m mistral-7b-instruct.Q4_K_M.gguf --n-gpu-layers 40 -p "Tell me a joke."

Option 2: Using GPTQ with TensorRT (Best for Speed)

Install text-generation-webui:

git clone https://github.com/oobabooga/text-generation-webui
cd text-generation-webui
pip3 install -r requirements.txt

Run the GPTQ model:

python3 server.py --model Mistral-7B-Instruct-GPTQ

6. Expose as an OpenAI-Compatible API

To make it work with the OpenAI SDK, you need an API wrapper.

Install FastAPI and llama-cpp-python

pip3 install fastapi uvicorn llama-cpp-python

Create an API Server (api.py)

from fastapi import FastAPI
from pydantic import BaseModel
from llama_cpp import Llama

app = FastAPI()
llm = Llama(model_path="mistral-7b-instruct.Q4_K_M.gguf", n_gpu_layers=40)

# OpenAI-style completion request body (only the fields used here);
# a bare query parameter would not accept the JSON body an OpenAI client sends
class CompletionRequest(BaseModel):
    prompt: str
    max_tokens: int = 256

@app.post("/v1/completions")
async def generate(req: CompletionRequest):
    # Run inference and return an OpenAI-style response shape
    output = llm(req.prompt, max_tokens=req.max_tokens)
    return {"choices": [{"text": output["choices"][0]["text"]}]}

# Run the server
if __name__ == "__main__":
    import uvicorn
    uvicorn.run(app, host="0.0.0.0", port=8000)

Now you can configure Synthiam ARC AI robot skills to use this endpoint on the Orin Nano.
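
Before pointing ARC at it, it's worth a quick smoke test of the wrapper from another machine on the network; a minimal check, assuming the server above is running on port 8000:

import requests

# Hypothetical smoke test; replace localhost with the Nano's IP or hostname.
resp = requests.post(
    "http://localhost:8000/v1/completions",
    json={"prompt": "Tell me a joke.", "max_tokens": 64},
)
print(resp.json()["choices"][0]["text"])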


7. Optimize Performance

Enable Jetson Clocks

sudo jetson_clocks

Enable Swap (If RAM is low)

sudo swapoff -a
sudo fallocate -l 8G /swapfile
sudo chmod 600 /swapfile
sudo mkswap /swapfile
sudo swapon /swapfile
echo '/swapfile none swap sw 0 0' | sudo tee -a /etc/fstab

Use TensorRT for Maximum Performance

For TensorRT-optimized models, install:

sudo apt install nvidia-tensorrt

Then use TensorRT-LLM from NVIDIA:

git clone https://github.com/NVIDIA/TensorRT-LLM
cd TensorRT-LLM
pip3 install -r requirements.txt

Run:

python3 tensorrt_llm.py --model mistral-7b-instruct --precision fp16
#8  

@Rodney, we will be releasing the STL files but not the CAD files.