From Stealth to Spotlight
A new robotics AI company has emerged from stealth mode with one of the largest debut funding rounds in the history of the robotics industry. Rhoda AI has raised $450 million to commercialize a system that trains robots to perform complex tasks by watching video demonstrations rather than through traditional programming or manual teleoperation.
The company says its approach dramatically reduces the time and expertise required to teach robots new skills, potentially solving one of the biggest bottlenecks in robotics deployment: the programming problem. Today, getting a robot to perform a new task typically requires weeks or months of specialized engineering work. Rhoda AI claims its system can accomplish the same in hours.
Learning by Watching
The core technology behind Rhoda AI is a foundation model trained on vast amounts of video data showing humans performing physical tasks. The model learns not just what actions look like, but the underlying physics, spatial relationships, and causal chains that connect an intention to a completed task.
When a user wants to teach a Rhoda-equipped robot a new skill, they can simply show the robot a video of the task being performed, whether from a smartphone recording, an instructional video, or existing surveillance footage. The AI system analyzes the video, extracts the relevant actions and their sequence, maps them onto the robot's physical capabilities, and generates a control policy that allows the robot to replicate the task in its own environment.
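That pipeline — analyze the video, extract an action sequence, map it onto the robot's capabilities, generate a control policy — can be sketched in a few lines. Rhoda AI has not published its software, so every name below is hypothetical; this is only an illustration of the stages the company describes, with a stubbed-out vision step.

```python
# Illustrative sketch only: Rhoda AI has not published its API. All class
# and function names here are hypothetical, showing the described pipeline:
# video in, action sequence out, control policy mapped to the robot.

from dataclasses import dataclass, field


@dataclass
class Action:
    """One step recovered from a demonstration video."""
    verb: str    # e.g. "grasp", "move", "release"
    target: str  # object the action applies to


@dataclass
class ControlPolicy:
    """Ordered commands mapped onto a specific robot's capabilities."""
    steps: list = field(default_factory=list)


def extract_actions(video_frames):
    """Stand-in for the learned video model: recover the action sequence."""
    # A real system would run vision inference here; we return a fixed plan.
    return [Action("grasp", "cup"), Action("move", "shelf"), Action("release", "cup")]


def map_to_robot(actions, robot_capabilities):
    """Keep only the actions the target robot can actually perform."""
    return ControlPolicy(steps=[a for a in actions if a.verb in robot_capabilities])


frames = []  # placeholder for decoded video frames
policy = map_to_robot(extract_actions(frames), {"grasp", "move", "release"})
print([(a.verb, a.target) for a in policy.steps])
```

The interesting engineering lives inside the two stubbed functions; the point here is only the shape of the hand-off from perception to actuation.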
This represents a fundamental shift from current approaches. Most robot training today relies on either explicit programming, where engineers manually code every movement and decision point, or reinforcement learning, where robots learn through millions of trial-and-error attempts in simulation before transferring skills to the physical world. Both approaches are time-consuming, expensive, and require specialized expertise.
Bridging the Reality Gap
One of Rhoda AI's most significant claims is that its system is designed to operate beyond controlled laboratory demonstrations, in real-world environments. This addresses what roboticists call the sim-to-real gap (here, a video-to-real gap): the challenge of transferring skills learned in one context to the messy, unpredictable conditions of actual deployment.
Real-world environments differ from training scenarios in countless ways. Lighting changes, objects are positioned differently, surfaces have different friction properties, and unexpected obstacles appear. Systems that work perfectly in controlled settings often fail catastrophically when these conditions vary even slightly.
Rhoda AI says it addresses this through a combination of robust visual understanding and adaptive control. The foundation model has been trained on sufficiently diverse video data that it develops generalized understanding of physics and object interactions rather than memorizing specific scenarios. When deploying in a new environment, the system continuously adapts its control policies based on real-time sensory feedback.
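The adaptive-control idea — continuously correcting based on real-time sensory feedback rather than replaying a fixed script — can be illustrated with a minimal proportional feedback loop. This is a generic textbook sketch, not Rhoda AI's actual controller: the robot senses its state, computes the error against the target, and applies a correction each cycle.

```python
# Minimal closed-loop control sketch (proportional feedback). A generic
# illustration of adapting to live sensor readings, not a description of
# Rhoda AI's control stack.

def proportional_controller(target, sense, actuate, gain=0.5,
                            tolerance=0.01, max_steps=100):
    """Repeatedly correct toward `target` using live sensor feedback."""
    for _ in range(max_steps):
        error = target - sense()
        if abs(error) < tolerance:
            break  # close enough: stop correcting
        actuate(gain * error)  # correction proportional to remaining error
    return sense()


# Toy 1-D "robot": its position changes only when actuated.
state = {"pos": 0.0}
final = proportional_controller(
    target=1.0,
    sense=lambda: state["pos"],
    actuate=lambda delta: state.__setitem__("pos", state["pos"] + delta),
)
print(round(final, 3))  # converges near 1.0
```

Because each cycle re-reads the sensor, the same loop still converges if the environment perturbs the state between steps — which is the property the article attributes to Rhoda AI's deployment-time adaptation.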
The Funding and the Backers
The $450 million funding round is remarkable for a company emerging from stealth, reflecting the intense investor appetite for robotics AI companies. The round places Rhoda AI among the best-funded robotics startups in history, alongside companies like Figure AI and 1X Technologies that have also attracted hundreds of millions in recent funding.
The size of the round suggests that investors see Rhoda AI's approach as potentially transformative for the robotics industry, which has long struggled with the scalability problem. The global installed base of industrial robots is only around four million units, a fraction of what many analysts believe the market could support if robots were easier to program and deploy.
Applications and Target Markets
Rhoda AI is initially targeting manufacturing, logistics, and warehousing, sectors where repetitive physical tasks are well-suited to robotic automation but where the diversity of tasks and environments has limited adoption. A warehouse that handles thousands of different products, for example, would traditionally need separate programming for each item's pick-and-place requirements. Video-based learning could potentially handle this diversity with a fraction of the engineering effort.
The company is also exploring applications in food service, agriculture, and healthcare, domains where labor shortages are acute and the ability to quickly teach robots new tasks could be particularly valuable. In agriculture, for instance, different crops require different harvesting techniques, and the ability to train a robot by showing it a video of proper harvesting could make robotic agriculture far more practical.
Challenges and Skepticism
Despite the impressive funding and ambitious claims, significant challenges remain. The robotics industry has a long history of startups that demonstrated impressive capabilities in controlled settings but struggled to deliver reliable performance at commercial scale.
Video-based learning faces inherent limitations. Videos capture visual information but miss many aspects of physical tasks that are critical for robotic execution: the precise force required to grip an object, the tactile feedback that guides delicate manipulations, and the compliance needed to handle fragile items. How well Rhoda AI's system handles these non-visual aspects will likely determine its real-world viability.
The company will also need to demonstrate that its approach works across a wide range of robot hardware, not just specific platforms optimized for its software. Most commercial robotics applications require integration with existing equipment and infrastructure, and the ability to deploy across diverse hardware configurations is essential for broad adoption.
A New Paradigm for Robotics
Regardless of how Rhoda AI's specific technology performs at scale, the company's emergence signals a broader shift in how the robotics industry thinks about the programming problem. The combination of foundation models, video understanding, and adaptive control represents a fundamentally different approach from the traditional robotics pipeline, and the massive funding it has attracted suggests the industry believes a breakthrough in robot teachability may be approaching.
This article is based on reporting by The Robot Report.