OpenAI Adds Agent Runtime to Responses API

From Language Model to Agent Platform

OpenAI has announced a significant expansion of its Responses API, equipping it with a hosted container environment that transforms the API from a text generation service into a full agent runtime platform. The update adds shell tool access, file management capabilities, and sandboxed compute containers that allow AI agents to execute code, manipulate files, and maintain persistent state across multi-step tasks — all within a secure, managed infrastructure.

The announcement represents OpenAI's most direct move into the agent infrastructure space, providing developers with the building blocks needed to create AI agents that can autonomously perform complex, multi-step workflows without requiring developers to manage their own compute infrastructure for agent execution.

Architecture of the Agent Runtime

The new agent runtime consists of three core components. First, the shell tool gives AI agents the ability to execute arbitrary shell commands within a sandboxed container. This means an agent can install packages, run scripts, compile code, and interact with command-line tools just as a human developer would from a terminal.

Second, a file management system allows agents to read, write, create, and modify files within their container. Files persist across multiple API calls within a session, enabling agents to build up complex artifacts — codebases, data analysis pipelines, documentation — over the course of a multi-step task.

Third, the containers themselves are fully isolated sandboxes that prevent agents from accessing resources outside their designated environment. Each container runs in its own namespace with restricted network access, ensuring that even if an agent executes malicious or erroneous code, the impact is contained within the sandbox.

How we used Gemini to build Google I/O 2026

Google erklärt, wie Gemini bei der Produktion von I/O 2026 half

Google sagt, Teams hätten Gemini und andere KI-Tools eingesetzt, um Filme, Visuals und Event-Elemente für Google I/O 2026 zu erstellen, und stellt die Konferenz als internes Schaufenster KI-gestützter Produktion dar.

Read article

Why Developers Need This

Building AI agents that can take actions in the real world — rather than merely generating text — has been one of the most active areas of AI development over the past year. Frameworks like LangChain, AutoGPT, and CrewAI have demonstrated the potential of AI agents, but developers using these frameworks have had to manage their own infrastructure for code execution, file storage, and state management.

This infrastructure burden is significant. Running AI-generated code safely requires sandboxing to prevent security incidents. Maintaining state across multi-step agent workflows requires persistent storage. Scaling agent execution across multiple concurrent sessions requires container orchestration. By providing a managed runtime, OpenAI absorbs these infrastructure responsibilities, allowing developers to focus on agent design and task orchestration rather than DevOps.

Use Cases and Applications

The agent runtime enables several categories of applications that were previously difficult to build with API-only access. Code generation and testing agents can now write code, run it, observe the output, and iteratively debug — all within a single API session. Data analysis agents can load datasets, execute analysis scripts, generate visualizations, and return results without round-tripping data between the API and the developer's infrastructure.

Research agents can be equipped with tools that access databases, APIs, and web services, synthesizing information from multiple sources into coherent reports. DevOps agents can execute deployment scripts, run health checks, and respond to operational incidents.

The runtime is also designed to support long-running tasks. Containers can persist for extended periods, allowing agents to work on tasks that take minutes or hours rather than the seconds typical of single API calls.

OpenAI starts with infrastructure robots but aims for "everyone having a personal robot doing anything they need"

OpenAI baut seine Robotik rund um Infrastrukturarbeit und eine längerfristige Verbraucher-Vision neu auf

OpenAI hat sein Robotik-Team neu aufgebaut und beginnt mit Infrastrukturaufgaben, während CEO Sam Altman ein längerfristiges Ziel persönlicher Roboter für alle beschreibt.

Read article

Competition and Market Context

OpenAI's agent runtime enters a competitive landscape. Anthropic offers a similar computer use capability for Claude, allowing the model to interact with desktop environments. Google's Gemini platform includes code execution through its AI Studio. And a growing ecosystem of open-source tools provides agent infrastructure that is not tied to any single model provider.

The differentiator for OpenAI's approach is integration depth. Because the runtime is built directly into the Responses API, agent capabilities are tightly coupled with the model's reasoning capabilities. The model can decide when to execute code, what files to create or modify, and how to interpret shell output — all as part of its natural response generation process.

Security and Governance

OpenAI emphasizes that the hosted container environment includes multiple security layers. Containers run with minimal privileges, network access is restricted to approved endpoints, and all agent actions are logged for audit purposes. Developers can set resource limits on containers — CPU, memory, disk space, execution time — to prevent runaway processes.

The logging and audit capabilities are particularly important for enterprise use cases where compliance requirements demand visibility into what AI agents are doing. Every shell command executed, every file created or modified, and every network request made by an agent is recorded and can be reviewed.

As AI agents take on increasingly consequential tasks, the infrastructure that supports them must be as robust as the models themselves. OpenAI's hosted container environment represents an acknowledgment that the path from language model to autonomous agent requires not just better models but better infrastructure.

This article is based on reporting by OpenAI. Read the original article.

Anthropic bans AI tools during job interviews to see how candidates actually think

Anthropic verbietet KI-Tools in Interviews, um Bewerber zu prüfen

Anthropic verbietet Berichten zufolge KI-Hilfe in Live-Jobinterviews, sofern dies nicht ausdrücklich erlaubt ist, während das Unternehmen prüfen will, wie Bewerber eigenständig denken.

Read article

Originally published on openai.com

OpenAI Launches Agent Runtime With Hosted Containers

From Language Model to Agent Platform

Architecture of the Agent Runtime

Google erklärt, wie Gemini bei der Produktion von I/O 2026 half

Why Developers Need This

Use Cases and Applications

OpenAI baut seine Robotik rund um Infrastrukturarbeit und eine längerfristige Verbraucher-Vision neu auf

Competition and Market Context

Security and Governance

Anthropic verbietet KI-Tools in Interviews, um Bewerber zu prüfen

Comments (0)

Related Articles

KI-Modelle trennen Rezeptlogik von Geschmackschemie

MISUMI startet mit 1-Milliarde-Dollar-Wette auf KI-Fertigung in den Amerikas durch

Keep Reading