OpenAI Adds Agent Runtime to Responses API

From Language Model to Agent Platform

OpenAI has announced a significant expansion of its Responses API, adding a hosted container environment that turns the API from a text generation service into an agent runtime. The update adds a shell tool, file management, and sandboxed compute containers, so AI agents can execute code, manipulate files, and maintain persistent state across multi-step tasks, all within secure, managed infrastructure.

The announcement is OpenAI's most direct move into agent infrastructure. It gives developers the building blocks for AI agents that carry out complex, multi-step workflows autonomously, without having to run their own compute for agent execution.

Architecture of the Agent Runtime

The new agent runtime consists of three core components. First, the shell tool gives AI agents the ability to execute arbitrary shell commands within a sandboxed container. This means an agent can install packages, run scripts, compile code, and interact with command-line tools just as a human developer would from a terminal.
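To make the shell tool's contract concrete, the following is a minimal local sketch of what such a tool implies: a command goes in, and the exit code, standard output, and standard error come back for the model to read. It is an illustration under stated assumptions, not OpenAI's implementation. The names `run_command` and `CommandResult` and the timeout default are hypothetical, and the command runs on the local machine with no sandboxing.

```python
import subprocess
import sys
from dataclasses import dataclass


@dataclass
class CommandResult:
    """What an agent would need to see after one command (hypothetical type)."""
    exit_code: int
    stdout: str
    stderr: str
    timed_out: bool = False


def run_command(argv: list[str], timeout_s: float = 10.0) -> CommandResult:
    """Run one command and capture its output.

    Hypothetical helper: it runs locally with no isolation, unlike a
    hosted container.
    """
    try:
        proc = subprocess.run(
            argv, capture_output=True, text=True, timeout=timeout_s
        )
    except subprocess.TimeoutExpired:
        # The child is killed by subprocess.run before this is raised.
        return CommandResult(
            exit_code=-1,
            stdout="",
            stderr=f"timed out after {timeout_s}s",
            timed_out=True,
        )
    return CommandResult(proc.returncode, proc.stdout, proc.stderr)


if __name__ == "__main__":
    result = run_command([sys.executable, "-c", "print('hello from the agent')"])
    print(result.exit_code, result.stdout.strip())
```

Returning the exit code alongside both output streams matters because a model deciding its next step needs to distinguish a failed command from a successful one that printed nothing.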

Second, a file management system allows agents to read, write, create, and modify files within their container. Files persist across multiple API calls within a session, enabling agents to build up complex artifacts — codebases, data analysis pipelines, documentation — over the course of a multi-step task.
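The persistence described above can be pictured with a small local stand-in: one working directory that lives as long as the session, so a file written in one step is still there in the next. This is a sketch only. The class `SessionWorkspace` and its method names are assumptions for illustration and say nothing about how the hosted containers store files.

```python
import tempfile
from pathlib import Path


class SessionWorkspace:
    """Hypothetical stand-in for a container's file system within one session."""

    def __init__(self) -> None:
        # One temporary directory per session; it outlives individual calls.
        self._tmp = tempfile.TemporaryDirectory()
        self.root = Path(self._tmp.name)

    def write_file(self, relative_path: str, content: str) -> None:
        target = self.root / relative_path
        target.parent.mkdir(parents=True, exist_ok=True)
        target.write_text(content, encoding="utf-8")

    def read_file(self, relative_path: str) -> str:
        return (self.root / relative_path).read_text(encoding="utf-8")

    def list_files(self) -> list[str]:
        return sorted(
            p.relative_to(self.root).as_posix()
            for p in self.root.rglob("*")
            if p.is_file()
        )

    def close(self) -> None:
        # Ending the session discards everything the agent built up.
        self._tmp.cleanup()


if __name__ == "__main__":
    ws = SessionWorkspace()
    ws.write_file("src/app.py", "print('v1')\n")  # step 1 of a task
    ws.write_file("notes.txt", "add tests\n")     # step 2, a later call
    print(ws.list_files())
    ws.close()
```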

Third, the containers themselves are fully isolated sandboxes that prevent agents from accessing resources outside their designated environment. Each container runs in its own namespace with restricted network access, ensuring that even if an agent executes malicious or erroneous code, the impact is contained within the sandbox.

Why Developers Need This

Building AI agents that can take actions in the real world — rather than merely generating text — has been one of the most active areas of AI development over the past year. Frameworks like LangChain, AutoGPT, and CrewAI have demonstrated the potential of AI agents, but developers using these frameworks have had to manage their own infrastructure for code execution, file storage, and state management.

This infrastructure burden is significant. Running AI-generated code safely requires sandboxing to prevent security incidents. Maintaining state across multi-step agent workflows requires persistent storage. Scaling agent execution across multiple concurrent sessions requires container orchestration. By providing a managed runtime, OpenAI absorbs these infrastructure responsibilities, allowing developers to focus on agent design and task orchestration rather than DevOps.

Use Cases and Applications

The agent runtime enables several categories of applications that were previously difficult to build with API-only access. Code generation and testing agents can now write code, run it, observe the output, and iteratively debug — all within a single API session. Data analysis agents can load datasets, execute analysis scripts, generate visualizations, and return results without round-tripping data between the API and the developer's infrastructure.
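The write, run, observe, and debug cycle can be sketched as a short loop. In the sketch below, `propose` stands in for the model: it receives the previous error (or `None` on the first attempt) and returns new source code. Everything here is hypothetical, including `debug_loop`, `run_python`, and the stub `fake_model`, and the code runs locally rather than in a hosted container.

```python
import subprocess
import sys
from typing import Callable, Optional


def run_python(source: str, timeout_s: float = 10.0) -> tuple[int, str, str]:
    """Execute a snippet in a fresh interpreter and return (exit code, stdout, stderr)."""
    proc = subprocess.run(
        [sys.executable, "-c", source],
        capture_output=True,
        text=True,
        timeout=timeout_s,
    )
    return proc.returncode, proc.stdout, proc.stderr


def debug_loop(
    propose: Callable[[Optional[str]], str], max_attempts: int = 3
) -> tuple[int, str]:
    """Ask `propose` for code, run it, and feed any error back until it succeeds.

    Returns (attempts used, stdout of the successful run).
    """
    last_error: Optional[str] = None
    for attempt in range(1, max_attempts + 1):
        source = propose(last_error)
        exit_code, out, err = run_python(source)
        if exit_code == 0:
            return attempt, out
        last_error = err  # the observation the next proposal can react to
    raise RuntimeError(f"no working version after {max_attempts} attempts")


def fake_model(last_error: Optional[str]) -> str:
    # Hypothetical stub: the first draft uses an undefined name,
    # and the revision defines it.
    if last_error is None:
        return "print(total)"
    return "total = sum(range(5))\nprint(total)"


if __name__ == "__main__":
    attempts, output = debug_loop(fake_model)
    print(attempts, output.strip())
```

The attempt cap is a deliberate design choice in this sketch: without it, a model that keeps proposing broken code would loop indefinitely.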

Research agents can be equipped with tools that access databases, APIs, and web services, synthesizing information from multiple sources into coherent reports. DevOps agents can execute deployment scripts, run health checks, and respond to operational incidents.

The runtime is also designed to support long-running tasks. Containers can persist for extended periods, allowing agents to work on tasks that take minutes or hours rather than the seconds typical of single API calls.

Competition and Market Context

OpenAI's agent runtime enters a competitive landscape. Anthropic offers a similar computer use capability for Claude, allowing the model to interact with desktop environments. Google's Gemini platform includes code execution through its AI Studio. And a growing ecosystem of open-source tools provides agent infrastructure that is not tied to any single model provider.

The differentiator for OpenAI's approach is integration depth. Because the runtime is built directly into the Responses API, tool execution is tightly coupled with the model's reasoning. The model can decide when to execute code, what files to create or modify, and how to interpret shell output, all as part of generating its response.

Security and Governance

OpenAI emphasizes that the hosted container environment includes multiple security layers. Containers run with minimal privileges, network access is restricted to approved endpoints, and all agent actions are logged for audit purposes. Developers can set resource limits on containers — CPU, memory, disk space, execution time — to prevent runaway processes.
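As a rough picture of the controls described above, the sketch below models resource limits and an endpoint allowlist as a single policy object. The class `ContainerPolicy`, its field names, and its default values are all assumptions made for illustration. The article does not specify how limits are configured, and this code only validates and checks values; it does not enforce anything on a real process.

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass(frozen=True)
class ContainerPolicy:
    """Hypothetical policy object; field names and defaults are illustrative."""
    cpu_cores: float = 1.0
    memory_mb: int = 512
    disk_mb: int = 1024
    max_runtime_s: int = 300
    allowed_hosts: frozenset[str] = frozenset()

    def __post_init__(self) -> None:
        # Reject nonsensical limits up front rather than at run time.
        for name in ("cpu_cores", "memory_mb", "disk_mb", "max_runtime_s"):
            if getattr(self, name) <= 0:
                raise ValueError(f"{name} must be positive")

    def allows_request(self, url: str) -> bool:
        """Return True only if the URL's host is on the allowlist."""
        host = urlparse(url).hostname
        return host is not None and host.lower() in self.allowed_hosts


if __name__ == "__main__":
    policy = ContainerPolicy(memory_mb=2048, allowed_hosts=frozenset({"pypi.org"}))
    print(policy.allows_request("https://pypi.org/simple/"))
    print(policy.allows_request("https://example.com/"))
```

The allowlist is deny-by-default: an empty `allowed_hosts` permits no outbound requests, which matches the restricted posture the article describes.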

The logging and audit capabilities are particularly important for enterprise use cases where compliance requirements demand visibility into what AI agents are doing. Every shell command executed, every file created or modified, and every network request made by an agent is recorded and can be reviewed.
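One way to picture such an audit trail is an append-only log with one structured record per agent action. The sketch below is an assumption-laden illustration: `AuditLog`, `AuditEvent`, and the three event kinds are hypothetical names chosen to mirror the categories in the paragraph above, and the records are held in memory rather than in durable storage.

```python
import json
import time
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class AuditEvent:
    session_id: str
    kind: str       # "shell", "file_write", or "network" in this sketch
    detail: str
    timestamp: float


class AuditLog:
    """Append-only, in-memory log of agent actions (hypothetical)."""

    KINDS = {"shell", "file_write", "network"}

    def __init__(self) -> None:
        self._lines: list[str] = []  # one JSON document per line

    def record(self, session_id: str, kind: str, detail: str) -> AuditEvent:
        if kind not in self.KINDS:
            raise ValueError(f"unknown event kind: {kind}")
        event = AuditEvent(session_id, kind, detail, time.time())
        self._lines.append(json.dumps(asdict(event), sort_keys=True))
        return event

    def events_for(self, session_id: str) -> list[dict]:
        """Return one session's events in the order they were recorded."""
        parsed = (json.loads(line) for line in self._lines)
        return [e for e in parsed if e["session_id"] == session_id]


if __name__ == "__main__":
    log = AuditLog()
    log.record("s1", "shell", "pip install requests")
    log.record("s1", "file_write", "report.md")
    for event in log.events_for("s1"):
        print(event["kind"], event["detail"])
```

Serializing each event as a single JSON line keeps the log easy to append to and to filter later, which is the property a compliance review depends on.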

As AI agents take on increasingly consequential tasks, the infrastructure that supports them must be as robust as the models themselves. OpenAI's hosted container environment represents an acknowledgment that the path from language model to autonomous agent requires not just better models but better infrastructure.

This article is based on reporting by OpenAI.

Originally published on openai.com
