OpenAI Adds Agent Runtime to Responses API

From Language Model to Agent Platform

OpenAI has announced a significant expansion of its Responses API, equipping it with a hosted container environment that transforms the API from a text generation service into a full agent runtime platform. The update adds shell tool access, file management capabilities, and sandboxed compute containers that allow AI agents to execute code, manipulate files, and maintain persistent state across multi-step tasks — all within a secure, managed infrastructure.

The announcement represents OpenAI's most direct move into the agent infrastructure space, providing developers with the building blocks needed to create AI agents that can autonomously perform complex, multi-step workflows without requiring developers to manage their own compute infrastructure for agent execution.

Architecture of the Agent Runtime

The new agent runtime consists of three core components. First, the shell tool gives AI agents the ability to execute arbitrary shell commands within a sandboxed container. This means an agent can install packages, run scripts, compile code, and interact with command-line tools just as a human developer would from a terminal.

Second, a file management system allows agents to read, write, create, and modify files within their container. Files persist across multiple API calls within a session, enabling agents to build up complex artifacts — codebases, data analysis pipelines, documentation — over the course of a multi-step task.

Third, the containers themselves are fully isolated sandboxes that prevent agents from accessing resources outside their designated environment. Each container runs in its own namespace with restricted network access, ensuring that even if an agent executes malicious or erroneous code, the impact is contained within the sandbox.

How we used Gemini to build Google I/O 2026

Google、Gemini が I/O 2026 制作にどう役立ったかを明かす

Google は、Gemini やその他の AI ツールを使って Google I/O 2026 の映像、ビジュアル、イベント要素の制作を支援したと説明し、同カンファレンスを AI 支援制作の社内ショーケースとして位置づけている。

Read article

Why Developers Need This

Building AI agents that can take actions in the real world — rather than merely generating text — has been one of the most active areas of AI development over the past year. Frameworks like LangChain, AutoGPT, and CrewAI have demonstrated the potential of AI agents, but developers using these frameworks have had to manage their own infrastructure for code execution, file storage, and state management.

This infrastructure burden is significant. Running AI-generated code safely requires sandboxing to prevent security incidents. Maintaining state across multi-step agent workflows requires persistent storage. Scaling agent execution across multiple concurrent sessions requires container orchestration. By providing a managed runtime, OpenAI absorbs these infrastructure responsibilities, allowing developers to focus on agent design and task orchestration rather than DevOps.

Use Cases and Applications

The agent runtime enables several categories of applications that were previously difficult to build with API-only access. Code generation and testing agents can now write code, run it, observe the output, and iteratively debug — all within a single API session. Data analysis agents can load datasets, execute analysis scripts, generate visualizations, and return results without round-tripping data between the API and the developer's infrastructure.

Research agents can be equipped with tools that access databases, APIs, and web services, synthesizing information from multiple sources into coherent reports. DevOps agents can execute deployment scripts, run health checks, and respond to operational incidents.

The runtime is also designed to support long-running tasks. Containers can persist for extended periods, allowing agents to work on tasks that take minutes or hours rather than the seconds typical of single API calls.

OpenAI starts with infrastructure robots but aims for "everyone having a personal robot doing anything they need"

OpenAIは、インフラ作業と長期的な消費者向けビジョンを軸にロボティクスを再構築している

OpenAIはロボティクスチームを再編し、まずはインフラ関連の作業から着手する一方で、サム・アルトマンCEOは、将来的に誰もが個人用ロボットを持つという目標を描いている。

Read article

Competition and Market Context

OpenAI's agent runtime enters a competitive landscape. Anthropic offers a similar computer use capability for Claude, allowing the model to interact with desktop environments. Google's Gemini platform includes code execution through its AI Studio. And a growing ecosystem of open-source tools provides agent infrastructure that is not tied to any single model provider.

The differentiator for OpenAI's approach is integration depth. Because the runtime is built directly into the Responses API, agent capabilities are tightly coupled with the model's reasoning capabilities. The model can decide when to execute code, what files to create or modify, and how to interpret shell output — all as part of its natural response generation process.

Security and Governance

OpenAI emphasizes that the hosted container environment includes multiple security layers. Containers run with minimal privileges, network access is restricted to approved endpoints, and all agent actions are logged for audit purposes. Developers can set resource limits on containers — CPU, memory, disk space, execution time — to prevent runaway processes.

The logging and audit capabilities are particularly important for enterprise use cases where compliance requirements demand visibility into what AI agents are doing. Every shell command executed, every file created or modified, and every network request made by an agent is recorded and can be reviewed.

As AI agents take on increasingly consequential tasks, the infrastructure that supports them must be as robust as the models themselves. OpenAI's hosted container environment represents an acknowledgment that the path from language model to autonomous agent requires not just better models but better infrastructure.

This article is based on reporting by OpenAI. Read the original article.

AIコーディングエージェントの利用は社会科学全体で大きな偏りがあることが判明

Anthropicの研究により、社会科学におけるコーディングエージェントの採用には、性別、分野、キャリア段階、大学ランキングによる大きな差があることが示された。

Read article

Originally published on openai.com

OpenAI Launches Agent Runtime With Hosted Containers

From Language Model to Agent Platform

Architecture of the Agent Runtime

Google、Gemini が I/O 2026 制作にどう役立ったかを明かす

Why Developers Need This

Use Cases and Applications

OpenAIは、インフラ作業と長期的な消費者向けビジョンを軸にロボティクスを再構築している

Competition and Market Context

Security and Governance

AIコーディングエージェントの利用は社会科学全体で大きな偏りがあることが判明

Comments (0)

Related Articles

Anthropic、候補者を見極めるため面接でのAIツール使用を禁止

Keep Reading