OpenAI Adds Agent Runtime to Responses API

From Language Model to Agent Platform

OpenAI has announced a significant expansion of its Responses API, adding a hosted container environment that turns the API from a text generation service into a full agent runtime platform. The update adds shell tool access, file management capabilities, and sandboxed compute containers that let AI agents execute code, manipulate files, and maintain persistent state across multi-step tasks, all within secure, managed infrastructure.

The announcement is OpenAI's most direct move into the agent infrastructure space. It gives developers the building blocks for AI agents that autonomously perform complex, multi-step workflows, without having to run their own compute infrastructure for agent execution.

Architecture of the Agent Runtime

The new agent runtime consists of three core components. First, the shell tool gives AI agents the ability to execute arbitrary shell commands within a sandboxed container. This means an agent can install packages, run scripts, compile code, and interact with command-line tools just as a human developer would from a terminal.

Second, a file management system allows agents to read, write, create, and modify files within their container. Files persist across multiple API calls within a session, enabling agents to build up complex artifacts — codebases, data analysis pipelines, documentation — over the course of a multi-step task.

Third, the containers themselves are fully isolated sandboxes that prevent agents from accessing resources outside their designated environment. Each container runs in its own namespace with restricted network access, ensuring that even if an agent executes malicious or erroneous code, the impact is contained within the sandbox.
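To make the three components concrete, here is a minimal local sketch of how they relate. It is not OpenAI's API: `LocalSession` and its methods are hypothetical names, a temporary directory stands in for the container filesystem, and a plain subprocess stands in for the shell tool. It provides none of the isolation a real sandbox would.

```python
import subprocess
import sys
import tempfile
from pathlib import Path


class LocalSession:
    """Hypothetical stand-in for a hosted container session (not OpenAI's API)."""

    def __init__(self) -> None:
        # A temporary directory plays the role of the container's filesystem.
        self._tmp = tempfile.TemporaryDirectory()
        self.workdir = Path(self._tmp.name)

    def write_file(self, name: str, content: str) -> None:
        (self.workdir / name).write_text(content)

    def read_file(self, name: str) -> str:
        return (self.workdir / name).read_text()

    def run(self, argv: list[str], timeout: float = 10.0) -> subprocess.CompletedProcess:
        # Every command runs with the session directory as its working
        # directory, so files written by one call are visible to the next.
        return subprocess.run(
            argv, cwd=self.workdir, capture_output=True, text=True, timeout=timeout
        )

    def close(self) -> None:
        self._tmp.cleanup()


session = LocalSession()
# Step 1: the "agent" writes a script into the session.
session.write_file(
    "hello.py",
    "from pathlib import Path\nPath('out.txt').write_text('42')\nprint('done')\n",
)
# Step 2: a later call executes it through the shell-like interface.
result = session.run([sys.executable, "hello.py"])
# Step 3: a third call reads back a file the script produced.
output = session.read_file("out.txt")
session.close()
```

The point of the sketch is the shape of the interaction: separate calls share one filesystem, which is what lets an agent build up artifacts over several steps.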


Why Developers Need This

Building AI agents that can take actions in the real world — rather than merely generating text — has been one of the most active areas of AI development over the past year. Frameworks like LangChain, AutoGPT, and CrewAI have demonstrated the potential of AI agents, but developers using these frameworks have had to manage their own infrastructure for code execution, file storage, and state management.

This infrastructure burden is significant. Running AI-generated code safely requires sandboxing to prevent security incidents. Maintaining state across multi-step agent workflows requires persistent storage. Scaling agent execution across multiple concurrent sessions requires container orchestration. By providing a managed runtime, OpenAI absorbs these infrastructure responsibilities, allowing developers to focus on agent design and task orchestration rather than DevOps.

Use Cases and Applications

The agent runtime enables several categories of applications that were previously difficult to build with API-only access. Code generation and testing agents can now write code, run it, observe the output, and iteratively debug — all within a single API session. Data analysis agents can load datasets, execute analysis scripts, generate visualizations, and return results without round-tripping data between the API and the developer's infrastructure.
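The write, run, observe, and debug cycle described above can be sketched as a simple loop. This is an illustration under stated assumptions, not the runtime's actual mechanism: `propose_code` is a hypothetical stub that stands in for a model, returning a buggy script first and a corrected one second, and the scripts run in a local temporary directory rather than a hosted container.

```python
import subprocess
import sys
import tempfile
from pathlib import Path

# Hypothetical stand-in for model output: a failing script, then a fix.
CANDIDATES = [
    "print(1 / 0)\n",
    "print(1 / 1)\n",
]


def propose_code(attempt: int, last_error: str) -> str:
    """Stub 'model'. A real agent would condition on last_error."""
    return CANDIDATES[min(attempt, len(CANDIDATES) - 1)]


def run_until_success(max_attempts: int = 3) -> tuple[int, str]:
    """Write a script, run it, and retry with the error until it passes."""
    last_error = ""
    with tempfile.TemporaryDirectory() as tmp:
        script = Path(tmp) / "task.py"
        for attempt in range(max_attempts):
            script.write_text(propose_code(attempt, last_error))
            proc = subprocess.run(
                [sys.executable, str(script)],
                capture_output=True,
                text=True,
                timeout=10,
            )
            if proc.returncode == 0:
                # Success: report how many attempts it took and the output.
                return attempt + 1, proc.stdout.strip()
            # Failure: keep stderr so the next proposal can react to it.
            last_error = proc.stderr
    raise RuntimeError(f"no passing script after {max_attempts} attempts: {last_error}")


attempts, output = run_until_success()
```

With the two stubbed candidates, the first run fails with a division error and the second succeeds, so the loop finishes on the second attempt.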

Research agents can be equipped with tools that access databases, APIs, and web services, synthesizing information from multiple sources into coherent reports. DevOps agents can execute deployment scripts, run health checks, and respond to operational incidents.

The runtime is also designed to support long-running tasks. Containers can persist for extended periods, allowing agents to work on tasks that take minutes or hours rather than the seconds typical of single API calls.


Competition and Market Context

OpenAI's agent runtime enters a competitive landscape. Anthropic offers a similar computer use capability for Claude, allowing the model to interact with desktop environments. Google's Gemini platform includes code execution through its AI Studio. And a growing ecosystem of open-source tools provides agent infrastructure that is not tied to any single model provider.

The differentiator for OpenAI's approach is integration depth. Because the runtime is built directly into the Responses API, agent capabilities are tightly coupled with the model's reasoning capabilities. The model can decide when to execute code, what files to create or modify, and how to interpret shell output — all as part of its natural response generation process.

Security and Governance

OpenAI emphasizes that the hosted container environment includes multiple security layers. Containers run with minimal privileges, network access is restricted to approved endpoints, and all agent actions are logged for audit purposes. Developers can set resource limits on containers — CPU, memory, disk space, execution time — to prevent runaway processes.
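As a rough illustration of what per-container limits might look like from a developer's side, the sketch below models the four limit types named above. The `Limits` class and its field names are assumptions made for this example, not OpenAI's configuration schema. Only the execution-time limit is actually enforced here, using a subprocess timeout; CPU, memory, and disk limits would need enforcement at the operating system or container level.

```python
import subprocess
import sys
from dataclasses import dataclass


@dataclass(frozen=True)
class Limits:
    """Hypothetical limit settings; field names are illustrative only."""

    cpu_cores: float = 1.0
    memory_mb: int = 512
    disk_mb: int = 1024
    max_seconds: float = 3.0

    def __post_init__(self) -> None:
        # Reject nonsensical limits before any process is started.
        for name in ("cpu_cores", "memory_mb", "disk_mb", "max_seconds"):
            if getattr(self, name) <= 0:
                raise ValueError(f"{name} must be positive")


def run_limited(argv: list[str], limits: Limits) -> tuple[bool, str]:
    """Run a command, enforcing only the wall-clock limit.

    Returns (finished, stdout). finished is False if the process was
    killed for exceeding limits.max_seconds.
    """
    try:
        proc = subprocess.run(
            argv, capture_output=True, text=True, timeout=limits.max_seconds
        )
    except subprocess.TimeoutExpired:
        return False, ""
    return True, proc.stdout


limits = Limits(max_seconds=3.0)
fast_finished, fast_out = run_limited([sys.executable, "-c", "print('fast')"], limits)
slow_finished, _ = run_limited(
    [sys.executable, "-c", "import time; time.sleep(30)"], limits
)
```

The quick command completes normally, while the sleeping one is terminated once the time limit passes, which is the runaway-process case the limits are meant to catch.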

The logging and audit capabilities are particularly important for enterprise use cases where compliance requirements demand visibility into what AI agents are doing. Every shell command executed, every file created or modified, and every network request made by an agent is recorded and can be reviewed.
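An audit trail of this kind can be pictured as an append-only list of structured records. The sketch below is a hypothetical model of such a log, not the format OpenAI uses: the `AuditLog` class, the three record kinds, and the field names are all assumptions chosen to mirror the shell, file, and network events mentioned above.

```python
import json
import time
from dataclasses import dataclass, field

# Event kinds mirror the three activity types described in the text.
ALLOWED_KINDS = {"shell", "file", "network"}


@dataclass
class AuditLog:
    """Hypothetical append-only audit log; the record schema is illustrative."""

    records: list = field(default_factory=list)

    def record(self, kind: str, **detail: str) -> None:
        if kind not in ALLOWED_KINDS:
            raise ValueError(f"unknown event kind: {kind}")
        # Each record carries a timestamp, its kind, and free-form details.
        self.records.append({"ts": time.time(), "kind": kind, **detail})

    def by_kind(self, kind: str) -> list:
        """Return all records of one kind, in the order they were logged."""
        return [r for r in self.records if r["kind"] == kind]

    def to_jsonl(self) -> str:
        """Serialize the log as JSON Lines, one record per line, for review."""
        return "\n".join(json.dumps(r, sort_keys=True) for r in self.records)


log = AuditLog()
log.record("shell", command="pip install requests")
log.record("file", path="report.md", action="create")
log.record("network", url="https://example.com/data")
log.record("shell", command="python analyze.py")

shell_events = log.by_kind("shell")
exported = log.to_jsonl()
```

A reviewer could then filter by kind, as `by_kind` does, or export the whole log for a compliance tool to ingest.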

As AI agents take on increasingly consequential tasks, the infrastructure that supports them must be as robust as the models themselves. OpenAI's hosted container environment represents an acknowledgment that the path from language model to autonomous agent requires not just better models but better infrastructure.

This article is based on reporting by OpenAI.


Originally published on openai.com
