OpenAI's Next Grand Challenge

OpenAI has announced a sweeping new research ambition: building what it calls an AI researcher — a fully automated, agent-based system capable of independently tackling large, complex scientific problems. In an exclusive interview with MIT Technology Review, Chief Scientist Jakub Pachocki described the initiative as OpenAI's North Star for the coming years, representing a convergence of the company's work on reasoning models, coding agents, and interpretability into a unified long-horizon goal.

The timeline is concrete and near-term in ways that distinguish this announcement from the more diffuse AGI promises the industry has traded in for years. OpenAI plans to build an autonomous AI research intern — a system capable of independently working on specific research problems for days at a time — by September 2026. The full multi-agent AI researcher, capable of tackling problems too large or complex for humans to manage, is targeted for a 2028 debut.

Codex as the Blueprint

Pachocki pointed to OpenAI's existing Codex agent as both the evidence base and the early prototype for the more ambitious AI researcher vision. Codex, which OpenAI released in January, is an agent-based coding system that can autonomously generate, run, and debug code to complete complex programming tasks. It has been broadly adopted within OpenAI itself, with Pachocki noting that most of the company's technical staff now use Codex as a core part of their workflow.

For Pachocki, the philosophical leap is that if an AI system can autonomously solve complex coding problems — work that requires creative reasoning, decomposition of large tasks into subtasks, tracking of complex state over extended work sessions, and error correction — then the same capability architecture can be extended to scientific problem solving in domains like biology, chemistry, physics, and mathematics.
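OpenAI has not published the internals of these agents, but the loop Pachocki describes can be illustrated in miniature: decompose a goal into subtasks, carry state forward between them, and fold failures back into the next attempt. The sketch below is purely illustrative; every name in it (plan, attempt_subtask, TaskState) is hypothetical and does not correspond to any OpenAI interface.

```python
from dataclasses import dataclass, field

# Hypothetical sketch of a long-horizon agent loop: decompose a task,
# track state across subtasks, and retry on failure. Illustrative only.

@dataclass
class TaskState:
    goal: str
    completed: list = field(default_factory=list)   # finished subtasks
    notes: dict = field(default_factory=dict)       # accumulated context

def plan(goal: str) -> list[str]:
    # In a real agent this would be a model call; here it is a stub.
    return [f"{goal}: step {i}" for i in range(1, 4)]

def attempt_subtask(subtask: str, state: TaskState) -> tuple[bool, str]:
    # Stand-in for generating, running, and checking work (e.g. code plus tests).
    return True, f"result of {subtask}"

def run_agent(goal: str, max_retries: int = 2) -> TaskState:
    state = TaskState(goal=goal)
    for subtask in plan(goal):
        for attempt in range(max_retries + 1):
            ok, result = attempt_subtask(subtask, state)
            if ok:
                state.completed.append(subtask)
                state.notes[subtask] = result
                break
            # Error correction: record the failure so the next attempt can use it.
            state.notes[subtask] = f"failed attempt {attempt}: {result}"
    return state

if __name__ == "__main__":
    final = run_agent("reproduce a published analysis")
    print(final.completed)
```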

"Our jobs are now totally different than they were even a year ago. Nobody really edits code all the time anymore. Instead, you manage a group of Codex agents," Pachocki told MIT Technology Review. The vision is that the same management relationship — human directing, AI executing — could eventually apply to research itself, with scientists directing AI agents that independently pursue experimental hypotheses, review literature, design analyses, and generate results.