Anthropic Warns AI Could Soon Build Itself, Posing Risks and Opportunities
Anthropic Warns AI Could Soon Build Itself

Artificial intelligence may not yet be capable of building better versions of itself, but Anthropic, the company behind the Claude AI model, suggests that the world is moving closer to this possibility than many realize. In a research paper titled "When AI Builds Itself," the company outlines how AI is increasingly being used to develop, test, and improve new AI systems.

Anthropic states that its own engineers now rely heavily on AI for coding, research, and experimentation, enabling the company to progress much faster than just a few years ago. According to the company, Claude now writes over 80% of the code merged into Anthropic's production systems, while engineers produce several times more output with AI assistance.

The technical trends discussed in the paper suggest that AI systems will become significantly more capable in the coming years, with major implications. "AI that can build itself would be a major development in the history of technology—one that could bring enormous good for the world in science, healthcare, and beyond," the paper states. However, it warns that "full recursive self-improvement also might increase the risks of humans losing control over AI systems," emphasizing the need for robust security, monitoring, and behavioral shaping.

Wide Pickt banner — collaborative shopping lists app for Telegram, phone mockup with grocery list

The research paper outlines three possible future scenarios:

  • The trend stalls, but today's AI capabilities are widely diffused.
  • AI labs continue to see compounding efficiency gains.
  • AI systems become capable of full recursive self-improvement and begin building their successors.

While much attention focuses on futuristic scenarios where machines become fully autonomous and self-improving, Anthropic suggests a more immediate and realistic future may arrive first. In the first possible outcome, AI progress slows compared to aggressive predictions, but powerful AI systems continue spreading across businesses, governments, and everyday work.

Even in that scenario, the impact could be enormous. Anthropic argues that companies equipped with advanced AI tools may achieve the output of organizations many times their size. Tasks that once required large teams could increasingly be handled by small groups working alongside AI agents capable of writing code, analyzing data, conducting research, and automating routine work. The company points to growing evidence that AI systems are already completing longer and more complex tasks, accelerating organizational speed.

Anthropic says this future would give governments, businesses, and societies more time to adapt than extreme scenarios involving fully self-improving AI. However, it warns that even if AI capabilities stopped advancing today, the technology could still reshape industries, change the nature of work, and create challenges around cybersecurity, misinformation, and economic disruption.

What Anthropic's Fear of 'Recursive Self-Improvement' in AI Actually Means

At the center of Anthropic's report is the concept of "recursive self-improvement"—the idea that an AI system could eventually design, build, and improve the next generation of AI models without human engineers doing most of the work. Currently, humans make key decisions, including which problems to solve, experiments to run, and ideas to pursue. AI tools like Claude assist with coding, testing, analysis, and technical tasks but depend on human direction and judgment.

Anthropic's concern is about what happens if that balance changes. In a fully recursive self-improvement scenario, an AI model would not just assist researchers but could effectively become the researcher. The system could design new AI architectures, run thousands of experiments, evaluate results, improve its own capabilities, and build a more advanced successor. That successor could repeat the process, creating a cycle of increasingly powerful AI systems.

According to Anthropic, such a future is not inevitable and may never happen. However, recent progress suggests AI systems are steadily taking on more of the work involved in AI development. The biggest unanswered question is whether AI can eventually develop "research taste"—the ability to identify important problems, choose the right direction, and make complex judgment calls. If AI systems become capable of that, the final barrier to autonomous AI development could begin to disappear.

Pickt after-article banner — collaborative shopping lists app with family illustration

Anthropic warns that such a future could bring major benefits, including faster scientific discoveries, better healthcare technologies, and rapid technological progress. At the same time, it could make it much harder for humans to monitor, control, and understand increasingly advanced AI systems. That is why the company believes governments, researchers, and technology firms need to start preparing now, even though fully autonomous AI development remains a future possibility rather than a present reality.

What Happens if AI Systems Themselves Become Capable of Full Recursive Self-Improvement

If technical trends continue and AI systems develop capabilities akin to transformative human ingenuity, it is plausible that AI systems could design and refine themselves. In this world, the pace of AI progress becomes determined entirely by the availability of compute or algorithmic efficiencies. Humans play a substantially diminished role in development, likely shifting efforts toward oversight, validation, and verification of an expanding "virtual lab" run by AI systems. Anthropic expects that systems capable of automated AI research would have skills transferable to other sciences, revolutionizing fields beyond AI.

In the research paper, Anthropic states it does "not have good intuitions for what this world would look like, because our economy is currently driven by humans and human-built tools." It warns: "By its nature, a world driven by fast recursive self-improvement could become dominated by the self-improving model as its capabilities fully eclipse those of humans and the model proliferates across the broader economy. It is difficult to predict what the economy looks like if human labor stops being competitive."

"Even if model development became fully automated and recursive, we can’t predict what that would mean for most humans’ daily lives. Amdahl’s law applies here as well. Recursive intelligence could lead to achieving many of the benefits outlined in Machines of Loving Grace, quickly in some domains. We expect that embodied intelligence (i.e., robotics) might quickly follow recursive intelligence, and follow a similar path of increasing returns at decreasing cost. More powerful intelligence might help us build things in the physical world more quickly, run more productive clinical trials of lifesaving drugs, and develop novel forms of coordination."

But achieving recursive improvement alone does not suggest an immediate change in industrial production, societal organization, or market functioning. More intelligence cannot learn what a drug does over decades of use, hold elections sooner than a constitution dictates, or turn a stranger into an old friend in a weekend. For most people, the felt pace of this future will still be set by bottlenecks, even if the laboratory upstream runs at the speed of compute. That collision, where recursive intelligence building itself ever faster meets the world of humans, relationships, and governance, is another part of this future the company says it cannot predict.