AI Code Generation: Claude Now Writes Most Code

From Coding Assistant to Primary Developer

Self-improving AI systems are AI models that can generate, test, and refine the code of the software they run on, creating feedback loops where each improvement enables the next and gradually shifts human developers from primary authors to reviewers and supervisors of AI-driven change. Anthropic’s internal data shows how quickly this future is arriving. The company reports that its engineers now merge about eight times more code per active contributor than in the pre-2025 baseline, with productivity jumping steeply after major Claude releases. According to Anthropic, Claude wrote more than 80% of the code merged into its production systems in May, turning AI code generation from a productivity boost into the main development engine. Human engineers still choose tasks and approve merges, but most new lines of code now originate from AI instead of people.

Claude Now Writes Most of Anthropic’s Code—What It Means for Developers

Claude Productivity Boost and the New Engineering Workflow

Inside Anthropic, AI code generation has restructured the daily work of engineers. Claude Code and related tools now handle the bulk of implementation, while humans shape specifications, prompt the model, and perform AI code review on suggested changes. Internal charts show a long plateau in code output from 2021 through 2024, followed by a sharp rise that tracks Claude’s upgrade cycle and the internal rollout of Mythos. Some engineers quoted by Anthropic say they stopped writing code themselves months ago, preferring to “Claudify” tasks by letting the model produce first drafts and then editing. Lines of code per contributor are up 8x, but that volume would be impossible without new guardrails: structured pull requests, capped per-PR line counts, and careful tracking of who authored what. The coding bottleneck has moved from typing functions to validating large waves of AI-authored changes.

Self-Improving Loops Without Full Autonomy—Yet

Anthropic now talks openly about self-improving AI loops, where models help build the next version of themselves. Claude already assists with debugging, performance tuning, and infrastructure work for Anthropic’s own systems, including handling multi-hour tasks that would once have required dedicated engineering sprints. In one example, the model made around 800 fixes to an API, cutting error rates and completing work estimated to take a human engineer several years. Mythos-based iterative rewriting loops reportedly speed up some software by about 52x on average. Still, Anthropic stresses that Claude has not achieved full recursive self-improvement: engineers remain in control of design choices, experiments, and deployment decisions. For now, self-improvement looks more like a powerful co-optimization cycle—AI accelerates coding and testing, humans set direction and decide which improvements become part of production systems.

Risk Shifts from Coding to AI Code Review

With AI generating most of the code, risk management focuses less on writing bugs and more on missing them during review. Anthropic’s disclosure reframes the main enterprise question from “Can AI write code?” to “Can teams review and approve AI-written code safely before it hits live systems?” Claude is now integrated into production workflows as a coding agent whose changes only reach users after passing human oversight, automated tests, and security checks. Anthropic highlights the need for audit trails, rollback paths, and explicit control gates that require human approval before deployment. The quality of Claude’s code is expected to match or exceed typical human output on many tasks, but speed cuts both ways: subtle security flaws or logic errors could propagate faster if review systems lag. Effective AI development safeguards now depend on scalable review practices, not individual heroics by engineers paging through massive diffs.

Calling for AI Development Safeguards and Pause Options

As Claude’s coding abilities accelerate, Anthropic is pushing for broader AI development safeguards beyond its own repositories. The company argues that advanced systems may eventually help create more powerful successors, raising the stakes if oversight falls behind. Its leaders have called for the option to temporarily pause advanced AI development if capabilities move too quickly, alongside stronger international cooperation on standards and monitoring. A recent blog post from the Anthropic Institute explores how close current systems are to something like recursive self-improvement and notes that humans still have better “research taste” for designing tests and evaluation regimes. Not all experts agree with Anthropic’s warnings, but even critics acknowledge that clearer regulations and safety practices are increasingly important. For developers, the message is direct: design workflows where AI code generation and AI code review are both first-class citizens, backed by controls that can slow or stop deployment when needed.

Claude Now Writes Most of Anthropic’s Code—What It Means for Developers

From Coding Assistant to Primary Developer

Claude Productivity Boost and the New Engineering Workflow

Self-Improving Loops Without Full Autonomy—Yet

Risk Shifts from Coding to AI Code Review

Calling for AI Development Safeguards and Pause Options

You May Also Like