When AI Builds Itself — Recursive Self-Improvement

The Accelerating Pace¶

Anthropic's internal analysis of recursive self-improvement (RSI) reveals a striking trend: the duration of tasks AI systems can complete independently has been doubling every 4 months — down from every 7 months just a year ago.

Consider the trajectory:

Opus 3 (2024) — could sustain ~4 minutes of autonomous work.
Sonnet 4 (2025) — extended to ~90 minutes.
Opus 4.6 (2026) — now reaches ~12 hours of continuous autonomous operation.

At current acceleration, days-long tasks are expected within this year. The compounding is not just in duration but in quality — each generation of models can make better use of its expanded horizon.

Code: Anthropic's Own Transformation¶

The most concrete signal of RSI in action comes from Anthropic's own engineering organization:

>80% of Anthropic's production code is now authored by Claude — from initial drafts to bug fixes to refactoring.
Engineers ship 8× more code per day than before Claude-assisted workflows.
Claude is superhuman at experimental loops — iterating through hypotheses, running experiments, and analyzing results at roughly 52× human speed for well-scoped tasks.

This isn't a projection. It's the current operational reality at one of the frontier labs.

Three Scenarios for the Future¶

Anthropic's analysis outlines three possible trajectories for recursive self-improvement:

1. Trend Stalls¶

Improvement hits diminishing returns as models saturate available data, compute, and architectural headroom. Progress continues but linear, not exponential. This is the benign scenario — we get very capable assistants but no discontinuous takeoff.

2. Soft Takeoff (Assistance)¶

RSI continues but remains tightly coupled with human oversight. AI systems accelerate research and development, but humans remain in the loop as bottlenecks — reviewing, directing, and validating. Productivity gains compound, but the ceiling is set by human throughput.

3. Hard Takeoff¶

AI systems achieve the ability to autonomously improve their own capabilities, recursively, without meaningful human oversight. Under this scenario, Anthropic estimates the fastest plausible timeline is 2027–2028.

Key Uncertainties¶

The hard takeoff timeline depends on resolving several bottlenecks:

Chip fabrication — can AI design and validate next-generation chips without human fabrication step? Fabrication remains a physical bottleneck with hard latency.
Power grid — training runs already strain regional power infrastructure. RSI-driven demand compounds this rapidly.
Research taste bottleneck — perhaps the most subtle constraint. Improving a model's ability to evaluate its own research directions — knowing what's worth pursuing — may not emerge automatically from scaling. If "taste" is hard to transfer, the recursion may stall.

Anthropic frames these uncertainties not as arguments against concern, but as critical monitoring points. If all three bottlenecks show signs of resolution simultaneously, the hard takeoff scenario becomes substantially more probable.