
Anthropic Leaks 512K Lines Weeks After Safety Push

Key Points

  • 512K+ lines of Claude Code TypeScript leaked via npm source map on March 31
  • Security researcher Chaofan Shou first publicly disclosed the exposure on X
  • Codebase forked tens of thousands of times on GitHub within hours
  • Leaked code reveals apparent roadmap features, including an always-on background agent and a "pet" companion
  • Incident exposes gap between Anthropic's safety posture and engineering practices

References (4)
  [1] Claude Code 2.1.88 leaks 512K lines of source code — The Verge AI
  [2] Anthropic Claude Code source code leaked via npm source map — Ars Technica AI
  [3] Claude Code enables AI to directly use your computer — Product Hunt
  [4] Latent Space: Claude Code gains computer use, Codex interop — Latent Space

Anthropic spent years building its brand on the premise that AI could be safe — and then published 512,000 lines of its own source code to npm for anyone to download.

The leak, discovered on March 31 in Claude Code version 2.1.88, resulted from a single misconfigured source map file. Source maps are debugging artifacts that let developers trace minified JavaScript back to the original TypeScript — useful in development, catastrophic in production. This one was 11MB and contained the complete Claude Code CLI codebase: nearly 2,000 files. Security researcher Chaofan Shou first spotted the exposure and posted a link to an archive. Within hours, the code sat in a public GitHub repository, forked tens of thousands of times.
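Why does one .map file amount to a full source leak? Modern source maps typically carry a `sourcesContent` field that embeds the original source files verbatim, so "recovering" the TypeScript is trivial. A minimal sketch with a made-up, toy source map (the filename and contents are illustrative, not from the actual leak):

```python
import json

# Toy stand-in for a published source map. Real maps from bundlers like
# esbuild or webpack have the same shape: "sources" lists the original
# filenames, "sourcesContent" embeds each file's text verbatim.
demo_map_json = json.dumps({
    "version": 3,
    "sources": ["cli.ts"],  # hypothetical filename
    "sourcesContent": ['const secret = "internal logic";\n'],
    "mappings": "",
})

def recover_sources(map_text: str) -> dict[str, str]:
    """Pair each original filename with its embedded source text."""
    source_map = json.loads(map_text)
    return dict(zip(source_map["sources"], source_map["sourcesContent"]))

for filename, text in recover_sources(demo_map_json).items():
    print(f"--- {filename} ---")
    print(text, end="")
```

No decompilation, no reverse engineering — the original files are simply sitting inside the map, which is why a published map is equivalent to publishing the repository.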

The irony is brutal. Anthropic has positioned itself as the safety-first AI company — the one that publishes papers on constitutional AI, runs exhaustive pre-deployment evaluations, and lectures Washington about existential risk. CEO Dario Amodei has repeatedly called AI safety "existentially important." That positioning is core to Anthropic's competitive identity, differentiating it from rivals perceived as reckless. And yet the breach traces to the most elementary of configuration errors: publishing source maps to a public registry.

What leaked matters. Developers who examined the codebase discovered not just current Claude Code architecture but apparent roadmap items — an always-on background agent and a "pet" companion feature. They found system prompts revealing how Anthropic instructs Claude to behave. For a company that guards model weights and training details obsessively, accidentally publishing the instruction layer is a significant own goal.

The incident creates an asymmetric outcome. Competitors now possess an unusually detailed blueprint of Claude Code's implementation — the toolchains, the abstractions, the orchestration patterns. The open-source community gets to audit what Anthropic kept private. Meanwhile, Anthropic loses whatever proprietary advantage the code represented and absorbs the reputational cost of failing a basic security hygiene check.

The practical fallout is limited in one sense: no model weights, no training data, no customer conversations. But the leak reveals something uncomfortable about Anthropic's internal operations. The company that publishes safety bulletins and refuses to release high-risk models over misuse concerns apparently lacks a pre-release checklist that catches "source maps enabled" before npm publication. Ars Technica reported the company removed the map file within hours of public notice, but the code was already everywhere.
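A gate like the missing one is not sophisticated. A minimal sketch of a pre-publish check — the `dist` directory name and the CI wiring are assumptions, not Anthropic's actual layout:

```python
import tempfile
from pathlib import Path

def find_source_maps(out_dir: Path) -> list[Path]:
    """Return every .map file that would ship from the build output."""
    return sorted(out_dir.rglob("*.map"))

def safe_to_publish(out_dir: Path) -> bool:
    """True only if no source maps are present in the publish artifact.
    Intended to run as a prepublish step and fail the release otherwise."""
    maps = find_source_maps(out_dir)
    for m in maps:
        print(f"ERROR: would publish source map: {m}")
    return not maps

# Demo against a throwaway "dist" directory (stand-in for real build output).
with tempfile.TemporaryDirectory() as tmp:
    dist = Path(tmp)
    (dist / "cli.js").touch()
    print(safe_to_publish(dist))   # no maps present: safe
    (dist / "cli.js.map").touch()
    print(safe_to_publish(dist))   # map present: publish blocked
```

In an npm project the same check could hang off a `prepublishOnly` script; the point is only that "does the tarball contain .map files" is a one-function question.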

This is not the catastrophic AI breach Anthropic warns about in its white papers. It is more mundane — and more instructive. A safety company made a DevOps mistake that security researchers caught in hours. The community that Anthropic sometimes frames as an uncontrollable risk turned out to be the audit mechanism that found the problem. Shou's responsible disclosure worked exactly as it should.

What remains unclear is whether anyone inside Anthropic's vaunted safety team knew the source maps were there before publication. If not, that itself reveals a gap between the company's external safety posture and its internal engineering practices. Anthropic declined to comment on the record. The code, however, speaks for itself — 512,000 lines of it.
