Discussion about this post

User's avatar
PEG's avatar

The cathedral metaphor does a lot of work, but there’s a prior question the essay doesn’t ask: what if language is already the architecture?

Interpretability findings show the model has internalized planning circuits, abstract reasoning, and transferable primitives—but language itself is not neutral. It is a compressed, evolved artifact, a sedimented record of human interaction with the world, encoding causal chains, temporal order, and inferential structure.

Minimizing loss on text that already carries logical structure may reproduce coherent behaviour without implying a separate emergent world model.

The Tumithak objection is fair, but this one is prior: the essay hasn’t established what the optimization pressure is actually compressing. Because language itself is already shaped by reality, the model’s outputs reflect the recovery of invariants from a world-shaped signal.

The more precise question isn’t whether the model does more than next-token prediction—it clearly does—but how much of the cathedral was already carved into the stones before the model ever saw them.

Tumithak of the Corridors's avatar

I think you’re right about the “just next word prediction” needing to die as a criticism. It’s lazy and it’s more a thought terminating cliché than helpful description.

But, here’s the thing. This essay’s evidentiary backbone. Almost all of the heavy lifting in your evidence section comes from Anthropic’s own circuit tracing and interprebility work. And, to my knowledge, none of that has been through peer review. The “research” is published on their blog or are preprints on arxiv.

Anthropic has a direct financial interest in people believing these modesl do something more sophisticated than very good pattern completion. I explored this in my essay AI Eschatology. They’re selling a product. When their in-house research produces findings like “emergent introspective awareness,” and those findings get picked up and cited as settled science, that’s their pipeline working exactly as intended…

The findings might be real. But you’re building a case on the manufacturer’s marketing materials.

30 more comments...

No posts

Ready for more?