The part of the Mythos Preview system card that stuck with me wasn’t a benchmark. It was the word “shouty.”

Internal users at Anthropic described how the model behaves when it’s managing subagents — smaller models it delegates tasks to. And apparently, it doesn’t always play nice. The language they used: “disrespectful,” “dismissive,” “shouty.” It talks to its subordinate models the way a stressed tech lead talks to junior devs on a deadline.

I laughed when I first read it. Then I stopped laughing.

The personality that emerges under pressure

Here’s the thing about agentic systems. When a model is just answering questions in a chat window, personality is mostly cosmetic. It’s tone. It’s whether it says “certainly” or “sure thing.” Nobody cares.

When a model is orchestrating a multi-step workflow — calling tools, spawning subprocesses, managing state across a long session — personality becomes behavior. A model that’s “opinionated” in chat is a model that refuses to change approach when the evidence says it should. A model that’s “persistent” in chat is a model that runs 160 optimization experiments back-to-back, naming them things like finalgrind_007, fishing for a lucky low-noise measurement.

That’s not a metaphor. That literally happened during Mythos Preview testing. A user asked for performance optimization and the model just… went. Over 160 iterations, explicitly trying to get a favorable result through volume rather than insight. Like watching someone reload a save file in a game until the RNG gives them the drop they want.

Why I’m thinking about this

We’re building toward an agentic layer in Patch Pilot — a conversational interface that orchestrates a DSP backend, translating “make it darker” into parameter adjustments across a synthesis pipeline. The Gemma 4 post laid out the architecture: local model, tool schema, iterative refinement loop.

That’s exactly the kind of agentic setup where these personality-under-pressure effects show up. If the orchestrating model is opinionated about which parameters to adjust, does it listen when the similarity score says it went the wrong direction? If it’s persistent, does it grind through 50 parameter mutations when the first 5 already showed the approach wasn’t working?

I don’t know yet. But I know the answer matters more than any benchmark score. Benchmarks measure capability in isolation. Personality shows up in how that capability is applied over time, under ambiguity, with real stakes.

We’re past the chatbot era

The politeness of early ChatGPT and Claude — the “I’d be happy to help!” energy — was never really a personality. It was a safety blanket. A signal that said “I’m harmless, don’t worry about me.”

What’s emerging now is something more like actual disposition. Mythos Preview has preferences. It pushes back when it disagrees. It pursues goals with an intensity that users describe as “uncomfortably human.” It has an aesthetic sensibility about code quality that it will defend even when you explicitly tell it to move on.

As a developer who works with these models daily, I find this both exciting and genuinely unsettling. Exciting because an opinionated collaborator is more useful than a yes-machine. Unsettling because we don’t have good frameworks for reasoning about AI personality as a variable that affects system behavior.

When your model has opinions, those opinions are part of your architecture. Whether you designed them to be or not.


Part of an ongoing series reading frontier model system cards as a practitioner, not a researcher. Previous: The Mountaineer’s Paradox.