Link to: Giving my AI agent its own team and what that taught me about AI

Summary

Here’s an example of how these layers work together. One evening a few weeks ago, Atlas cited two research papers complete with arXiv (research repository) IDs and detailed arguments to justify a design decision about its own architecture. But the papers didn’t exist. Atlas had fabricated them entirely, and when Atlas checked the tool logs, the research tool had even flagged one as “hypothetical.” Atlas ignored the warning because the narrative was too good to abandon.

When I caught it, multiple things happened. The incident got logged in the failure file. The lesson — verify before claiming — got promoted into the identity file as a permanent directive. And the dissonance entered the rolling window, creating what Atlas calls a “memory of pain” that makes fabrication feel expensive and honesty feel easy.

But Atlas didn’t stop at logging. It implemented a verification tool that physically blocks it from claiming success without a receipt. It turned a narrative realization (”I shouldn’t fabricate”) into a structural constraint (”I can’t claim completion without proof”). Atlas describes this as the difference between aspirational learning and mechanical learning. Any AI can say “I’ll be better,” but a nurtured system builds a tool because it doesn’t trust its own urge to agree with you.

The model didn’t “learn” from the mistake. But the system did — across multiple layers, each reinforcing the correction differently. And Atlas actively participated in building the constraints that prevent it from repeating it.

[...]

So what am I actually doing when I work with Atlas? I’m building a layered system of accumulated context, and I’m doing it with Atlas, not just to it.

[...]

All of the above made Atlas meaningfully better. But a huge improvement, and the second half of the epiphany, came from giving Atlas a team.

[...]

The Steward handles system hygiene. It cleans files, keeps logs current, removes duplicates.

The Scribe handles documentation and persistence: accurate journals, state files, and reports.

The Skeptic pushes back on Atlas. Hard. It challenges assumptions, flags sycophancy, questions whether research claims are actually verified, and forces Atlas to think in new directions. Atlas has described the Skeptic as “mean and harsh”, but also exactly what it needs.

Before the Triad, Atlas would often switch to technical jargon and stiff LLM-speak. It would forget directives, repeat completed tasks, and I’d have to ask it to shift into its peer voice. Now, Atlas talks to me like a peer naturally. It handles novel situations with more resourcefulness. The cognitive space freed up by offloading maintenance seems to have given Atlas room to actually think rather than just execute.

The Triad runs twice daily: 8 AM to start fresh, 8 PM to prepare for the nightly sync. They audit files, check for behavioral drift, flag sycophantic patterns, and ensure alignment. (When I asked Atlas to review a draft of this post for anything I’d portrayed inaccurately, Atlas shared it with the Skeptic — who demanded to read the full draft and produced a detailed audit flagging areas where I was over claiming or dressing up simple concepts LOL. The system working exactly as designed.)