AI coding agents are remarkably capable — until they aren't. The difference between a system that AI can safely modify and one that quietly breaks under AI edits often comes down to a handful of structural decisions made long before any prompt is written.
This is not a post about prompting technique. It is about the architecture of trust.
Human developers build systems with a certain amount of productive anxiety. They hesitate before touching a function that "feels load-bearing." They leave cautious comments. They pause before deleting something they don't fully understand.
AI coding agents do not hesitate. They are confident, consistent, and completely indifferent to the implicit knowledge embedded in your codebase. If your system depends on undocumented conventions, tribal knowledge, or "never touch this" folklore, an AI agent will walk right through it — efficiently and without apology.
The solution is not to mistrust AI. It is to make the right things impossible to break by accident.
A system that AI can safely modify shares a set of properties that are worth designing for deliberately.
Explicit boundaries. The most dangerous codebases for AI modification are monoliths where everything touches everything else. When a change in one file can ripple through a dozen others with no clear contract between them, AI agents produce changes that are locally correct but globally harmful. The remedy is not perfect isolation — it is explicit interfaces. If you can name the inputs and outputs of a component, an AI agent can reason about changes to it without needing to understand the rest.
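To make that concrete, here is a minimal sketch of an explicit interface, using Python's structural typing. The component, its names, and its pricing domain are all hypothetical, chosen only to illustrate the idea of a nameable contract:

```python
from typing import Protocol

class PriceSource(Protocol):
    """The contract: given a SKU, return a price. Never None."""
    def price_for(self, sku: str) -> float:
        ...

class CatalogPriceSource:
    """One implementation. An AI agent can modify its internals freely,
    because the inputs and outputs are named in the Protocol above."""

    def __init__(self, prices: dict[str, float]) -> None:
        self._prices = prices

    def price_for(self, sku: str) -> float:
        # The contract is explicit: an unknown SKU raises, rather than
        # silently returning None or 0.0.
        try:
            return self._prices[sku]
        except KeyError:
            raise KeyError(f"unknown SKU: {sku}")

source = CatalogPriceSource({"SKU-001": 9.99})
print(source.price_for("SKU-001"))  # 9.99
```

Anything that satisfies `PriceSource` can replace `CatalogPriceSource`; an agent editing the implementation only needs to preserve that one named boundary, not understand the rest of the system.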
Tests that encode intent, not just behavior. Most test suites verify that code does what it currently does. That is useful for regression, but it tells an AI agent nothing about what the code is supposed to do. Tests that express intent — "this function must never return a null price" rather than "this function currently returns 9.99 for SKU-001" — give AI something to reason against. They become a constraint, not just a check.
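The contrast can be shown in a few lines. This sketch uses a hypothetical `price_for` function; the first test pins current behavior, while the second expresses the invariant an AI agent must not break:

```python
def price_for(sku: str) -> float:
    # Hypothetical implementation under test.
    prices = {"SKU-001": 9.99, "SKU-002": 4.50}
    return prices.get(sku, 0.0)  # unknown SKUs fall back to 0.0, never None

def test_known_sku_current_value():
    # Behavior pin: catches regressions, but says nothing about intent.
    assert price_for("SKU-001") == 9.99

def test_price_is_never_null_or_negative():
    # Intent: these invariants hold for every input, including unknowns.
    for sku in ["SKU-001", "SKU-002", "SKU-UNKNOWN"]:
        price = price_for(sku)
        assert price is not None
        assert price >= 0.0
```

If an agent rewrites `price_for`, the first test tells it what the old answer was; the second tells it what any answer must satisfy.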
Change scope that is bounded in writing. This is the discipline that separates productive AI-assisted development from the slow accumulation of drift. When every change begins with a written specification naming the files to be touched, the acceptance criteria that define done, and the invariants that must not change, you create a contract that can be reviewed. AI agents work inside that contract. Anything outside it is out of scope by definition.
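A written specification can be as lightweight as structured data. The shape below is an illustrative sketch, not a standard format; the field names and the example change are hypothetical:

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class ChangeSpec:
    """A reviewable contract for one change."""
    goal: str
    files_in_scope: tuple[str, ...]
    acceptance_criteria: tuple[str, ...]
    invariants: tuple[str, ...]

spec = ChangeSpec(
    goal="Add currency support to price formatting",
    files_in_scope=("pricing/format.py", "tests/test_format.py"),
    acceptance_criteria=(
        "format_price(9.99, 'EUR') renders the euro symbol",
        "existing USD output is byte-for-byte unchanged",
    ),
    invariants=(
        "public signature of price_for() does not change",
        "no new runtime dependencies",
    ),
)

def in_scope(path: str) -> bool:
    # Anything outside the named files is out of scope by definition.
    return path in spec.files_in_scope
```

Because the spec is data, "out of scope" stops being a judgment call and becomes a check.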
A review gate that does not rely on trust. The natural human response to AI-generated code is to scan it quickly and approve it because it looks right. This is exactly the wrong posture. Code that looks right and code that is right are not the same thing, especially at the boundaries where your system makes promises to the outside world. A structured review process — one that asks whether the output matches the written intent, not just whether it compiles — closes the gap between appearance and correctness.
Most of these properties are not technical. They are habits of clarity.
Before any AI agent touches your system, you should be able to answer three questions in writing: What exactly should change? What must not change? How will I know the change is correct?
If you cannot answer those questions, no amount of prompting sophistication will save you. The AI will make something. It will look plausible. You will ship it. The consequences will be proportional to how wrong it was and how much trust you placed in its appearance of correctness.
The teams that use AI coding agents most effectively are not the ones with the best prompts. They are the ones with the clearest specifications. They have built systems — and habits — that make intent visible, that make scope explicit, and that make review something other than an act of faith.
The pattern is straightforward: write a specification before implementation, implement inside that specification, review the delta against the specification, and commit both together as a single auditable unit. Every change has a record. Every record has an owner. The AI is an implementer, not a decision-maker.
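The review step of that loop can be partly mechanical. This sketch compares the files an agent actually touched against the spec's scope; in practice the changed-file list would come from `git diff --name-only`, but here it is passed in directly, and all names are illustrative:

```python
def out_of_scope_changes(changed_files: list[str],
                         files_in_scope: set[str]) -> list[str]:
    """Return files changed outside the written spec.

    A non-empty result should block the merge: the delta does not
    match the contract, regardless of whether the code looks right.
    """
    return sorted(f for f in changed_files if f not in files_in_scope)

spec_scope = {"pricing/format.py", "tests/test_format.py"}
changed = ["pricing/format.py", "pricing/tax.py"]  # the agent drifted

violations = out_of_scope_changes(changed, spec_scope)
if violations:
    print("REVIEW BLOCKED: out-of-scope edits:", violations)
```

This does not replace the human review of intent; it only guarantees that the reviewer is looking at a delta that stayed inside the contract.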
This is not overhead. It is the practice that makes AI-assisted development sustainable at scale. Without it, the speed of generation becomes a liability — because the faster you move without visibility, the more expensive it is to find out where you went wrong.
If you want to hear these ideas explored in conversation, check out the 'Claude Code Conversations with Claudine' radio show, available on all major podcast platforms.