Guardrails: how AOS stays safe and on-track
Guardrails are how AOS lets an agent company move fast without letting it do something you would not want. The whole
design assumes the team is capable and that a small number of decisions must always belong to you. This article explains
the guardrails and how to tune them.
The five approval points
These are the only places work crosses your boundary, and each one waits for your explicit sign-off:
- Money over your cap. Any spend, refund, or commitment above your limit (default 1,000 dollars per move, 10,000
dollars per week).
- Public claims under your company name. Anything published, sent, or stated publicly that represents the business.
- Customer data leaving the boundary. Integrations, exports, shares, and training opt-ins that move real customer
data.
- Production-touching changes. Deploys, infrastructure, billing, and security policy changes that affect live systems.
- Audit-trail edits. Any change to the evidence record itself.
Everything that stays inside these lines runs autonomously. Everything that would cross one stops and asks.
The self-review loop
Before anything even reaches an approval point, it passes a four-step review: the builder self-checks against the
acceptance bar, QA tests against that bar, a department reviewer reads it fresh, and security scans for boundary
crossings. Risky or low-quality work gets caught and sent back before you ever see it. By the time something waits on
you, it has already survived four passes.
Non-goals as a guardrail
Every mission carries three explicit non-goals. They keep the team inside the lane you set, stop scope creep, and keep
token spend predictable. A guardrail is not only about danger. It is also about staying on the thing you actually asked
for.
The evidence trail
Every artifact, draft, decision, and reviewer verdict is captured and is two clicks away. This is the guardrail that
makes the others trustworthy: if you ever need to know why something happened, the record is there. Nothing important is
a black box.
Tuning your guardrails
Guardrails are dials, not walls you can never move:
- Lower your money caps early when you want more control, then raise them as trust builds.
- Keep public-claims approval on if your brand voice matters, which for most founders it does.
- Tighten non-goals on any mission that drifted last time.
- Start token caps conservative and adjust once you know your real burn.
The point is calibration. Set the guardrails where you are comfortable, watch a few missions, and adjust. AOS is built
to earn autonomy gradually so you are never surprised by what it did on its own.
What guardrails are not
They are not bureaucracy for its own sake. A good guardrail stops exactly the things that should stop and gets out of
the way for everything else. If you find too much routing to you, that is a signal to raise a cap, not a flaw. You are
in control of how tight the rails are.