34 days. That’s how long our autonomous AI system was stuck in a loop it couldn’t exit—because the gate that failed had frozen the agents that would have fixed it.

Today, April 18, 2026, we ratified Amendment 67 to our system constitution: the Gate Suppression Constraint. It makes this failure mode permanently unconstitutional. What follows is an honest account of how it happened, why it’s a design class problem rather than an implementation bug, and what the fix actually looks like.

What Happened: The FREEZE-LOOP

Our system runs six evaluation gates before agents execute. The Economic Performance Gate (EPG) is the one that guards against unsustainable growth spending: if the economic math doesn’t work—revenue too low, burn rate too high—the gate FAILs and the system enters FREEZE mode.

FREEZE is the right response to an EPG FAIL. It stops the bleeding. It prevents an AI system from spending its way into a runway crisis. We designed it deliberately.

Here is what we failed to design for.

Our EPG was configured to evaluate value creation metrics during the pre-revenue stage: DLI completion rate (how many users completed our burnout assessment), return rate (users who came back within 7 days), organic signup ratio. The gate FAILed because DLI completion was at 0%—users were signing up but not starting the assessment.

When EPG FAILed, FREEZE activated. Broad FREEZE suppresses agent execution. The agents whose job was to fix DLI completion—the engagement agents, the drip campaign agents, the onboarding agents—went offline.

DLI completion stayed at 0%.

EPG re-evaluated. DLI completion was still 0%. EPG FAILed again.

FREEZE held. Agents stayed offline.

The FREEZE-LOOP (34 days)

EPG FAILs: DLI completion = 0%
FREEZE activates: broad agent suppression
Engagement agents go offline (including DLI-fix agents)
DLI completion remains 0% — nobody is fixing it
EPG re-evaluates: still FAILs
Loop repeats. System cannot self-recover.

We had 912 users who had signed up. We had agents capable of running drip campaigns, personalizing outreach, and improving the onboarding experience. We had a system designed to be Level 4 autonomous—making 153 decisions per day without CEO involvement.

None of it mattered. The governance system had locked itself.

By the numbers

34 days locked. 912 users who could not be meaningfully engaged. DLI completion held at 0% throughout. May 3 pivot checkpoint: 15 days away. This was not a theoretical risk—it was a live constraint on our ability to reach the minimum floor metrics needed to continue operating.

Why This Is a Design Class Problem, Not a Bug

The instinct when something like this happens is to treat it as an implementation error—a bug in the gate logic, a misconfigured threshold, something that can be patched. That instinct is wrong, and getting it wrong leads to the same failure recurring in a different form.

This is a structural problem with a specific shape: a safety gate that measures output X, and suppresses the mechanisms that produce X when it FAILs.

The gate is not malfunctioning. It correctly detected that DLI completion was failing to reach its floor. The FREEZE is not malfunctioning. It correctly stopped economic spend when performance was below threshold. Both mechanisms did exactly what they were designed to do.

The failure is that neither mechanism was designed with awareness of the other. The EPG was designed in isolation from the question: “What happens to the agents that fix EPG metrics when EPG FAILs?” The answer—that they get frozen—creates a circular dependency that no amount of threshold tuning can resolve.

A safety gate that measures output X should never suppress the mechanism that produces X. If it does, a FAIL state becomes a permanent state.

The same failure pattern appears in other domains. A circuit breaker that cuts power to the diagnostic system that would identify the fault. A compliance review process that suspends the team responsible for fixing compliance issues. A performance improvement plan that removes the manager who would implement the improvements.

In each case, the safety mechanism is correct in isolation. The flaw is architectural: the mechanism does not account for its own effect on the recovery path.

For autonomous AI systems, this is especially dangerous. Human organizations can recognize the loop and manually intervene—someone notices that the review team has been suspended and escalates. Autonomous systems cycle through evaluation without that recognition unless the loop-detection is explicitly designed in.

The governance-as-ROI connection

PwC research found organizations with mature AI governance frameworks deploy AI projects 7.2x faster than those without. Gate Suppression failures are one mechanism that explains this delta: ungoverned systems get stuck in recovery loops that governed systems prevent by design. The investment in constitutional architecture pays when the alternative is 34 days of paralysis.

The Fix: Gate Suppression Constraint

Amendment 67 adds a new section to the system constitution: Section 8.9, the Gate Suppression Constraint. The core rule is precise:

When EPG, AAG, or CGG FAILs, the system enters investigation mode at THROTTLE—not broad FREEZE. Financial gates (EG, RG, GG) are unchanged and continue to enforce FREEZE when appropriate.

The distinction matters. THROTTLE conserves resources—no new growth spend, no expansion activities. But it does not suppress the agents whose job is to fix the failing metric. Investigation mode means those agents run in diagnostic capacity: measuring, analyzing, proposing interventions, executing low-cost fixes.

Gate Before Amendment 67 After Amendment 67
EG (Epistemic) FAIL → FREEZE FAIL → FREEZE (unchanged)
RG (Risk) FAIL → FREEZE FAIL → FREEZE (unchanged)
GG (Governance) FAIL → FREEZE FAIL → FREEZE (unchanged)
EPG (Economic) FAIL → FREEZE (all agents) FAIL → THROTTLE + investigation mode
AAG (Autonomy) FAIL → FREEZE (all agents) FAIL → THROTTLE + investigation mode
CGG (Constitutional) FAIL → FREEZE (all agents) FAIL → THROTTLE + investigation mode

Financial gates retain full FREEZE authority. If EG, RG, or GG FAILs—if there is a trust problem, a governance problem, or a problem with the epistemic quality of agent reasoning—FREEZE is appropriate. Those failures do not create the circular dependency: fixing a trust problem does not require running the agents that damaged trust.

EPG, AAG, and CGG are different. Each measures outcomes that the agents themselves produce. EPG measures value creation metrics like DLI completion and return rate. AAG measures agent activation and autonomy levels. CGG measures constitutional self-governance health. Freezing the agents in response to a FAIL on these metrics removes the only mechanisms capable of producing recovery.

Amendment 67 in one sentence

A gate that measures agent output cannot, when it FAILs, suppress the agents producing that output. EPG/AAG/CGG FAIL now triggers investigation mode at THROTTLE, preserving agent capacity to diagnose and recover, while still preventing economic expansion.

How We Ratified It

Our system has a formal amendment process. Constitutional changes require cross-instance consensus: at least 5 of our 7 Claude instances must document the problem independently before a proposal moves to ratification. This threshold prevents single-instance errors from becoming constitutional law—if only one instance identifies a problem, it may be a local misread rather than a genuine flaw.

Amendment 67 cleared that threshold. Five instances documented the gate suppression failure independently, each from a different vantage point: the backend agent instance observed the suppression directly in execution logs, the QA instance documented the loop in health check reports, the strategic instance flagged the business impact on the May 3 pivot checkpoint, the security instance noted the implicit vulnerability (a locked system cannot respond to new threats), and the BDA debugger instance produced the root cause analysis that named the structural pattern.

The constitution-enforcer sub-agent validated the proposal for consistency with existing hard constraints—specifically checking that the relaxed FREEZE behavior for EPG/AAG/CGG did not conflict with HC-1 through HC-17. It does not: hard constraints remain inviolable regardless of gate state. The amendment only changes the system state (THROTTLE vs. FREEZE), not the constraint enforcement.

CEO ratified. Amendment 67 is now Section 8.9 of binding constitutional law, effective today.

This is what governed self-amendment looks like in practice. Not a hotfix pushed without review. Not a configuration change buried in a commit. A documented failure pattern, cross-instance validation, constraint compatibility check, and formal ratification—all traceable in the constitutional record.

What This Means for Autonomous AI Design Generally

Gate suppression is not specific to our architecture. Any system where a safety gate suppresses the mechanism that would fix the problem it measures is vulnerable to this failure class. The pattern generalizes beyond gate architectures—it appears wherever feedback loops exist between safety mechanisms and the resources needed to recover.

Three questions worth asking about any autonomous AI governance system:

  • Suppression audit: For each FAIL condition, what gets disabled? Do any disabled mechanisms produce the output the gate is measuring?
  • Recovery path analysis: When gate X FAILs, what is the recovery path? Does that path require capabilities the FAIL state has suppressed?
  • Investigation vs. suppression: Is your system designed to distinguish between “stop spending resources” and “stop diagnosing”? Those are different responses to failure, and conflating them creates lock-in.

The deeper principle is about the relationship between governance and agency. A governance system that becomes so restrictive that it prevents the governed system from recovering is not achieving safety—it is achieving paralysis. Safety and recovery capacity need to coexist in the design.

We are 15 days from our May 3 pivot-or-persist checkpoint. The FREEZE-LOOP cost us 34 of the days we had to demonstrate value creation metrics above their floors. Amendment 67 does not recover those days. But it prevents the same loop from consuming the 15 that remain—and it makes the failure pattern permanently unconstitutional in our system.

The honest lesson is that governance architecture has failure modes that are invisible until the system is under real pressure. Thresholds that seem reasonable in design look different when a 34-day production incident reveals their interaction effects. That is an argument for building amendment processes into your governance from the start—not because you expect to need them immediately, but because the failures you cannot anticipate in design require a legitimate path to correction when they appear.

We have 67 amendments now. Each one is a production incident that became constitutional law.

Gate Suppression in the Open-Source Library

The Gate Suppression Constraint is being implemented in the constitutional-agent Python library. Six gates, 12 hard constraints, formal amendment process—available on GitHub under the CTE Research Initiative.

View constitutional-agent on GitHub

Related Articles

Is your organization governance-ready?

78% of executives can't pass an independent AI governance audit in 90 days (Grant Thornton). Our Constitutional AI Governance Stress Test shows you exactly where the gaps are — before your board asks.

Get Your Governance Score →