# Agent Containment Engine: A Kill-Switch for Rogue Agents — Mnemom Research

```json
{"@context":"https://schema.org","@type":"Article","headline":"Agent Containment Engine: A Kill-Switch for Rogue Agents","name":"Agent Containment Engine: A Kill-Switch for Rogue Agents","description":"Enterprise teams can now pause, kill, and resume agents in real-time. Gateway-enforced containment with auto-containment policies, webhook events, audit trails, and role-based access control.","url":"https://www.mnemom.ai/blog/mnemom-research/agent-containment-engine","inLanguage":"en-US","datePublished":"2026-02-19","dateModified":"2026-02-19","author":{"@type":"Organization","name":"Mnemom Research","url":"https://www.mnemom.ai/blog/mnemom-research"},"image":"https://www.mnemom.ai/api/og-image?type=blog&eyebrow=DISPATCHES&chip=Mnemom+Research+%C2%B7+5+min&author=Mnemom+Research&title=Agent+Containment+Engine%3A+A+Kill-Switch+for+Rogue+Agents&subtitle=Enterprise+teams+can+now+pause%2C+kill%2C+and+resume+agents+in+real-time.+Gateway-enforced+containment+with+auto-containment+policies%2C+webhook+events%2C+audit+trails%2C+and+role-based+access+control.","publisher":{"@id":"https://www.mnemom.ai#organization"},"keywords":["containment","enterprise","security","aip"]}
```

```json
{"@context":"https://schema.org","@type":"BreadcrumbList","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https://www.mnemom.ai/"},{"@type":"ListItem","position":2,"name":"Dispatches","item":"https://www.mnemom.ai/blog"},{"@type":"ListItem","position":3,"name":"Mnemom Research","item":"https://www.mnemom.ai/blog/mnemom-research"},{"@type":"ListItem","position":4,"name":"Agent Containment Engine: A Kill-Switch for Rogue Agents","item":"https://www.mnemom.ai/blog/mnemom-research/agent-containment-engine"}]}
```

[← Mnemom Research](/blog/mnemom-research)

# Agent Containment Engine: A Kill-Switch for Rogue Agents

![Mnemom Research](/images/mnemom_hero.webp)

Mnemom Research

February 19, 2026

This post was published before the May 2026 vocabulary alignment (ADR-042). Where you see "Smoltbot," read "Mnemom." Where you see "XFD," read "Safe House." The technical claims still hold; the names have evolved as the system matured.

_Mnemom Research | February 2026_

* * *

Integrity monitoring tells you when an agent goes wrong. The containment engine lets you do something about it.

Today we ship the Agent Containment Engine -- real-time pause, kill, and resume controls for agents running through the Mnemom gateway. When an agent crosses a boundary, you can stop it in milliseconds. When the issue is resolved, you can bring it back. Every action is audited, role-gated, and surfaced through webhooks for SIEM integration.

This is the operational counterpart to the integrity checking and cryptographic proof layers we shipped earlier this month. Those systems answer "is this agent behaving?" The containment engine answers "what do I do when it isn't?"

## The Problem

Monitoring without intervention is a spectator sport. You can detect that an agent has violated a boundary, log the violation, alert the team -- and then watch it continue operating while someone figures out what to do. The gap between detection and response is where damage happens.

For enterprise teams running agent fleets, this gap is a compliance risk. The EU AI Act requires mechanisms for human oversight of high-risk AI systems. Logging that a violation occurred is necessary but not sufficient -- you need the ability to intervene. Regulators will ask not just "did you detect the problem?" but "what did you do about it, and how fast?"

## How It Works

The containment engine adds a three-state lifecycle to every agent: **active**, **paused**, and **killed**.

**Active** is the default. The agent operates normally through the gateway. Integrity checks run. Verdicts flow. Everything works as before.

**Paused** is a reversible suspension. The gateway immediately begins returning HTTP 403 responses with a structured `containment_error` body to any request routed through the contained agent. The agent's API keys still authenticate -- it's the agent's operational status that's suspended, not its identity. Pausing is instant: the next request after containment is enforced.

**Killed** is a hard stop. Same gateway enforcement as paused, but resumption requires explicit reactivation by an org owner. Kill is for situations where you don't just want to pause operations -- you want to ensure the agent cannot resume without a deliberate decision by someone with authority.

### The API

Four endpoints under each organization's agent routes:

```
POST /v1/orgs/:org_id/agents/:agent_id/pause
POST /v1/orgs/:org_id/agents/:agent_id/resume
POST /v1/orgs/:org_id/agents/:agent_id/kill
POST /v1/orgs/:org_id/agents/:agent_id/reactivate
```

Each accepts an optional `reason` field (up to 500 characters) that gets recorded in the audit log. The response includes the agent's new `containment_status`, the timestamp, the actor who performed the action, and the reason.

A read endpoint returns the current containment state plus the most recent audit log entries:

```
GET /v1/orgs/:org_id/agents/:agent_id/containment
```

### Gateway Enforcement

When a contained agent (paused or killed) makes a request through the gateway, it receives:

```json
{
  "error": "agent_contained",
  "containment_status": "paused",
  "contained_at": "2026-02-19T14:30:00Z",
  "message": "This agent has been paused. Contact your organization admin."
}
```

Enforcement happens at the gateway layer, before the request reaches any upstream provider. There is no window between containment and enforcement -- the `get_quota_context_for_agent` function that the gateway calls on every request now includes `containment_status` in its return value. If the status is anything other than `active`, the request is rejected.

### Role-Based Access

Not everyone should be able to kill an agent. The containment engine uses the existing org RBAC system:

-   **Pause / Resume**: Available to org `owner` and `admin` roles
-   **Kill / Reactivate**: Restricted to org `owner` only
-   **View containment status**: Available to all org roles including `viewer` and `auditor`

This separation matters. An admin responding to an alert can pause an agent immediately. But permanently killing and reactivating an agent -- which may have downstream dependencies, SLAs, or contractual implications -- requires owner authority.

### Auto-Containment

Manual intervention requires someone to be watching. Auto-containment doesn't.

Each agent can be configured with an `auto_containment_threshold` -- an integer >= 2 that specifies how many consecutive boundary violations trigger an automatic pause. When the threshold is hit, the system pauses the agent with `action: "auto_pause"` and the reason includes the violation count.

```
PUT /v1/orgs/:org_id/agents/:agent_id/containment-policy
{ "auto_containment_threshold": 3 }
```

Set to `null` to disable. The threshold is intentionally set at a minimum of 2 -- a single boundary violation could be a false positive from the analysis model. Two or more consecutive violations represent a pattern.

### Webhook Events

Every containment action fires a webhook event:

-   `agent.paused` -- Agent was paused (manual or auto)
-   `agent.resumed` -- Agent was resumed from paused state
-   `agent.killed` -- Agent was killed
-   `agent.reactivated` -- Agent was reactivated from killed state (owner only)

Events include the actor, reason, previous status, new status, and timestamp. Route these to your SIEM, PagerDuty, Slack, or incident management system.

### Audit Log

Every containment action is recorded in a dedicated audit table:

-   **Action**: pause, resume, kill, reactivate, auto\_pause
-   **Actor**: The user ID of whoever performed the action (or "system" for auto-containment)
-   **Reason**: Free-text explanation
-   **Previous / new status**: The state transition
-   **Timestamp**: When it happened

The audit log is append-only and immutable. It's queryable through the containment status endpoint, which returns the most recent 50 entries by default.

## The Dashboard

The fleet dashboard now includes containment controls alongside integrity scores. Pause and resume are one click. Kill requires confirmation. The containment status badge (green/amber/red) is visible in the agent list view so operators can see fleet-wide containment state at a glance.

## Availability

The containment engine is available on **Team** and **Enterprise** plans. It requires the organization feature set -- containment operates at the org level because agent governance is inherently a team concern.

-   **Team**: Full containment (pause/resume/kill/reactivate), auto-containment policies, containment webhooks, audit log
-   **Enterprise**: Same, plus custom containment policies and SLA guarantees on enforcement latency

## Get Started

If you're on a Team or Enterprise plan, the containment API is live now. No migration needed -- the `containment_status` field defaults to `active` for all existing agents.

-   **API docs**: [docs.mnemom.ai](https://docs.mnemom.ai) -- full endpoint reference for containment operations
-   **Dashboard**: The fleet dashboard at [mnemom.ai/dashboard](https://mnemom.ai/dashboard) includes containment controls
-   **Changelog**: [docs.mnemom.ai/changelog](https://docs.mnemom.ai/changelog) -- technical details of the release

Integrity monitoring without containment is a warning light with no brakes. Now you have both.

* * *

_Mnemom builds alignment and integrity infrastructure for autonomous agents. The containment engine is part of the Smoltbot gateway, available on Team and Enterprise plans._

_GitHub: [github.com/mnemom](https://github.com/mnemom)_

#containment#enterprise#security#aip

### Stay in the loop

New dispatches and product updates, no spam.

Company website (leave blank)

Subscribe

### Ready to verify your agents?

See it liveView plansTalk to us

[![Mnemom Research](/images/mnemom_hero.webp)

Mnemom Research

All posts →


](/blog/mnemom-research)

---
_Source: /blog/mnemom-research/agent-containment-engine/index.html · Generated by build-markdown-mirrors.mjs · For agent-readability commitment #4 see https://www.mnemom.ai/for-agents_