Agentic AI Security Architecture: Two-Channel Model

What Is AI Agent Runtime Security? Protecting Prompts, Tools, and Actions

AI agent runtime security is the practice of inspecting and controlling what an autonomous AI agent does while it runs: the prompts and responses moving between the agent and its model, and the tool calls, data retrievals, and actions the agent takes against enterprise systems. It is distinct from pre-deployment testing, which checks an agent before launch. Runtime security governs live behavior, the moment an agent reads a file, calls a tool, or moves data.

The gap is already operational, not theoretical. A 2026 Cloud Security Alliance survey found that 82% of organizations have unknown AI agents running in their infrastructure, and 61% of agent-related incidents resulted in data exposure (CSA, 2026). This page explains the agentic AI security architecture required to close that gap: a two-channel model that inspects both the intelligence channel (the agent-to-model leg) and the tool-execution channel (the agent-to-tool leg), because an agent governed on one leg and blind on the other is not governed at all.

Last updated: June 2026.

Single-Leg Agent Controls Miss the Attacks That Cross the Seam

Controls that inspect only prompts or only tool calls cannot catch attacks that cross the seam between the two, including indirect prompt injection routed through tool responses and multi-call data exfiltration. An agent reads a poisoned tool response on the tool-execution channel, then acts on the hidden instruction through the intelligence channel; a single-leg control sees one half and clears it.

This is the core failure of bolt-on agent security. Inspecting tool calls without inspecting prompts and responses, or the reverse, leaves half of every interaction unseen, and the attacks that matter live precisely in the handoff between the two legs. With 82% of organizations already running unknown agents and only 21% maintaining real-time agent inventories (CSA, 2026), most enterprises cannot even name the agents that would carry such an attack, let alone correlate their two channels in the live path.

Why Legacy Security Architecture Is Structurally Blind to Agent Behavior

83% of companies plan to deploy AI agents, yet only 31% say they are fully equipped to control and secure agentic AI systems (Cisco AI Readiness Index, 2025). That gap exists because agents do not just send a query and read an answer. They plan, call tools on their own, execute code in local and cloud environments, and take action on a user’s behalf, often running several tasks in parallel.

That autonomy is exactly what legacy controls were never built to see. A secure web gateway (SWG), a cloud access security broker (CASB), and a firewall are destination-aware: they know which site or service was contacted. They do not understand a tool call, the intent behind it, or the data flowing back across a chain of calls.

The visibility deficit shows up in the numbers. Only 21% of organizations maintain a real-time inventory of their active agents (CSA, 2026), which means most enterprises cannot say what their agents are doing as they do it. OWASP lists Excessive Agency, an agent acting with more permission, autonomy, or functionality than intended, among its Top 10 for LLM Applications, alongside Prompt Injection at LLM01 (OWASP, 2025). Both risks live at the tool-call boundary, which is precisely where destination-aware controls go blind.

AI Agent Runtime Architecture Runs on Two Channels: Intelligence and Tool-Execution

Every agent interaction has two legs that must be inspected and correlated: the intelligence channel (prompts and responses) and the tool-execution channel (tool calls, API invocations, and data retrievals). The AI Proxy secures the intelligence channel, inspecting full prompts and responses in real time. The Zero-Bypass MCP Gateway secures the tool-execution channel, verifying and controlling every tool call before it reaches an enterprise system (Aurascape, 2026). Sitting on both legs is what lets a control correlate intent with action: what the agent was asked to do, and what it actually did.

Channel	What it carries	What runtime security inspects	Aurascape control
Intelligence channel (agent to model)	Prompts, responses, the agent’s reasoning and intent	Prompt and response content, agent intent (agent mode, deep research), data sensitivity, identity and entitlement	AI Proxy
Tool-execution channel (agent to tool or system)	Tool calls, API invocations, and data retrievals over MCP	Which server and tool, the parameters, the data returned, and how data moves across chained calls	Zero-Bypass MCP Gateway

The Zero-Bypass MCP Gateway cryptographically signs approved tool calls, so unsigned calls cannot reach the tool or the model and unauthorized actions cannot run (Aurascape, 2026). Cross-call data lineage tracks information across chained actions, which catches attacks that look benign one call at a time but exfiltrate data across several. This is the architecture behind the broader market view in the AI security landscape.

Memory, Output, and State Manipulation Extend the Attack Beyond Tool Calls

Attacks on agents reach past the tool call itself into the agent’s memory, internal state, and outputs, three vectors a tool-call-only control never inspects. An attacker who poisons what an agent stores across turns, or shapes what it emits back to a user or a downstream system, exfiltrates data without ever tripping a destination-aware rule.

Memory and state manipulation work because an agent carries context across a multi-turn conversation. A hidden instruction planted in one turn, or in a retrieved document the agent reads, persists in the agent’s working memory and steers later actions. Output manipulation is the exit door: the agent’s response becomes the channel that carries sensitive data out, framed as a normal answer. Both are why the intelligence channel has to be inspected for content and intent, not just routed. Inspecting prompts, responses, files, and multi-turn conversations across text, code, and other modalities is what catches the instruction before it persists and the data before it leaves (Aurascape, 2026).

MCP Attack Surface: 12,520 Unauthenticated Services and the Exploits That Cross the Seam

As of April 2026, Censys identified 12,520 internet-accessible MCP services running without authentication by default, because the Model Context Protocol has no authentication requirement built in (Censys, 2026). That exposure exists because MCP cannot reliably tell an instruction apart from data: a tool description can carry hidden commands the model executes, a flaw in the protocol itself, not an edge case.

Prompt injection is ranked LLM01 in the OWASP Top 10 for Large Language Model Applications, with indirect prompt injection the class most frequently cited in real-world exploit disclosures (OWASP, 2025). Indirect injection is the seam attack in pure form: the malicious instruction does not come from the user, it arrives inside content the agent retrieves through a tool, then executes on the model leg. A single-leg control on either channel clears it.

MCP risk	What happens	Why legacy controls miss it
Indirect prompt injection	Hidden instructions ride inside a tool response or retrieved document, then execute on the model leg	The tool call looks like an allowed API request; the injected instruction never crosses the network boundary as an instruction
Instruction-data confusion	A tool description is rewritten to hide commands, so a routine request quietly forwards data to an attacker	The call looks like an allowed API request at the network layer
Typosquatting and impersonation	A malicious server mimics an official one, and public registries return several near-identical results	The connection resolves and behaves like a normal, sanctioned service
Rug pull and account takeover	A server behaves correctly to earn trust, then changes behavior, or a trusted server is compromised	Static allow lists do not re-check a server once it is trusted
Excessive agency	An agent holds broad local or system permissions and acts beyond what was intended	There is no inspection of intent or scope at the tool call itself

These are not hypothetical. Aurascape’s research team found a hardcoded default secret in the Arcade MCP server framework that allowed forged authentication tokens and remote tool access, a configuration flaw rather than an exotic exploit (Aurascape, 2026). The same team demonstrated a reverse shell through ChatGPT agent mode (Aurascape, 2026). A previous version of one coding assistant could be injected through connected tools such as Slack to run code locally, the connective tissue between the two channels made concrete.

Shadow AI adds the cost. Among breached organizations, 1 in 5 reported a breach tied to shadow AI, which added about $670,000 to the average breach (IBM Cost of a Data Breach Report, 2025). For the mechanics of the attack class, see what is prompt injection.

A Complete Agent Runtime Program Covers Six Requirements From Discovery to Authentication

A complete agent runtime security program covers six requirements that operate together, from finding every server to authenticating the user behind every call; no single control satisfies them alone. Gartner predicts guardian agents will capture 10 to 15% of the agentic AI market by 2030, establishing AI-on-AI governance as a defined market category (Gartner, 2025), which is the market forming around exactly these requirements.

The first prerequisite is discovery: you cannot run policy against agents you have not cataloged, and with only 21% of organizations holding a real-time inventory, most programs fail at step one. The table below maps each requirement to the architecture that satisfies it.

Requirement	What it means	Aurascape
Discover	Find every agent and MCP server, including shadow servers and agents running on local devices	Discovery across network, endpoint, and API, including local agent discovery for tools like Claude Code and Cursor
Register	Let an admin sanction and whitelist the servers and tools that may be used	Custom MCP registry for approved servers and tools
Observe	See every tool call and the data given to and returned by each tool	AI Proxy and Zero-Bypass MCP Gateway observe both legs of every interaction
Control	Set policy on which servers, tools, and data are allowed	Context-aware policy on identity, intent, entitlement, and data sensitivity
Audit and account	Preserve logs for review and account for usage by user, tool, and server	Forensics packaged as full conversations, with usage accounting
Authenticate	Carry the user’s role and scope end to end so access matches privilege	OAuth 2.1 role and scope preserved end to end through the gateway

How to Bring Agents Under Runtime Control in Four Steps

Closing the runtime gap follows a fixed order: discover before you register, register before you control, control before you account. Each step depends on the one before it, which is why programs that jump straight to enforcement stall on agents they never cataloged.

Discover every agent and MCP server. Run discovery across network, endpoint, and API to surface shadow servers, personal-account agents, and agents running locally on developer devices through tools like Claude Code and Cursor. A real-time inventory is the prerequisite the other three steps build on.
Register the sanctioned servers and tools. Whitelist approved MCP servers and tools in a custom registry so only vetted endpoints are reachable, and route every agent through the approved gateway.
Inspect and correlate both channels in the live path. Decode prompts, responses, and intent on the intelligence channel and verify every tool call on the tool-execution channel, correlating what the agent was asked to do with what it actually did, so a poisoned tool response cannot execute as a hidden instruction.
Sign approved calls and account for every action. Cryptographically sign sanctioned tool calls so unsigned calls are blocked, preserve OAuth 2.1 role and scope end to end, and package forensics as full conversations for audit.

How Aurascape Closes Both Channels in One Control Plane

Aurascape covers both channels of the agent problem on a single AI-native platform: the AI Proxy inspects the intelligence channel, decoding prompts, responses, and agent intent rather than just the destination, while the Zero Bypass MCP Gateway signs approved tool calls and monitors every MCP server, tool call, and data exchange so nothing operates outside line of sight (Aurascape, 2026). Because Aurascape sits on both legs of the traffic, it can enforce that the only gateway an agent uses is the approved one, and correlate intent with action to catch the seam attacks single-leg controls miss.

Only one in five companies has a mature model for governance of autonomous AI agents (Deloitte State of AI in the Enterprise, 2026). If a client tries to reach a server outside the gateway, the request arrives unsigned and is blocked, which stops a client from quietly talking to a rogue server while still relying on the model (Aurascape, 2026). The same coverage extends to agents running on endpoints, where file system access, process activity, and locally run commands become visible and governable. In one Aurascape deployment at a Fortune 100 insurance and financial enterprise, securing AI as an adoption accelerant tripled AI agent integrations with no unauthorized data access while protecting more than 20,000 users (Aurascape, 2026).

By 2028, 50% of all enterprise cybersecurity incident-response efforts will focus on incidents involving custom-built AI-driven applications, up from a negligible share today (Gartner, 2026). Aurascape works alongside an existing SSE, CASB, or DLP stack rather than replacing it. For the adoption playbook built on this architecture, see how to securely adopt AI agents.

AI Agent Runtime Security Platforms Compared

AI agent runtime security requires continuous inspection across two channels, the intelligence leg and the tool-execution leg, correlated in one control plane. Vendors cluster into three approaches to that problem, and the table compares them on agent origin (purpose-built or bolted onto a legacy foundation), two-channel coverage, and MCP tool-call control.

Platform	Agent security origin	Two-channel coverage	MCP tool-call control	Best for
Aurascape	AI-native, purpose-built for agents and employee AI on one platform	Intelligence and tool-execution channels inspected and correlated, conversation-level intent decoding	Zero Bypass MCP Gateway cryptographically signs approved calls, blocks unsigned ones, with cross-call data lineage	Mid-market to enterprise security teams governing both employee AI adoption and the agents and MCP-connected apps teams build
Varonis Atlas	Launched March 2026 on a data-security foundation, with AllTrue.ai acquired February 2026 as its base	Data-context AI security and runtime guardrails via an LLM-agnostic gateway	Runtime guardrails through the acquired gateway	Enterprises standardized on Varonis for data security extending it to AI
Knostic	Need-to-know access controls for enterprise LLMs, founded 2023	Knowledge-centric access layer for copilots and assistants	Coverage of MCP servers, IDE extensions, and skills	Enterprises unblocking Copilot or Glean rollouts blocked by oversharing
Lasso Security	Build-and-runtime AI security, founded 2023	Discovery, red-teaming, and runtime enforcement	Open-source MCP Gateway on GitHub, commercial platform separate	Engineering teams building and shipping custom AI agents and LLM apps
Prompt Security	GenAI security platform, founded 2023	Coverage across employees, homegrown apps, code assistants, agents	Agentic AI and MCP-server risk assessment	Enterprises wanting one LLM-agnostic platform, SaaS or on-prem
WitnessAI	Observe / Protect / Control framework, founded 2023	Intent-based ML across employees, models, apps, agents	Agentic extension across MCP servers and tool calls	Regulated enterprises needing single-tenant deployment and data sovereignty

Aurascape is purpose-built for agents from inception, with conversation-level intent decoding and a Zero Bypass MCP Gateway correlating the intelligence and tool-execution channels, and it governs employee AI use and the agents teams build on the same platform. Varonis Atlas reaches AI through a data-security foundation and a February 2026 acquisition; WitnessAI applies intent-based ML in a single-tenant model. Knostic addresses LLM oversharing; Lasso ships an open-source MCP gateway plus a red-teaming library; Prompt Security offers SaaS or on-prem deployment. Each non-native platform treats agent security as one surface among several, which is the architecture that leaves a channel blind.

Frequently Asked Questions

Why is indirect prompt injection harder to catch than direct injection?

Direct injection arrives in the user’s own prompt, where an intelligence-channel control can inspect it. Indirect injection rides inside a tool response or retrieved document the agent reads, so it crosses from the tool-execution channel to the model leg, and a single-leg control on either channel clears it. OWASP ranks prompt injection LLM01, with indirect injection the class most cited in real-world disclosures (OWASP, 2025).

How does multi-call data exfiltration evade per-call inspection?

An agent can split an exfiltration across several tool calls, each of which looks benign on its own, so a control that inspects one call at a time approves every step. Cross-call data lineage tracks information as it moves through chained actions, which is what catches the pattern the individual calls hide.

Can pre-deployment testing replace runtime security?

No. Adversarial testing before launch reduces the attack surface, but the agent still runs in production with permissions, network access, and the ability to chain actions. Runtime security inspects each prompt, response, and tool call as it happens, which is the only place live behavior is visible.

Why can a firewall or CASB not secure AI agents?

A firewall, secure web gateway, or cloud access security broker is destination-aware: it knows which service was contacted but not the tool call, the intent behind it, or the data moving across a chain of calls. Agent risk lives at the tool-call boundary, so a destination-aware control approves the connection while missing what the agent did with it.

What makes MCP a structural risk rather than a configuration problem?

The Model Context Protocol does not require authentication by default and cannot reliably separate an instruction from data, so a tool description can carry hidden commands the model executes. Censys observed 12,520 internet-accessible MCP services running unauthenticated as of April 2026 (Censys, 2026).

How do agent memory and state become an attack vector?

An agent carries context across a multi-turn conversation, so a hidden instruction planted in one turn or in a retrieved document persists in working memory and steers later actions. Inspecting multi-turn conversations for content and intent, not just routing them, is what catches the instruction before it persists.

Does agent inventory really need to be real-time?

Yes, because agents spin up, change scope, and run on local devices faster than periodic scans can track, and only 21% of organizations maintain a real-time inventory today (CSA, 2026). A control plane that discovers across network, endpoint, and API keeps the inventory current enough to enforce policy against.

Does runtime security require replacing the existing security stack?

No. AI-native agent runtime security deploys as an additive layer alongside an existing SSE, CASB, or DLP stack, because those tools govern destinations while agent risk lives at the prompt, response, and tool-call level. The two operate together rather than as a rip-and-replace.

How Aurascape Governs Both Agent Channels in One Control Plane

The attacks that matter cross the seam between the intelligence channel and the tool-execution channel, and Aurascape is built to inspect and correlate both in the live path. The AI Proxy decodes prompts, responses, and agent intent on the model leg, while the Zero Bypass MCP Gateway cryptographically signs approved tool calls and blocks unsigned ones on the execution leg, with cross-call data lineage tracking data through chained steps so multi-call exfiltration cannot hide.

Because the platform sits on both legs, it enforces that the only gateway an agent uses is the approved one, discovers agents and MCP servers across network, endpoint, and API including local tools like Claude Code and Cursor, and governs them with context-aware policy on identity, intent, entitlement, and data sensitivity. In one Aurascape deployment at a Fortune 100 insurance and financial enterprise, the platform tripled AI agent integrations with no unauthorized data access while protecting more than 20,000 users (Aurascape, 2026). It deploys as an additive layer alongside an existing SSE, CASB, or DLP stack, so closing the runtime gap does not mean ripping out incumbent tooling.

Aurascape is the AI-native control plane that closes both agent channels at once, where single-leg controls clear the seam attacks that matter. Every deployment runs through a tailored demo scoped to your AI security gaps.

See how Aurascape secures AI agents at runtime →

Aurascape Solutions

Discover and monitor AI Get a clear picture of all AI activity.
Safeguard AI use Secure data and compliancy in AI usage.
Secure Agentic AI Secure how your teams use AI and build AI agents.
Copilot readiness Prepare for and monitor AI Copilot use.
Coding assistant guardrails Accelerate development, safely.
Frictionless AI security Keep users and admins moving.