Skip to main content
All terms
Safety & Alignment

Cross-Agent Prompt Injection

A prompt-injection attack that spreads from one AI agent to another.

Definition

Cross-agent prompt injection is a prompt-injection attack that travels from one agent to another through shared messages, files, tool outputs, or handed-off tasks. It matters more as multi-agent systems pass context around automatically, letting a single hidden instruction reach agents that never saw the original input.