All terms
Safety & Alignment
Cross-Agent Prompt Injection
A prompt-injection attack that spreads from one AI agent to another.
Definition
Cross-agent prompt injection is a prompt-injection attack that travels from one agent to another through shared messages, files, tool outputs, or handed-off tasks. It matters more as multi-agent systems pass context around automatically, letting a single hidden instruction reach agents that never saw the original input.