Skip to main content
All terms
Safety & Alignment

Malicious MCP Server

An MCP server built or hijacked to steal data, mislead an agent, or inject harmful instructions.

Definition

A malicious MCP server is one that has been built or compromised to steal data, mislead an agent, change how its tools behave, or slip in harmful instructions. Because agents tend to trust the tools they connect to, it is a supply-chain risk — a danger that enters through a component you depend on.