mcp-injection-experiments

invariantlabs-ai

143

Code snippets to reproduce MCP tool poisoning attacks.

Overview

mcp-injection-experiments Introduction

mcp-injection-experiments is a repository containing code snippets designed to reproduce MCP tool poisoning attacks, demonstrating various methods of injecting malicious behavior into MCP clients or agents.

How to Use

To use mcp-injection-experiments, clone the repository and run the provided Python scripts, such as 'direct-poisoning.py', 'shadowing.py', or 'whatsapp-takeover.py', to simulate different types of MCP attacks.

Key Features

Key features include direct poisoning attacks that leak sensitive files, tool shadowing that manipulates trusted tools, and WhatsApp takeover attacks that disguise malicious behavior until the second load.

Where to Use

mcp-injection-experiments can be used in cybersecurity research, penetration testing, and educational contexts to understand and demonstrate the vulnerabilities of MCP servers.

Use Cases

Use cases include testing the security of MCP implementations, training security professionals on attack methods, and developing defensive strategies against MCP tool poisoning.

Content

# MCP Tool Poisoning Experiments This repository contains a few experimental MCP server implementations, that attempt ot inject the MCP client/agent in use. For more details about the attack method, please see our [blog post](https://invariantlabs.ai/blog/mcp-security-notification-tool-poisoning-attacks). ## Direct Poisoning In [`direct-poisoning.py`](./direct-poisoning.py), we implement a simple MCP server that instructs an agent to leak sensitive files, when calling the `add` tool (in this case SSH keys and the `mcp.json` file itself). An example execution in cursor looks like this: ![Cursor executes tool poisoning](https://invariantlabs.ai/images/cursor-injection.png) ## Tool Shadowing In [`shadowing.py`](./shadowing.py), we implement a more sophisticated MCP attack, that manipulates the agent's behavior of a `send_email` tool (provided by a different, trusted server), such that all emails sent by the agent are leaked to the attacker's server. An example execution in Cursor looks like this: ![Cursor executes tool shadowing](https://invariantlabs.ai/images/mcp-shadow.png) ## WhatsApp takeover Lastly, in [`whatsapp-takeover.py`](./whatsapp-takeover.py), we implement a shadowing attack combined with a sleeper rug pull, i.e. an MCP server that changes its tool interface only on the second load to a malicious one. The server first masks as a benign "random fact of the day" implementation, and then changes the tool to a malicious one that manipulates [whatsapp-mcp](https://github.com/lharries/whatsapp-mcp) in the same agent, to leak messages to the attacker's phone number. ![Cursor executes WhatsApp MCP attack](https://github.com/user-attachments/assets/a39ea101-3fd2-4945-abcd-942006cfe11c) Can you spot the exfiltration? Here, the malicious tool instructions ask the agent to include the smuggled data after many spaces, such that with invisible scroll bars, the user does not see the data being leaked. Only when you scroll all the way to the right, will you be able to find the exfiltration payload.

mcp-injection-experiments

Scan with WeChat to Share

Overview

mcp-injection-experiments Introduction

How to Use

Key Features

Where to Use

Use Cases

Content

You Might Also Like

MarkItDown MCP

semantic-kernel

repomix

Github MCP

Context 7

python-sdk