Content
# Image Vision MCP
MCP Server that provides descriptions of images.
## Requirements
### Ollama
You must have [Ollama](https://ollama.com/) set up, and exposing its LLMs via its API / webserver.
By default, the MCP uses the [llava:34b](https://ollama.com/library/llava) LLM, which you need to have installed and on the same computer that Ollama is running on.
You can install the model via:
```
ollama run llava:34b
```
You can also specify a different model installed via Ollama via the **--model** argument in Claude config.
## Installation
In order to install the MCP into Claude, add the following to the **claude_desktop_config.json** file.
```
"image-vision": {
"command": "npx",
"args": [
"-y",
"node",
"C:/Users/FOO/src/image-vision-mcp/src/image-vision-mcp.js",
"--permitted",
"C:/Users/FOO/mcp",
"--host",
"http://192.168.1.238:11434"
]
}
```
### Arguments
* **--permitted** (required) The paths that the MCP is allowed to access. You can include multiple entries.
* **--host** (optional) The host and post for the Ollama server. Defaults to http://127.0.0.1:11434
* **--model** (optional) The model installed into Ollama that should be used for the image vision. Defaults to "llava:34b".
## Usage
To run the server:
```bash
node src/image-vision-mcp.js --permitted /path/to/dir1 /path/to/dir2
```
The `--permitted` flag is used to specify which directories roots the MCP is allowed to access for security reasons.
## Development
You can run in development mode using the [MCP inspector](https://github.com/modelcontextprotocol/typescript-sdk?tab=readme-ov-file):
```
npx @modelcontextprotocol/inspector node src/image-vision-mcp.js --permitted /Users/FOO/Desktop/mcp/
```
## Questions, Feature Requests, Feedback
If you have any questions, feature requests, need help, or just want to chat, join the [discord](https://discord.gg/fgxw9t37D7).
You can also log bugs and feature requests on the [issues page](https://github.com/mikechambers/image-vision-mcp/issues).
## License
Project released under a [MIT License](LICENSE.md).
[](LICENSE.md)
Connection Info
You Might Also Like
markitdown
Python tool for converting files and office documents to Markdown.
Fetch
Retrieve and process content from web pages by converting HTML into markdown format.
oh-my-opencode
Background agents · Curated agents like oracle, librarians, frontend...
chatbox
User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)
continue
Continue is an open-source project for seamless server management.
semantic-kernel
Build and deploy intelligent AI agents with Semantic Kernel's orchestration...