# SuperNova MCP RAG Monorepo
A monorepo demonstrating a Model Context Protocol (MCP) server with Retrieval-Augmented Generation (RAG) for answering questions about imaginary SuperNova documentation.
- The documentation is imaginary product documentation: a collection of HTML files with AI-generated content.
- The documentation is processed into chunks and stored in a vector database.
- The server is built with Node.js and uses free HuggingFace embeddings for semantic search.
## Architecture Overview
```mermaid
flowchart TD
    User[User Question in Cursor] -->|MCP Protocol| MCPServer[MCP RAG Server]
    MCPServer -->|Triggers| RAG[RAG Pipeline]
    RAG -->|Loads & Chunks| Docs[SuperNova HTML Docs]
    RAG -->|Embeds| Embeddings[HuggingFace Embeddings]
    Embeddings -->|Stores| VectorStore[In-Memory Vector Store]
    MCPServer -->|Semantic Search| VectorStore
    VectorStore -->|Relevant Chunks| MCPServer
    MCPServer -->|Answer| User
```
## Monorepo Structure
- `mcp-rag-server/` — MCP server with RAG pipeline (Node.js, TypeScript)
- `monorepo-sample-package/` — Sample package (for monorepo demonstration)
- `docs/` — Dummy HTML documentation for SuperNovaStorybook-Mobile-Swift
## Quick Start
### Prerequisites
- Node.js 18+
- Yarn (for workspace support)
### Install Dependencies
```bash
yarn install
```
### Environment Setup
Create a `.env` file in `mcp-rag-server/`:
```
HUGGINGFACE_API_KEY=your_huggingface_token_here
```
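To fail fast when the key is missing, the server's startup could include a check along these lines (the `requireEnv` helper is hypothetical, shown only to illustrate the idea; it is not part of this repo):

```typescript
// Hypothetical helper (not in this repo): fail fast at startup when a
// required environment variable is missing, instead of failing later on
// the first embedding request.
function requireEnv(
  name: string,
  env: Record<string, string | undefined> = process.env
): string {
  const value = env[name];
  if (!value) {
    throw new Error(`${name} is not set; create mcp-rag-server/.env first`);
  }
  return value;
}

// Usage at server startup:
// const apiKey = requireEnv("HUGGINGFACE_API_KEY");
```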
### Build & Run MCP RAG Server
```bash
# Install dependencies
yarn install
# List workspaces
yarn workspaces info
# Build
yarn workspace mcp-rag-server build
# Start
yarn workspace mcp-rag-server start
```
- For development (hot-reload):
```bash
yarn dev
```
> Note: The server might take a while to prepare the vector store. You can see the progress in the logs.
## How It Works
- **MCP Protocol:** Exposes a tool (`search_docs`) for semantic search over documentation.
- **RAG Pipeline:**
  - Loads and parses all HTML files in `docs/SuperNovaStorybook-Mobile-Swift/`
  - Splits the text into chunks
  - Embeds each chunk using the HuggingFace Inference API
  - Stores the embeddings in an in-memory vector store (LangChain)
  - Answers queries by semantic similarity search
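Conceptually, the similarity-search step works like the sketch below. The real server delegates this to LangChain's in-memory vector store; the `Chunk` type and `search` function here are illustrative only, with made-up embedding vectors:

```typescript
// A chunk of documentation text together with its embedding vector.
type Chunk = { text: string; embedding: number[] };

// Cosine similarity between two vectors of equal length.
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Return the top-k chunks most similar to the embedded query.
function search(queryEmbedding: number[], chunks: Chunk[], k = 3): Chunk[] {
  return [...chunks]
    .sort(
      (x, y) =>
        cosineSimilarity(queryEmbedding, y.embedding) -
        cosineSimilarity(queryEmbedding, x.embedding)
    )
    .slice(0, k);
}
```

In the actual pipeline, the query embedding comes from the same HuggingFace model that embedded the chunks, so query and chunks live in the same vector space.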
## Usage with Cursor
1. Open Cursor
2. Add a new MCP server in Settings → MCP:
   - Type: MCP (Stdio)
   - Command: `node` (from `mcp-rag-server`)
   - Arguments: `path/to/your/mcp-rag-server/dist/index.js`
   - Ensure `.env` is set up with your HuggingFace API key
3. Ask questions about the SuperNova documentation in Cursor chat
### Sample mcp.json
```json
{
  "mcpServers": {
    "mcp-rag-server": {
      "command": "node",
      "args": [
        "/Users/your-username/projects/supernova-mcp-rag/mcp-rag-server/dist/index.js"
      ],
      "disabled": false,
      "autoApprove": []
    }
  }
}
```

## Debugging with MCP Inspector and Simple Browser
The **MCP Inspector** is an interactive developer tool designed to help you test and debug your MCP server in real time.
### How to Use MCP Inspector
1. **Start your MCP server locally.**
2. **Run the Inspector with your server from the root of the monorepo:**
```bash
npx @modelcontextprotocol/inspector node mcp-rag-server/dist/index.js
```
3. **Open the Inspector Web UI:**
The Inspector will print a URL such as: `http://127.0.0.1:6274/`
4. **Open this URL in the VS Code Simple Browser or any web browser:**
- In Cursor / VS Code, open the Command Palette (`Ctrl+Shift+P` or `Cmd+Shift+P`), type `Simple Browser: Show`, and enter the URL.
- Alternatively, open the URL in Chrome, Firefox, or any browser.
5. **Interact with your MCP server:**
- Send test queries.
- Inspect tool calls and responses.
- Debug and verify your MCP server’s behavior live.
### Why Use Simple Browser?
- Some browsers (like Safari) may block HTTP requests due to HTTPS-only mode.
- VS Code’s Simple Browser avoids such restrictions and is convenient for local development.

---
Using the MCP Inspector with the Simple Browser is a powerful way to debug and validate your MCP server before integrating it fully with clients like Cursor.
## Hugging Face API Usage & Limits
- The Hugging Face Inference API has a free tier with request limits (e.g., 300 requests/hour for registered users).
- See [API Pricing & Rate Limits](https://huggingface.co/docs/api-inference/en/pricing) and [Supported Models](https://huggingface.co/docs/api-inference/index) for details.
- If you exceed your quota, you may receive 429 errors or have to wait for your quota to reset.
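To tolerate brief rate limiting, embedding calls could be wrapped in an exponential-backoff retry. The `withRetry` helper below is a sketch, not part of this repo; it assumes the thrown error carries a numeric `status` field:

```typescript
// Sketch: retry an async operation when it fails with HTTP 429 (rate
// limited), doubling the delay on each attempt (exponential backoff).
async function withRetry<T>(
  fn: () => Promise<T>,
  retries = 3,
  baseDelayMs = 1000
): Promise<T> {
  for (let attempt = 0; ; attempt++) {
    try {
      return await fn();
    } catch (err: any) {
      const rateLimited = err?.status === 429;
      if (!rateLimited || attempt >= retries) throw err;
      // Wait 1s, 2s, 4s, ... before retrying.
      await new Promise((r) => setTimeout(r, baseDelayMs * 2 ** attempt));
    }
  }
}
```

Non-429 errors (e.g. an invalid API key) are rethrown immediately, since retrying them would only waste quota.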
## Troubleshooting
- Ensure your HuggingFace API key is valid and not rate-limited
- If the server fails to start, check `.env` and logs
- For dependency issues, use `yarn install` from the root
## License
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.