Content
<h1 align="center">
<picture>
<source media="(prefers-color-scheme: dark)" srcset="web/public/logo-white.svg" />
<img src="web/public/logo-primary.svg" alt="" height="48" align="absmiddle" />
</picture>
Beever Atlas
</h1>
<div align="center">
<picture>
<source media="(prefers-color-scheme: dark)" srcset="assets/banner-dark.png" />
<img src="assets/banner.png" alt="Beever Atlas — LLM-first Wiki Knowledge Base" width="85%" />
</picture>
</div>
<h3 align="center">
Turn your team's Slack, Discord, Teams & Mattermost chats<br>
into a self-maintaining wiki — automatically.
</h3>
<p align="center">
<a href="https://docs.beever.ai/atlas"><img src="https://img.shields.io/badge/DOCS-docs.beever.ai/atlas-FFC107?style=for-the-badge&labelColor=4A4A4A" alt="Docs" /></a>
<a href="LICENSE"><img src="https://img.shields.io/badge/LICENSE-Apache_2.0-7CB342?style=for-the-badge&labelColor=4A4A4A" alt="License Apache 2.0" /></a>
<a href="https://beever.ai"><img src="https://img.shields.io/badge/BUILT_BY-BEEVER.AI-15404E?style=for-the-badge&labelColor=4A4A4A" alt="Built by Beever.ai" /></a>
<a href="https://google.github.io/adk-docs/"><img src="https://img.shields.io/badge/BUILT_WITH-Google_ADK-FF6F00?style=for-the-badge&labelColor=4A4A4A" alt="Built with Google ADK" /></a>
</p>
<p align="center">
<a href="https://discord.gg/VshBCUUX"><img src="https://img.shields.io/badge/DISCORD-Join_Community-5865F2?style=for-the-badge&labelColor=4A4A4A&logo=discord&logoColor=white" alt="Join our Discord" /></a>
<a href="https://x.com/Beever_AI"><img src="https://img.shields.io/badge/X-@Beever__AI-000000?style=for-the-badge&labelColor=4A4A4A&logo=x&logoColor=white" alt="Follow us on X" /></a>
<a href="https://beever.ai/"><img src="https://img.shields.io/badge/WEBSITE-beever.ai-15404E?style=for-the-badge&labelColor=4A4A4A" alt="beever.ai" /></a>
</p>
---
Beever Atlas pulls the conversations your team already has on Slack, Discord, Microsoft Teams, and Mattermost, extracts atomic facts, deduplicates them, and clusters them into topic pages with citations. A graph store links the people, decisions, and projects mentioned across channels. Ask questions in natural language and get answers cited back to the source messages — through the dashboard, or through MCP into Claude Code and Cursor.
If you want a knowledge base that grows on its own from the chats your team already has, this is it.
---
## ✨ Features in action
Six short clips — connect a workspace, sync history, watch memory build, browse the auto-generated wiki, ask questions, plug external AI agents in via MCP.
<table>
<tr>
<td width="33%" align="center" valign="top">
<strong>Multi-Platform</strong><br><br>
<img src="assets/clips/multi-platform.gif" alt="Multi-platform connections demo" width="100%"><br>
Connect Slack, Discord, Teams, Mattermost, or file imports. One bot, every workspace.
</td>
<td width="33%" align="center" valign="top">
<strong>Message Sync</strong><br><br>
<img src="assets/clips/sync.gif" alt="Channel sync demo" width="100%"><br>
Pull channel history on demand or on a schedule. Resumable and rate-limit aware.
</td>
<td width="33%" align="center" valign="top">
<strong>Memory Ingestion</strong><br><br>
<img src="assets/clips/memory.gif" alt="Memory ingestion pipeline demo" width="100%"><br>
6-stage ADK pipeline distils messages into atomic facts, entities, and relationships.
</td>
</tr>
<tr>
<td width="33%" align="center" valign="top">
<strong>LLM Wiki</strong><br><br>
<img src="assets/clips/wiki.gif" alt="LLM wiki browsing demo" width="100%"><br>
Auto-maintained wiki per channel — overview, topics, people, decisions, citations.
</td>
<td width="33%" align="center" valign="top">
<strong>QA Agent</strong><br><br>
<img src="assets/clips/qa.gif" alt="QA agent answering demo" width="100%"><br>
Streams cited answers over SSE. Smart router picks semantic or graph per question.
</td>
<td width="33%" align="center" valign="top">
<strong>MCP Server</strong><br><br>
<img src="assets/clips/mcp.gif" alt="MCP server querying from Claude Code demo" width="100%"><br>
Plug Claude Code / Cursor into your knowledge base — 16 tools, per-agent auth.
</td>
</tr>
</table>
---
## 🏗️ Architecture
Conversations from any supported platform flow into a unified ingestion pipeline that produces two complementary memory systems — a **3-tier semantic store** (channel / topic / atomic fact) for fast hybrid search, and a **graph store** that extracts entities and their relationships. Those memories fuel two consumer surfaces: the **LLM Wiki** (distilled, auto-maintained) and **QA Agents** (served through the dashboard directly, or through **MCP** into Claude Code / Cursor).
<p align="center">
<picture>
<source media="(prefers-color-scheme: dark)" srcset="assets/architecture-dark.png" />
<img src="assets/architecture.png" alt="Beever Atlas architecture — chat platforms → memory ingestion → 3-tier semantic memory + graph memory → LLM Wiki and QA Agent → Dashboard and MCP clients" width="60%" />
</picture>
</p>
<p align="center"><em>From chat platforms to MCP agents — one ingestion path, two memory systems, two delivery surfaces.</em></p>
Under the hood, three services (backend, bot, frontend) are backed by four data stores (Weaviate, Neo4j, MongoDB, Redis). See the [architecture overview](https://docs.beever.ai/atlas/concepts/architecture) on the documentation site for the full design — component responsibilities, dual-memory internals, and the smart query router.
---
## 💡 Why Wiki-First RAG?
Most RAG systems answer questions by retrieving raw message snippets and feeding them straight to an LLM. Beever Atlas takes a different approach: it continuously distils conversations into a structured, auto-maintained wiki — with topic pages, entity graphs, decisions, and citations — before any query is issued. When you ask a question, the retrieval layer works against clean, deduplicated knowledge rather than noisy chat history. This means answers are more consistent, citations are traceable to source messages, and the wiki itself becomes a useful artifact your team can browse independently of the Q&A interface. The dual-memory architecture (semantic + graph) lets the query router pick the right retrieval strategy per question, keeping latency low and context precise.
<div align="center">
<img src="assets/wiki-preview.png" alt="Beever Atlas wiki view — auto-generated overview with concept map, topics, FAQ, glossary, and resources, built from Slack messages" width="85%" />
</div>
<p align="center"><em>A live auto-generated channel wiki: overview, concept map, topics, FAQ, glossary — distilled from 246 Slack messages, not hand-written.</em></p>
### The inspiration: LLMs read wikis, not chat logs
The per-channel wiki concept is directly inspired by [Andrej Karpathy's observation](https://x.com/karpathy/status/2039805659525644595) that LLMs are far better at reasoning over curated, encyclopedic content (books, docs, wikis) than over raw conversational transcripts. Chat history is noisy, redundant, temporally scattered, and full of implicit context that only humans resolve. A wiki, by contrast, is the *already-distilled* form of that knowledge — deduplicated, structured, citation-bearing, and organised by topic rather than by timestamp.
Beever Atlas operationalises this insight: every synced channel gets its own **auto-generated, continuously-updated wiki** — sections for topics, entities, decisions, open questions, and timelines — rebuilt incrementally as new messages arrive. The QA agent retrieves against this wiki first, falling back to raw messages only when a fact hasn't been distilled yet.
### What this unlocks in practice
- **Better answers, fewer hallucinations** — retrieval operates on fact-dense prose with explicit entity relationships, not on fragmented turn-by-turn chat.
- **Traceable citations** — every wiki claim links back to the source messages that produced it, so answers are auditable all the way down to the original Slack/Discord/Teams thread.
- **A browsable artifact, not just a Q&A box** — the wiki is useful *on its own*. New teammates onboarding to a channel can read the distilled wiki instead of scrolling three months of history.
- **Cheaper inference at query time** — the expensive distillation work happens once, at ingestion. Queries hit compact, pre-digested context instead of re-summarising raw logs on every request.
- **Graph-aware reasoning** — the entity graph built alongside the wiki lets the query router answer relational questions ("who worked on X with Y?") that pure vector RAG struggles with.
For a detailed comparison with other LLM knowledge tools, see [the comparison page](https://docs.beever.ai/atlas/comparison) on the documentation site.
---
## 🚀 Quick Start
Beever Atlas ships as a Docker Compose stack (backend + bot + web + 4 datastores). You can try a seeded demo in 30 seconds with zero keys, then pick one of **three deployment options** to install it for real.
### 1. Get the code
```bash
git clone https://github.com/beever-ai/beever-atlas.git
cd beever-atlas
```
### 2. Try the demo first (optional, no keys needed for seeding)
```bash
make demo
```
`make demo` brings up the full stack pre-loaded with a public Wikipedia corpus (Ada Lovelace + Python history). Seeding uses pre-computed fixtures — no API keys required. Asking questions via `/api/ask` needs a free-tier `GOOGLE_API_KEY` because the QA agent calls Gemini. See [demo/README.md](demo/README.md) for curl examples.
Skip this step if you're ready to install for real.
### 3. Before you start: get your API keys
Two free keys are required before installing. Both offer generous free tiers — enough to sync a small team's channels for testing.
| Key | Purpose | Where to get it |
|---|---|---|
| `GOOGLE_API_KEY` | Gemini — extraction, entity graph, answers | [aistudio.google.com/apikey](https://aistudio.google.com/apikey) |
| `JINA_API_KEY` | Jina v4 embeddings (2048-dim) for semantic search | [jina.ai/api-dashboard](https://jina.ai/api-dashboard/) |
Optional (skip unless you know you need them):
| Key | What it enables |
|---|---|
| `TAVILY_API_KEY` | External web search when QA retrieval confidence is low — [tavily.com](https://tavily.com/) |
| Slack / Discord / Teams bot tokens | **Configured via the web UI after setup**, not `.env` — the bot stores platform credentials encrypted in MongoDB |
> **Tip:** Keep the two required keys handy before you start. Option 1 prompts for them interactively; Options 2 and 3 need them pasted into `.env`.
### 4. Choose a deployment option
| Option | When to use | Time to "up" |
|---|---|---|
| **1. One-line install** (recommended) | You want the fastest path to a running stack. | ~2 min first run |
| **2. Manual Docker** | CI/CD, ops environments, or when you want explicit control over every step. | ~3 min first run |
| **3. Local development** | Active contributors who need hot-reload on backend and frontend. | varies |
### Option 1 — One-line install (recommended)
```bash
./atlas
```
The `atlas` installer walks you through a guided 5-step checklist:
1. **Embedding model** — pick a provider (Jina / OpenAI / Cohere / Voyage / Gemini / Mistral / Ollama), then its API key.
2. **Agent LLM provider** — pick a provider for the 16 ADK agents (Google Gemini / OpenAI / Anthropic / Mistral / DeepSeek / Groq / MiniMax / Ollama / Custom); optional second provider for hybrid setups.
3. **Graph backend** — Neo4j (default) or skip.
4. **Optional integrations** — Tavily web search, MCP server for Claude Code / Cursor.
5. **Auth tokens** — keep dev defaults or rotate now.
Under the hood it verifies `docker` + `docker compose`, copies `.env.example` → `.env` (preserves your values on re-run, `chmod 600`), auto-generates `CREDENTIAL_MASTER_KEY` (64 hex) and `WEAVIATE_API_KEY` (32 hex), runs a port-conflict preflight, launches the stack via `docker compose up -d --build --force-recreate --remove-orphans`, and polls `/api/health` before printing the ready card.
When you see **"Beever Atlas is ready"**, open **[http://localhost:3000](http://localhost:3000)** — then **Settings → AI Setup** to manage providers, assign LLMs per-agent, run Test Connection, or discover models. For CI / Docker / GitOps, configure declaratively: `BEEVER_LLM_API_KEY=...` (single-provider shortcut), `BEEVER_ENDPOINTS='[...]'` + `BEEVER_PRESET=...`, or commit an `atlas.yaml` and run `atlas apply` — see [`docs/runbooks/ai-setup.md`](docs/runbooks/ai-setup.md) and [`docs/runbooks/atlas-yaml.md`](docs/runbooks/atlas-yaml.md).
For CI or unattended installs — skip prompts, pre-seed keys from shell env:
```bash
GOOGLE_API_KEY=... JINA_API_KEY=... ./atlas --non-interactive
```
Re-running `./atlas` on an existing stack is idempotent.
### Option 2 — Manual Docker
Full control, step-by-step.
```bash
cp .env.example .env
```
Open `.env` and fill in the two required keys:
```env
GOOGLE_API_KEY=your_gemini_key
JINA_API_KEY=your_jina_key
```
Generate two required secrets and paste them into `.env`:
```bash
# CREDENTIAL_MASTER_KEY — AES-256-GCM key for stored platform credentials (64 hex chars)
python -c "import secrets; print(secrets.token_hex(32))"
# WEAVIATE_API_KEY — auth between backend and Weaviate (required by docker-compose)
python -c "import secrets; print(secrets.token_hex(16))"
```
Launch:
```bash
docker compose up -d --build
```
Open **[http://localhost:3000](http://localhost:3000)**.
**Services started:**
| Service | Port | Description |
|---|---|---|
| Web (nginx) | `:3000` | React dashboard |
| Backend | `:8000` | FastAPI + ADK agents |
| Bot | `:3001` | Platform bridge (Slack / Discord / Teams) |
| Weaviate | `:8080` | Semantic memory |
| Neo4j | `:7474` / `:7687` | Graph memory |
| MongoDB | `:27017` | State + wiki cache |
| Redis | `:6380` | Sessions (internal `:6379`) |
First run takes 2–3 minutes while images build and databases initialize. Subsequent runs start in seconds.
### Option 3 — Local development
Databases in Docker, app services native for hot-reload.
**Prerequisites:** Python 3.12+ with [uv](https://docs.astral.sh/uv/), Node.js 20+
```bash
cp .env.example .env
# Fill in GOOGLE_API_KEY, JINA_API_KEY, CREDENTIAL_MASTER_KEY, WEAVIATE_API_KEY (same as Option 2)
# Start just the databases
docker compose up -d weaviate neo4j mongodb redis
# Backend (terminal 1)
uv sync
uv run uvicorn beever_atlas.server.app:app --reload --port 8000
# Bot (terminal 2)
cd bot && npm install && npm run dev
# Web (terminal 3) — Vite dev server with HMR
cd web && npm install && npm run dev
```
Open **[http://localhost:5173](http://localhost:5173)** (the Vite dev port — **not** `:3000`).
The Vite dev server proxies `/api/*` to `http://localhost:8000` (configured via `VITE_API_URL`).
### Before going to production
`.env.example` defaults are tuned for local testing. Before any real deploy, rotate the secrets that ship with placeholder values and flip the environment flag:
| What to change | Why | How |
|---|---|---|
| `BEEVER_API_KEYS`, `BEEVER_ADMIN_TOKEN` | Ship as `dev-key-change-me` / `dev-admin-change-me` — public placeholders | `python -c "import secrets; print(secrets.token_hex(24))"` per token |
| `BRIDGE_API_KEY` | Shared secret between backend and bot; blank by default, required outside local dev | Same `secrets.token_hex(24)` |
| `VITE_BEEVER_API_KEY`, `VITE_BEEVER_ADMIN_TOKEN` | Vite bakes these into the web bundle at build time — must mirror the rotated backend values above | Copy the rotated `BEEVER_API_KEYS` / `BEEVER_ADMIN_TOKEN` values |
| `NEO4J_PASSWORD` + password half of `NEO4J_AUTH` | Dev password is public in this repo | Pick a strong password; both values must match |
| `BEEVER_ENV=production` | Enables fail-fast startup that rejects every dev default above | Flip the value in `.env` |
Option 1 (`./atlas`) handles all of this through the **"Rotate auth tokens"** prompt in step 4 of the checklist — answer **Y** and the installer generates random tokens and mirrors the VITE_* values for you. If you used Option 2 or 3, you can re-run `./atlas` on the existing `.env`, skip every other prompt with Enter, and only accept the rotation prompt.
### 5. Open the dashboard
Navigate to the URL for your chosen option:
- **Options 1 & 2** → **[http://localhost:3000](http://localhost:3000)**
- **Option 3** → **[http://localhost:5173](http://localhost:5173)**
From there:
- **Real mode** (default, `ADAPTER_MOCK=false`): connect a workspace in **Settings → Connections** — Slack / Discord / Teams tokens are entered through the UI, not `.env`.
- **Mock mode** (`ADAPTER_MOCK=true`): uses fixture data — opt in for local UI iteration without platform credentials.
### 6. Sync a channel
From the dashboard: **Connections → Add Workspace → Select channels → Sync**.
Or via API (auto-extracts your bearer token from `.env`):
```bash
curl -X POST http://localhost:8000/api/channels/C12345/sync \
-H "Authorization: Bearer $(grep -E '^BEEVER_API_KEYS=' .env | cut -d= -f2 | cut -d, -f1)"
```
### MCP server (for external AI agents)
Beever Atlas exposes a curated MCP (Model Context Protocol) server at `/mcp` for AI agents like Claude Code and Cursor. This allows external code assistants to query your team's knowledge base without using the dashboard.
See [docs/mcp-server.md](docs/mcp-server.md) for:
- **Tool catalog** — 16 tools for discovery, retrieval, graph traversal, and long-running operations
- **Auth setup** — generating and managing `BEEVER_MCP_API_KEYS`
- **Client configuration** — ready-to-use `.mcp.json` templates for Claude Code and Cursor
- **Rate limits** — principal-keyed limits to prevent one agent from throttling others
Quick example (Claude Code):
```json
{
"mcpServers": {
"beever-atlas": {
"url": "https://atlas.example.com/mcp",
"transport": "streamable-http",
"headers": {
"Authorization": "Bearer ${BEEVER_MCP_KEY}"
}
}
}
}
```
### Common commands
```bash
docker compose up -d # Start in background
docker compose logs -f beever-atlas # Tail backend logs
docker compose down # Stop (keeps data)
docker compose down -v # Stop and DELETE all indexed data
make demo # Full stack + seeded demo corpus
make docker-up # Shortcut for `docker compose up -d`
```
---
## 🔒 Privacy & Telemetry
Beever Atlas collects no telemetry. No usage data, error reports, or analytics are sent anywhere by default. All LLM calls go through API keys you configure in your own `.env`, and all data stays in the databases you control.
---
## 📐 API Stability
All `/api/*` endpoints are **UNSTABLE** in 0.1.0. v0.2.0 will introduce a `/api/v1/*` prefix; clients pinning current paths will break. See [SECURITY.md](SECURITY.md).
---
## 💬 Community & Contact
- **Discord**: [discord.gg/VshBCUUX](https://discord.gg/VshBCUUX) — get help, share what you're building, talk to the team
- **X / Twitter**: [@Beever_AI](https://x.com/Beever_AI) — release notes, posts, announcements
- **Website**: [beever.ai](https://beever.ai/) — about the company and other projects
- **GitHub Discussions**: [github.com/Beever-AI/beever-atlas/discussions](https://github.com/Beever-AI/beever-atlas/discussions) — longer-form questions and ideas
Commercial support, partnerships, or press: `tech@beever.ai`.
---
## 📜 License
[Apache License 2.0](LICENSE) © 2026 Beever Atlas contributors. Third-party attributions in [NOTICE](NOTICE).
Security policy: [SECURITY.md](SECURITY.md) | Community standards: [CODE_OF_CONDUCT.md](CODE_OF_CONDUCT.md)
MCP Config
Below is the configuration for this MCP Server. You can copy it directly to Cursor or other MCP clients.
mcp.json
Connection Info
You Might Also Like
Filesystem
Node.js MCP Server for filesystem operations with dynamic access control.
Fetch
Retrieve and process content from web pages by converting HTML into markdown format.
Context 7
Context7 MCP provides up-to-date code documentation for any prompt.
context7-mcp
Context7 MCP Server provides natural language access to documentation for...
mempalace
The highest-scoring AI memory system ever benchmarked. And it's free.
chrome-devtools-mcp
Chrome DevTools for coding agents