diff --git a/CONTRIBUTING.md b/CONTRIBUTING.md
index 5cfdc64..1453bf7 100644
--- a/CONTRIBUTING.md
+++ b/CONTRIBUTING.md
@@ -68,6 +68,25 @@ node dist/index.js /path/to/test/project
 
 The server logs to stderr, so you can see what it's doing.
 
+## Evaluation Harness
+
+Run `pnpm eval` to measure search/ranking quality against frozen fixtures. Use this before releases or after changing search/ranking/chunking logic.
+
+```bash
+# Two codebases (defaults to bundled fixtures)
+pnpm eval -- <codebaseA> <codebaseB>
+
+# Offline smoke test (no network)
+pnpm eval -- tests/fixtures/codebases/eval-controlled tests/fixtures/codebases/eval-controlled \
+  --fixture-a=tests/fixtures/eval-controlled.json \
+  --fixture-b=tests/fixtures/eval-controlled.json \
+  --skip-reindex --no-rerank
+```
+
+Flags: `--help`, `--fixture-a`, `--fixture-b`, `--skip-reindex`, `--no-rerank`, `--no-redact`.
+
+Save a report: `pnpm eval -- <path> --skip-reindex > eval-report.txt`
+
 ## Pull Requests
 
 - Fork, branch, make changes
diff --git a/README.md b/README.md
index 918d3f2..76b5956 100644
--- a/README.md
+++ b/README.md
@@ -16,11 +16,9 @@ Here's what codebase-context does:
 
 **Remembers across sessions** - Decisions, failures, workarounds that look wrong but exist for a reason - the battle scars that aren't in the comments. Recorded once, surfaced automatically so the agent doesn't "clean up" something you spent a week getting right. Conventional git commits (`refactor:`, `migrate:`, `fix:`) auto-extract into memory with zero effort. Stale memories decay and get flagged instead of blindly trusted.
 
-**Checks before editing** - Before editing something, you get a decision card showing whether there's enough evidence to proceed. If a symbol has four callers (files that import or reference it) and only two appear in your search results, the card shows that coverage gap. If coverage is low, `whatWouldHelp` lists the specific searches to run before you touch anything. When code, team memories, and patterns contradict each other, it tells you to look deeper instead of guessing.
+**Checks before editing** - Before editing something, you get a decision card showing whether there's enough evidence to proceed. If a symbol has four callers and only two appear in your search results, the card shows that coverage gap. If coverage is low, `whatWouldHelp` lists the specific searches to run before you touch anything.
 
-One tool call returns all of it. Local-first - your code never leaves your machine by default. Opt into `EMBEDDING_PROVIDER=openai` for cloud speed, but then code is sent externally.
-
-The index auto-refreshes as you edit - a file watcher triggers incremental reindex in the background when the MCP server is running. No stale context between tool calls.
+One tool call returns all of it. Local-first - your code never leaves your machine by default.
 
 ### What it looks like
 
@@ -38,289 +36,36 @@ This is the part most tools miss: what the team is doing now, what it is moving
 
 When the agent searches with edit intent, it gets a compact decision card: confidence, whether it's safe to proceed, which patterns apply, the best example, and which files are likely to be affected.
 
-More CLI examples:
-
-- `refs --symbol "ComponentStore"` for concrete static references
-- `metadata` for a quick codebase overview
-- Full gallery in `docs/cli.md`
-
-## Table of Contents
-
-- [Quick Start](#quick-start)
-- [Multi-Project and Monorepos](#multi-project-and-monorepos)
-- [Test It Yourself](#test-it-yourself)
-- [Common First Commands](#common-first-commands)
-- [What It Actually Does](#what-it-actually-does)
-- [Evaluation Harness (`npm run eval`)](#evaluation-harness-npm-run-eval)
-- [How the Search Works](#how-the-search-works)
-- [Language Support](#language-support)
-- [Configuration](#configuration)
-- [Performance](#performance)
-- [File Structure](#file-structure)
-- [CLI Reference](#cli-reference)
-- [What to add to your CLAUDE.md / AGENTS.md](#what-to-add-to-your-claudemd--agentsmd)
-- [Links](#links)
-- [License](#license)
+More CLI examples in [`docs/cli.md`](./docs/cli.md).
 
 ## Quick Start
 
-Start with the default setup:
-
-- Configure one `codebase-context` server with no project path.
-- If a tool call later returns `selection_required`, retry with `project`.
-- If you only use one repo, you can also append that repo path up front.
-
-### Pick the right setup
-
-| Situation                             | Recommended config                                                                                   |
-| ------------------------------------- | ---------------------------------------------------------------------------------------------------- |
-| Default setup                         | Run `npx -y codebase-context` with no project path                                                   |
-| Single repo setup                     | Append one project path or set `CODEBASE_ROOT`                                                       |
-| Multi-project call is still ambiguous | Retry with `project`, or keep separate server entries if your client cannot preserve project context |
-
-### Recommended setup
-
-Add it to the configuration of your AI Agent of preference:
-
-### Claude Code
-
 ```bash
 claude mcp add codebase-context -- npx -y codebase-context
 ```
 
-### Claude Desktop
-
-Add to `claude_desktop_config.json`:
-
-```json
-{
-  "mcpServers": {
-    "codebase-context": {
-      "command": "npx",
-      "args": ["-y", "codebase-context"]
-    }
-  }
-}
-```
-
-### VS Code (Copilot)
-
-Add `.vscode/mcp.json` to your project root:
-
-```json
-{
-  "servers": {
-    "codebase-context": {
-      "command": "npx",
-      "args": ["-y", "codebase-context"] // Or append "${workspaceFolder}" for single-project use
-    }
-  }
-}
-```
-
-### Cursor
-
-Add to `.cursor/mcp.json` in your project:
-
-```json
-{
-  "mcpServers": {
-    "codebase-context": {
-      "command": "npx",
-      "args": ["-y", "codebase-context"]
-    }
-  }
-}
-```
-
-### Windsurf
-
-Open Settings > MCP and add:
-
-```json
-{
-  "mcpServers": {
-    "codebase-context": {
-      "command": "npx",
-      "args": ["-y", "codebase-context"]
-    }
-  }
-}
-```
-
-### OpenCode
-
-Add `opencode.json` to your project root:
-
-```json
-{
-  "$schema": "https://opencode.ai/config.json",
-  "mcp": {
-    "codebase-context": {
-      "type": "local",
-      "command": ["npx", "-y", "codebase-context"],
-      "enabled": true
-    }
-  }
-}
-```
-
-OpenCode also supports interactive setup via `opencode mcp add`.
-
-### Codex
-
-```bash
-codex mcp add codebase-context npx -y codebase-context
-```
-
-That single config entry is the intended starting point.
-
-### Fallback setup for single-project use
-
-If you only use one repo, append a project path:
-
-```bash
-codex mcp add codebase-context npx -y codebase-context "/path/to/your/project"
-```
-
-Or set:
-
-```bash
-CODEBASE_ROOT=/path/to/your/project
-```
-
-## Multi-Project and Monorepos
-
-The MCP server can serve multiple projects in one session without requiring one MCP config entry per repo.
-
-Three cases matter:
-
-| Case                                                               | What happens                                                  |
-| ------------------------------------------------------------------ | ------------------------------------------------------------- |
-| One project                                                        | Routing is automatic                                          |
-| Multiple projects and the client provides enough workspace context | The server can route across those projects in one MCP session |
-| Multiple projects and the target is still ambiguous                | The server does not guess. Use `project` explicitly           |
-
-Important rules:
-
-- `project` is the explicit override when routing is ambiguous.
-- `project` accepts a project root path, file path, `file://` URI, or a relative subproject path under a configured workspace such as `apps/dashboard`.
-- If a client reads `codebase://context` before any project is active, the server returns a workspace overview instead of guessing.
-- The server does not rely on `cwd` walk-up in MCP mode.
-
-Typical explicit retry in a monorepo:
-
-```json
-{
-  "name": "search_codebase",
-  "arguments": {
-    "query": "auth interceptor",
-    "project": "apps/dashboard"
-  }
-}
-```
-
-Or target a repo directly:
-
-```json
-{
-  "name": "search_codebase",
-  "arguments": {
-    "query": "auth interceptor",
-    "project": "/repos/customer-portal"
-  }
-}
-```
-
-Or pass a file path and let the server resolve the nearest trusted project boundary:
-
-```json
-{
-  "name": "search_codebase",
-  "arguments": {
-    "query": "auth interceptor",
-    "project": "/repos/monorepo/apps/dashboard/src/auth/guard.ts"
-  }
-}
-```
-
-If you see `selection_required`, the server could not tell which project you meant. The response looks like this:
-
-```json
-{
-  "status": "selection_required",
-  "errorCode": "selection_required",
-  "message": "Multiple projects are available and no active project could be inferred. Retry with project.",
-  "nextAction": "retry_with_project",
-  "availableProjects": [
-    { "label": "app-a", "project": "/repos/app-a", "indexStatus": "idle", "source": "root" },
-    { "label": "app-b", "project": "/repos/app-b", "indexStatus": "ready", "source": "root" }
-  ]
-}
-```
-
-Retry the call with `project` set to one of the listed paths.
-
-`codebase://context` follows the active project in the session. In unresolved multi-project sessions it returns a workspace overview. Project-scoped resources are also available via the URIs listed in that overview.
-
-The CLI stays intentionally simpler: it targets one repo per invocation via `CODEBASE_ROOT` or the current working directory. Multi-project discovery and routing are MCP-only features, not a second CLI session model.
-
-## Test It Yourself
-
-Build the local branch first:
-
-```bash
-pnpm build
-```
-
-Then point your MCP client at the local build:
-
-```json
-{
-  "mcpServers": {
-    "codebase-context": {
-      "command": "node",
-      "args": ["<path-to-local-build>/dist/index.js"]
-    }
-  }
-}
-```
-
-If the default setup is not enough for your client, use this instead:
-
-```json
-{
-  "mcpServers": {
-    "codebase-context": {
-      "command": "node",
-      "args": ["<path-to-local-build>/dist/index.js", "/path/to/your/project"]
-    }
-  }
-}
-```
+The server runs in two modes. Use stdio unless you need multiple clients connected at once:
 
-Check these three flows:
+| Mode | How it runs | When to use |
+| ---- | ----------- | ------------ |
+| **stdio** (default) | Process spawned by the client | One AI client talking to one or more repos |
+| **HTTP** | Long-lived server at `http://127.0.0.1:3100/mcp` | Multiple clients sharing one server |
 
-1. Single project
-   Ask for `search_codebase` or `metadata`.
-   Expected: routing is automatic.
+Client support at a glance:
 
-2. Multiple projects with one server entry
-   Open two repos or one monorepo workspace.
-   Ask for `codebase://context`.
-   Expected: workspace overview first, then automatic routing once one project is active or unambiguous.
+| Client | stdio | HTTP |
+| ------ | ----- | ---- |
+| Claude Code | Yes | No (stdio only) |
+| Claude Desktop | Yes | No |
+| Cursor | Yes | Yes — `.cursor/mcp.json` with `type: "http"` |
+| Windsurf | Yes | Not yet |
+| Codex | Yes | Yes — `--mcp-config` flag |
+| VS Code (Copilot) | Yes | No |
+| OpenCode | Yes | Not documented yet |
 
-3. Ambiguous project selection
-   Start without a bootstrap path.
-   Ask for `search_codebase`.
-   Expected: `selection_required`.
-   Retry with `project`, for example `apps/dashboard` or `/repos/customer-portal`.
+Copy-pasteable templates: [`templates/mcp/stdio/.mcp.json`](./templates/mcp/stdio/.mcp.json) and [`templates/mcp/http/.mcp.json`](./templates/mcp/http/.mcp.json).
 
-For monorepos, verify all three selector forms:
-
-- relative subproject path: `apps/dashboard`
-- repo path: `/repos/customer-portal`
-- file path: `/repos/monorepo/apps/dashboard/src/auth/guard.ts`
+Full per-client setup, HTTP server instructions, and local build testing: [`docs/client-setup.md`](./docs/client-setup.md).
 
 ## Common First Commands
 
@@ -339,199 +84,78 @@ npx -y codebase-context memory list
 
 This is also what your AI agent consumes automatically via MCP tools; the CLI is the human-readable version.
 
-### CLI preview
-
-```text
-$ npx -y codebase-context patterns
-┌─ Team Patterns ──────────────────────────────────────────────────────┐
-│                                                                      │
-│ UNIT TEST FRAMEWORK                                                  │
-│      USE: Vitest – 96% adoption                                      │
-│ alt  CAUTION: Jest – 4% minority pattern                             │
-│                                                                      │
-│ STATE MANAGEMENT                                                     │
-│      PREFER: RxJS – 63% adoption                                     │
-│ alt  Redux-style store – 25%                                         │
-│                                                                      │
-└──────────────────────────────────────────────────────────────────────┘
-```
-
-```text
-$ npx -y codebase-context search --query "file watcher" --intent edit --limit 1
-┌─ Search: "file watcher" ─── intent: edit ────────────────────────────┐
-│ Quality: ok (1.00)                                                   │
-│ Ready to edit: YES                                                   │
-│                                                                      │
-│ Best example: index.ts                                               │
-└──────────────────────────────────────────────────────────────────────┘
-```
-
-```text
-$ npx -y codebase-context metadata
-┌─ codebase-context [monorepo] ────────────────────────────────────────┐
-│                                                                      │
-│ Framework: Angular unknown   Architecture: mixed                     │
-│ 130 files · 24,211 lines · 1077 components                           │
-│                                                                      │
-│ Dependencies: @huggingface/transformers · @lancedb/lancedb ·         │
-│ @modelcontextprotocol/sdk · @typescript-eslint/typescript-estree ·   │
-│ chokidar · fuse.js (+14 more)                                        │
-│                                                                      │
-└──────────────────────────────────────────────────────────────────────┘
-```
-
-```text
-$ npx -y codebase-context refs --symbol "startFileWatcher"
-┌─ startFileWatcher ─── 11 references ─── static analysis ─────────────┐
-│                                                                      │
-│ startFileWatcher                                                     │
-│ │                                                                    │
-│ ├─ file-watcher.test.ts:5                                            │
-│ │   import { startFileWatcher } from '../src/core/file-watcher....   │
-│                                                                      │
-└──────────────────────────────────────────────────────────────────────┘
-```
-
-```text
-$ npx -y codebase-context cycles
-┌─ Circular Dependencies ──────────────────────────────────────────────┐
-│                                                                      │
-│ No cycles found  ·  98 files  ·  260 edges  ·  2.7 avg deps          │
-│                                                                      │
-└──────────────────────────────────────────────────────────────────────┘
-```
-
-See `docs/cli.md` for the full CLI gallery.
-
-## What It Actually Does
-
-Other tools help AI find code. This one helps AI make the right decisions - by knowing what your team does, tracking where codebases are heading, and warning before mistakes happen.
-
-### The Difference
-
-| Without codebase-context                                | With codebase-context                               |
-| ------------------------------------------------------- | --------------------------------------------------- |
-| Generates code using whatever matches or "sounds" right | Generates code following your team conventions      |
-| Copies any example that fits                            | Follows your best implementations (golden files)    |
-| Repeats mistakes you already corrected                  | Surfaces failure memories right before trying again |
-| You re-explain the same things every session            | Remembers conventions and decisions automatically   |
-| Edits confidently even when context is weak             | Flags high-risk changes when evidence is thin       |
-| Sees what the current code does and assumes             | Sees how your code has evolved and why              |
+## What it does
 
 ### The Search Tool (`search_codebase`)
 
-This is where it all comes together. One call returns:
-
-- **Code results** with `file` (path + line range), `summary`, `score`
-- **Type** per result: compact `componentType:layer` (e.g., `service:data`) — helps agents orient
-- **Pattern signals** per result: `trend` (Rising/Declining — Stable is omitted) and `patternWarning` when using legacy code
-- **Relationships** per result: `importedByCount` and `hasTests` (condensed) + **hints** (capped ranked callers, consumers, tests) — so you see suggested next reads and know what you haven't looked at yet
-- **Related memories**: up to 3 team decisions, gotchas, and failures matched to the query
-- **Search quality**: `ok` or `low_confidence` with confidence score and `hint` when low
-- **Preflight**: `ready` (boolean) with decision card when `intent="edit"|"refactor"|"migrate"`. Shows `nextAction` (if not ready), `warnings`, `patterns` (do/avoid), `bestExample`, `impact` (import-graph coverage — how many files that import or reference the result are in your search), and `whatWouldHelp` (next steps). If search quality is low, `ready` is always `false`.
-
-Snippets are optional (`includeSnippets: true`). When enabled, snippets that have symbol metadata (e.g. from the Generic analyzer's AST chunking or Angular component chunks) start with a scope header so you know where the code lives (e.g. `// AuthService.getToken()` or `// SpotifyApiService`). Example:
+One call returns ranked results with `file`, `summary`, `score`, compact type (`componentType:layer`), pattern trend signals, relationship hints, related team memories, a search quality assessment, and a preflight decision card when `intent="edit"`. The decision card shows `ready` (boolean), `nextAction` when not ready, `patterns` (do/avoid), `bestExample`, impact coverage (`"3/5 callers in results"`), and `whatWouldHelp`.
 
-```ts
-// AuthService.getToken()
-getToken(): string {
-  return this.token;
-}
-```
-
-Default output is lean — if the agent wants code, it calls `read_file`.
+Default output is lean — if the agent wants code, it calls `read_file`. Add `includeSnippets: true` for inline code with scope headers (e.g. `// AuthService.getToken()`).
 
-For scripting and automation, every CLI command accepts `--json` for machine output (stdout = JSON; logs/errors go to stderr).
-See `docs/capabilities.md` for the field reference.
-
-Lean enough to fit on one screen. If search quality is low, preflight blocks edits instead of faking confidence.
+See [`docs/capabilities.md`](./docs/capabilities.md) for the full field reference.
 
 ### Patterns & Conventions (`get_team_patterns`)
 
-Detects what your team actually does by analyzing the codebase:
-
-- Adoption percentages for dependency injection, state management, testing, libraries
-- Patterns/conventions trend direction (Rising / Stable / Declining) based on git recency
-- Golden files - your best implementations ranked by modern pattern density
-- Conflicts - when the team hasn't converged (both approaches above 20% adoption)
+Detects what your team actually does by analyzing the codebase: adoption percentages for DI, state management, testing, and library patterns; trend direction (Rising / Stable / Declining) from git recency; golden files ranked by modern pattern density; conflicts when two approaches both exceed 20%.
 
 ### Team Memory (`remember` + `get_memory`)
 
-Record a decision once. It surfaces automatically in search results and preflight cards from then on. **Your git commits also become memories** - conventional commits like `refactor:`, `migrate:`, `fix:`, `revert:` from the last 90 days are auto-extracted during indexing.
+Record a decision once. It surfaces automatically in search results and preflight cards from then on. Conventional commits (`refactor:`, `migrate:`, `fix:`, `revert:`) from the last 90 days auto-extract into memory during indexing — no setup required.
 
-- **Types**: conventions (style rules), decisions (architecture choices), gotchas (things that break), failures (we tried X, it broke because Y)
-- **Confidence decay**: decisions age over 180 days, gotchas and failures over 90 days. Stale memories get flagged instead of blindly trusted.
-- **Zero-config git extraction**: runs automatically during `refresh_index`. No setup, no manual work.
+Memory types: `convention`, `decision`, `gotcha`, `failure`. Confidence decay: conventions never decay, decisions 180-day half-life, gotchas/failures 90-day. Stale memories get flagged instead of blindly trusted.
 
-### All Tools
+## Tools
 
-| Tool                           | What it does                                                                                                                                            |
-| ------------------------------ | ------------------------------------------------------------------------------------------------------------------------------------------------------- |
-| `search_codebase`              | Hybrid search + decision card. Pass `intent="edit"` to get `ready`, `nextAction`, patterns, import-graph coverage, and `whatWouldHelp`.                 |
-| `get_team_patterns`            | Pattern frequencies, golden files, conflict detection                                                                                                   |
-| `get_symbol_references`        | Find concrete references to a symbol (usageCount + top snippets). `confidence: "syntactic"` = static/source-based only; no runtime or dynamic dispatch. |
-| `remember`                     | Record a convention, decision, gotcha, or failure                                                                                                       |
-| `get_memory`                   | Query team memory with confidence decay scoring                                                                                                         |
-| `get_codebase_metadata`        | Project structure, frameworks, dependencies                                                                                                             |
-| `get_style_guide`              | Style guide rules for the current project                                                                                                               |
-| `detect_circular_dependencies` | Import cycles between files                                                                                                                             |
-| `refresh_index`                | Re-index (full or incremental) + extract git memories                                                                                                   |
-| `get_indexing_status`          | Progress and stats for the current index                                                                                                                |
+| Tool | What it does |
+| ---- | ------------ |
+| `search_codebase` | Hybrid search + decision card when `intent="edit"` |
+| `get_team_patterns` | Pattern frequencies, golden files, conflict detection |
+| `get_symbol_references` | Concrete references to a symbol (count + snippets) |
+| `remember` | Record a convention, decision, gotcha, or failure |
+| `get_memory` | Query team memory with confidence decay scoring |
+| `get_codebase_metadata` | Project structure, frameworks, dependencies |
+| `get_style_guide` | Style guide rules for the current project |
+| `detect_circular_dependencies` | Import cycles between files |
+| `refresh_index` | Full or incremental re-index + git memory extraction |
+| `get_indexing_status` | Progress and stats for the current index |
 
-## Evaluation Harness (`npm run eval`)
+## Multi-project
 
-Reproducible evaluation with frozen fixtures so ranking/chunking changes are measured honestly and regressions get caught. **For contributors and CI:** run before releases or after changing search/ranking/chunking to guard against regressions.
+One server, multiple repos. Three cases:
 
-- Two codebases: `npm run eval -- <codebaseA> <codebaseB>`
-- Defaults: fixture A = `tests/fixtures/eval-angular-spotify.json`, fixture B = `tests/fixtures/eval-controlled.json`
-- Offline smoke (no network):
+| Case | What happens |
+| ---- | ------------ |
+| One project | Routing is automatic |
+| Multiple projects, active project already set | Routes to the active project |
+| Multiple projects, ambiguous | Returns `selection_required` — retry with `project` |
 
-```bash
-npm run eval -- tests/fixtures/codebases/eval-controlled tests/fixtures/codebases/eval-controlled \
-  --fixture-a=tests/fixtures/eval-controlled.json \
-  --fixture-b=tests/fixtures/eval-controlled.json \
-  --skip-reindex --no-rerank
-```
-
-- Flags: `--help`, `--fixture-a`, `--fixture-b`, `--skip-reindex`, `--no-rerank`, `--no-redact`
-- To save a report for later comparison, redirect stdout (e.g. `pnpm run eval -- <path-to-angular-spotify> --skip-reindex > internal-docs/tests/eval-runs/angular-spotify-YYYY-MM-DD.txt`).
-
-## How the Search Works
+`project` accepts a project root path, file path, `file://` URI, or relative subproject path (e.g. `apps/dashboard`).
 
-The retrieval pipeline is designed around one goal: give the agent the right context, not just any file that matches.
-
-- **Definition-first ranking** - for exact-name lookups (e.g. a symbol name), the file that _defines_ the symbol ranks above files that only use it.
-- **Intent classification** - knows whether "AuthService" is a name lookup or "how does auth work" is conceptual. Adjusts keyword/semantic weights accordingly.
-- **Hybrid fusion (RRF)** - combines keyword and semantic search using Reciprocal Rank Fusion instead of brittle score averaging.
-- **Query expansion** - conceptual queries automatically expand with domain-relevant terms (auth → login, token, session, guard).
-- **Contamination control** - test files are filtered/demoted for non-test queries.
-- **Import centrality** - files that are imported more often rank higher.
-- **Cross-encoder reranking** - a stage-2 reranker triggers only when top scores are ambiguous. CPU-only, bounded to top-K.
-- **Incremental indexing** - only re-indexes files that changed since last run (SHA-256 manifest diffing).
-- **Version gating** - index artifacts are versioned; mismatches trigger automatic rebuild so mixed-version data is never served.
-- **Auto-heal** - if the index corrupts, search triggers a full re-index automatically.
+```json
+{
+  "name": "search_codebase",
+  "arguments": {
+    "query": "auth interceptor",
+    "project": "apps/dashboard"
+  }
+}
+```
 
-**Index reliability:** Rebuilds write to a staging directory and swap atomically only on success, so a failed rebuild never corrupts the active index. Version mismatches or corruption trigger an automatic full re-index (no user action required).
+If you get `selection_required`, retry with one of the paths from `availableProjects`. Full routing details and response shapes in [`docs/capabilities.md`](./docs/capabilities.md#project-routing).
 
 ## Language Support
 
-**10 languages** have full symbol extraction (Tree-sitter): TypeScript, JavaScript, Python, Java, Kotlin, C, C++, C#, Go, Rust. **30+ languages** have indexing and retrieval coverage (keyword + semantic), including PHP, Ruby, Swift, Scala, Shell, and config/markup (JSON/YAML/TOML/XML, etc.).
-
-Enrichment is framework-specific: right now only **Angular** has a dedicated analyzer for rich conventions/context (signals, standalone components, control flow, DI patterns).
-
-For non-Angular projects, the **Generic** analyzer uses **AST-aligned chunking** when a Tree-sitter grammar is available: symbol-bounded chunks with **scope-aware prefixes** (e.g. `// ClassName.methodName`) so snippets show where code lives. Without a grammar it falls back to safe line-based chunking.
-
-Structured filters available: `framework`, `language`, `componentType`, `layer` (presentation, business, data, state, core, shared).
+10 languages with full symbol extraction via Tree-sitter: TypeScript, JavaScript, Python, Java, Kotlin, C, C++, C#, Go, Rust. 30+ languages with indexing and retrieval coverage, including PHP, Ruby, Swift, Scala, Shell, and config formats. Angular has a dedicated analyzer; everything else uses the Generic analyzer with AST-aligned chunking when a grammar is available.
 
 ## Configuration
 
-| Variable                 | Default                    | Description                                                                                                |
-| ------------------------ | -------------------------- | ---------------------------------------------------------------------------------------------------------- |
-| `EMBEDDING_PROVIDER`     | `transformers`             | `openai` (fast, cloud) or `transformers` (local, private)                                                  |
-| `OPENAI_API_KEY`         | -                          | Required only if using `openai` provider                                                                   |
-| `CODEBASE_ROOT`          | -                          | Optional bootstrap root for CLI and single-project MCP clients without roots                               |
-| `CODEBASE_CONTEXT_DEBUG` | -                          | Set to `1` for verbose logging                                                                             |
-| `EMBEDDING_MODEL`        | `Xenova/bge-small-en-v1.5` | Local embedding model override (e.g. `onnx-community/granite-embedding-small-english-r2-ONNX` for Granite) |
+| Variable | Default | Description |
+| -------- | ------- | ----------- |
+| `EMBEDDING_PROVIDER` | `transformers` | `openai` (fast, cloud) or `transformers` (local, private) |
+| `OPENAI_API_KEY` | — | Required only if using `openai` provider |
+| `CODEBASE_ROOT` | — | Bootstrap root for CLI and single-project MCP clients |
+| `CODEBASE_CONTEXT_DEBUG` | — | Set to `1` for verbose logging |
+| `EMBEDDING_MODEL` | `Xenova/bge-small-en-v1.5` | Local embedding model override |
 
 ## Performance
 
@@ -559,54 +183,6 @@ Structured filters available: `framework`, `language`, `componentType`, `layer`
 !.codebase-context/memory.json
 ```
 
-## CLI Reference
-
-Repo-scoped analysis commands are available via the CLI — no AI agent required. MCP multi-project routing uses the shared `project` selector when needed; the CLI stays one-root-per-invocation.
-For formatted examples and “money shots”, see `docs/cli.md`.
-
-Set `CODEBASE_ROOT` to your project root, or run from the project directory.
-
-```bash
-# Search the indexed codebase
-npx -y codebase-context search --query "authentication middleware"
-npx -y codebase-context search --query "auth" --intent edit --limit 5
-
-# Project structure, frameworks, and dependencies
-npx -y codebase-context metadata
-
-# Index state and progress
-npx -y codebase-context status
-
-# Re-index the codebase
-npx -y codebase-context reindex
-npx -y codebase-context reindex --incremental --reason "added new service"
-
-# Style guide rules
-npx -y codebase-context style-guide
-npx -y codebase-context style-guide --query "naming" --category patterns
-
-# Team patterns (DI, state, testing, etc.)
-npx -y codebase-context patterns
-npx -y codebase-context patterns --category testing
-
-# Symbol references
-npx -y codebase-context refs --symbol "UserService"
-npx -y codebase-context refs --symbol "handleLogin" --limit 20
-
-# Circular dependency detection
-npx -y codebase-context cycles
-npx -y codebase-context cycles --scope src/features
-
-# Memory management
-npx -y codebase-context memory list
-npx -y codebase-context memory list --category conventions --type convention
-npx -y codebase-context memory list --query "auth" --json
-npx -y codebase-context memory add --type convention --category tooling --memory "Use pnpm, not npm" --reason "Workspace support and speed"
-npx -y codebase-context memory remove <id>
-```
-
-All commands accept `--json` for raw JSON output suitable for piping and scripting.
-
 ## What to add to your CLAUDE.md / AGENTS.md
 
 Paste this into `.cursorrules`, `CLAUDE.md`, `AGENTS.md`, or wherever your AI reads project instructions:
@@ -629,9 +205,12 @@ These are the behaviors that make the most difference day-to-day. Copy, trim wha
 
 ## Links
 
-- [Motivation](./MOTIVATION.md) - Research and design rationale
-- [Changelog](./CHANGELOG.md) - Version history
-- [Contributing](./CONTRIBUTING.md) - How to add analyzers
+- [Client Setup](./docs/client-setup.md) — per-client config, HTTP setup, local build testing
+- [Capabilities Reference](./docs/capabilities.md) — tool API, retrieval pipeline, decision card schema
+- [CLI Gallery](./docs/cli.md) — formatted command output examples
+- [Motivation](./MOTIVATION.md) — research and design rationale
+- [Contributing](./CONTRIBUTING.md) — dev setup and eval harness
+- [Changelog](./CHANGELOG.md)
 
 ## License
 
diff --git a/docs/capabilities.md b/docs/capabilities.md
index bc0828b..52185de 100644
--- a/docs/capabilities.md
+++ b/docs/capabilities.md
@@ -2,6 +2,26 @@
 
 Technical reference for what `codebase-context` ships today. For the user-facing overview, see [README.md](../README.md).
 
+## Transport Modes
+
+The server supports two transport modes:
+
+| Mode | Command | MCP endpoint |
+| ---- | ------- | ------------ |
+| **stdio** (default) | `npx -y codebase-context` | Spawned process stdin/stdout |
+| **HTTP** | `npx -y codebase-context --http [--port N]` | `http://127.0.0.1:3100/mcp` |
+
+HTTP defaults to `127.0.0.1:3100`. Override with `--port`, `CODEBASE_CONTEXT_PORT`, or `server.port` in `~/.codebase-context/config.json`.
+
+Config-registered project roots (from `~/.codebase-context/config.json`) are loaded at startup in both modes.
+
+Copy-pasteable client config templates are shipped in the package:
+
+- `templates/mcp/stdio/.mcp.json` — stdio setup for `.mcp.json`-style clients
+- `templates/mcp/http/.mcp.json` — HTTP setup for `.mcp.json`-style clients
+
+Client transport support varies — see [README.md](../README.md) for a per-client matrix covering Claude Code, Cursor, Codex, Windsurf, VS Code, Claude Desktop, and OpenCode.
+
 ## CLI Reference
 
 Repo-scoped capabilities are available locally via the CLI (human-readable by default, `--json` for automation).
@@ -86,6 +106,61 @@ Rules:
 - `codebase://context` serves the active project. Before selection in an unresolved multi-project session, it returns a workspace overview with candidate projects, readiness state, and project-scoped resource URIs.
 - `codebase://context/project/<encoded-project-path>` serves a specific project directly and also makes that project active for later tool calls.
 
+### Examples
+
+Retry with a subproject path in a monorepo:
+
+```json
+{
+  "name": "search_codebase",
+  "arguments": {
+    "query": "auth interceptor",
+    "project": "apps/dashboard"
+  }
+}
+```
+
+Target a repo directly:
+
+```json
+{
+  "name": "search_codebase",
+  "arguments": {
+    "query": "auth interceptor",
+    "project": "/repos/customer-portal"
+  }
+}
+```
+
+Pass a file path and let the server resolve the nearest project boundary:
+
+```json
+{
+  "name": "search_codebase",
+  "arguments": {
+    "query": "auth interceptor",
+    "project": "/repos/monorepo/apps/dashboard/src/auth/guard.ts"
+  }
+}
+```
+
+`selection_required` response shape:
+
+```json
+{
+  "status": "selection_required",
+  "errorCode": "selection_required",
+  "message": "Multiple projects are available and no active project could be inferred. Retry with project.",
+  "nextAction": "retry_with_project",
+  "availableProjects": [
+    { "label": "app-a", "project": "/repos/app-a", "indexStatus": "idle", "source": "root" },
+    { "label": "app-b", "project": "/repos/app-b", "indexStatus": "ready", "source": "root" }
+  ]
+}
+```
+
+Retry the call with `project` set to one of the listed paths.
+
 ## Retrieval Pipeline
 
 Ordered by execution:
diff --git a/docs/client-setup.md b/docs/client-setup.md
new file mode 100644
index 0000000..4acdb02
--- /dev/null
+++ b/docs/client-setup.md
@@ -0,0 +1,208 @@
+# Client Setup
+
+Full setup instructions for each AI client. For the quick-start summary, see [README.md](../README.md).
+
+## Transport modes
+
+| Mode | How it runs | When to use |
+| ---- | ----------- | ------------ |
+| **stdio** (default) | Process spawned by the client | One AI client, simple setup |
+| **HTTP** | Long-lived server at `http://127.0.0.1:3100/mcp` | Multiple clients sharing one server |
+
+Start the HTTP server:
+
+```bash
+npx -y codebase-context --http            # default port 3100
+npx -y codebase-context --http --port 4000
+```
+
+Copy-pasteable templates: [`templates/mcp/stdio/.mcp.json`](../templates/mcp/stdio/.mcp.json) and [`templates/mcp/http/.mcp.json`](../templates/mcp/http/.mcp.json).
+
+## Claude Code
+
+```bash
+claude mcp add codebase-context -- npx -y codebase-context
+```
+
+Claude Code only supports stdio. HTTP is not available for this client.
+
+## Claude Desktop
+
+Add to `claude_desktop_config.json`:
+
+```json
+{
+  "mcpServers": {
+    "codebase-context": {
+      "command": "npx",
+      "args": ["-y", "codebase-context"]
+    }
+  }
+}
+```
+
+Claude Desktop only supports stdio.
+
+## Cursor
+
+**Stdio** — add to `.cursor/mcp.json` in your project (copy from [`templates/mcp/stdio/.mcp.json`](../templates/mcp/stdio/.mcp.json)):
+
+```json
+{
+  "mcpServers": {
+    "codebase-context": {
+      "command": "npx",
+      "args": ["-y", "codebase-context"]
+    }
+  }
+}
+```
+
+**HTTP** — start the server first, then add to `.cursor/mcp.json` (copy from [`templates/mcp/http/.mcp.json`](../templates/mcp/http/.mcp.json)):
+
+```json
+{
+  "mcpServers": {
+    "codebase-context": {
+      "type": "http",
+      "url": "http://127.0.0.1:3100/mcp"
+    }
+  }
+}
+```
+
+## Windsurf
+
+Open Settings > MCP and add (stdio only — HTTP is not documented for Windsurf yet):
+
+```json
+{
+  "mcpServers": {
+    "codebase-context": {
+      "command": "npx",
+      "args": ["-y", "codebase-context"]
+    }
+  }
+}
+```
+
+## Codex
+
+**Stdio:**
+
+```bash
+codex mcp add codebase-context npx -y codebase-context
+```
+
+**HTTP** — start the server first (`npx -y codebase-context --http`), then save a config file and pass it:
+
+```json
+{
+  "mcpServers": {
+    "codebase-context": {
+      "type": "http",
+      "url": "http://127.0.0.1:3100/mcp"
+    }
+  }
+}
+```
+
+```bash
+codex --mcp-config /path/to/mcp-http.json
+```
+
+## VS Code (Copilot)
+
+Add `.vscode/mcp.json` to your project root. VS Code uses `servers` instead of `mcpServers`:
+
+```json
+{
+  "servers": {
+    "codebase-context": {
+      "command": "npx",
+      "args": ["-y", "codebase-context"]
+    }
+  }
+}
+```
+
+## OpenCode
+
+Add `opencode.json` to your project root:
+
+```json
+{
+  "$schema": "https://opencode.ai/config.json",
+  "mcp": {
+    "codebase-context": {
+      "type": "local",
+      "command": ["npx", "-y", "codebase-context"],
+      "enabled": true
+    }
+  }
+}
+```
+
+OpenCode also supports interactive setup via `opencode mcp add`.
+
+## Single-project fallback
+
+If you only use one repo, append a project path:
+
+```bash
+codex mcp add codebase-context npx -y codebase-context "/path/to/your/project"
+```
+
+Or set an environment variable:
+
+```bash
+CODEBASE_ROOT=/path/to/your/project
+```
+
+## Test a local build
+
+Build the local branch first:
+
+```bash
+pnpm build
+```
+
+Then point your MCP client at the local build:
+
+```json
+{
+  "mcpServers": {
+    "codebase-context": {
+      "command": "node",
+      "args": ["<path-to-local-build>/dist/index.js"]
+    }
+  }
+}
+```
+
+If the default setup is not enough for your client, pass a project path explicitly:
+
+```json
+{
+  "mcpServers": {
+    "codebase-context": {
+      "command": "node",
+      "args": ["<path-to-local-build>/dist/index.js", "/path/to/your/project"]
+    }
+  }
+}
+```
+
+Check these three flows:
+
+1. **Single project** — call `search_codebase` or `metadata`. Routing is automatic.
+
+2. **Multiple projects, one server entry** — open two repos or a monorepo. Call `codebase://context`. Expected: workspace overview, then automatic routing once a project is active.
+
+3. **Ambiguous selection** — start without a bootstrap path, call `search_codebase`. Expected: `selection_required`. Retry with `project` set to `apps/dashboard` or `/repos/customer-portal`.
+
+For monorepos, test all three selector forms:
+
+- relative subproject path: `apps/dashboard`
+- repo path: `/repos/customer-portal`
+- file path: `/repos/monorepo/apps/dashboard/src/auth/guard.ts`
diff --git a/package.json b/package.json
index 787a580..4f41797 100644
--- a/package.json
+++ b/package.json
@@ -40,7 +40,9 @@
     "README.md",
     "LICENSE",
     "docs/cli.md",
-    "docs/capabilities.md"
+    "docs/capabilities.md",
+    "docs/client-setup.md",
+    "templates"
   ],
   "packageManager": "pnpm@10.27.0",
   "engines": {
diff --git a/templates/mcp/http/.mcp.json b/templates/mcp/http/.mcp.json
new file mode 100644
index 0000000..b1cc52c
--- /dev/null
+++ b/templates/mcp/http/.mcp.json
@@ -0,0 +1,8 @@
+{
+  "mcpServers": {
+    "codebase-context": {
+      "type": "http",
+      "url": "http://127.0.0.1:3100/mcp"
+    }
+  }
+}
diff --git a/templates/mcp/stdio/.mcp.json b/templates/mcp/stdio/.mcp.json
new file mode 100644
index 0000000..cfedd20
--- /dev/null
+++ b/templates/mcp/stdio/.mcp.json
@@ -0,0 +1,8 @@
+{
+  "mcpServers": {
+    "codebase-context": {
+      "command": "npx",
+      "args": ["-y", "codebase-context"]
+    }
+  }
+}
diff --git a/tests/mcp-client-templates.test.ts b/tests/mcp-client-templates.test.ts
new file mode 100644
index 0000000..5d165e1
--- /dev/null
+++ b/tests/mcp-client-templates.test.ts
@@ -0,0 +1,137 @@
+import { describe, it, expect } from 'vitest';
+import { readFileSync } from 'node:fs';
+import { resolve } from 'node:path';
+
+const root = resolve(import.meta.dirname, '..');
+
+function readJson(relPath: string): unknown {
+  const content = readFileSync(resolve(root, relPath), 'utf8');
+  return JSON.parse(content);
+}
+
+function readText(relPath: string): string {
+  return readFileSync(resolve(root, relPath), 'utf8');
+}
+
+// ---------------------------------------------------------------------------
+// Template JSON validity
+// ---------------------------------------------------------------------------
+
+describe('templates/mcp/stdio/.mcp.json', () => {
+  it('parses as valid JSON', () => {
+    expect(() => readJson('templates/mcp/stdio/.mcp.json')).not.toThrow();
+  });
+
+  it('has an mcpServers key at the top level', () => {
+    const config = readJson('templates/mcp/stdio/.mcp.json') as Record<string, unknown>;
+    expect(config).toHaveProperty('mcpServers');
+    expect(typeof config.mcpServers).toBe('object');
+  });
+
+  it('contains the codebase-context server entry', () => {
+    const config = readJson('templates/mcp/stdio/.mcp.json') as {
+      mcpServers: Record<string, unknown>;
+    };
+    expect(config.mcpServers).toHaveProperty('codebase-context');
+  });
+
+  it('server entry uses npx command', () => {
+    const config = readJson('templates/mcp/stdio/.mcp.json') as {
+      mcpServers: Record<string, { command?: string; args?: string[] }>;
+    };
+    const entry = config.mcpServers['codebase-context'];
+    expect(entry.command).toBe('npx');
+    expect(entry.args).toContain('codebase-context');
+  });
+});
+
+describe('templates/mcp/http/.mcp.json', () => {
+  it('parses as valid JSON', () => {
+    expect(() => readJson('templates/mcp/http/.mcp.json')).not.toThrow();
+  });
+
+  it('has an mcpServers key at the top level', () => {
+    const config = readJson('templates/mcp/http/.mcp.json') as Record<string, unknown>;
+    expect(config).toHaveProperty('mcpServers');
+  });
+
+  it('contains the codebase-context server entry', () => {
+    const config = readJson('templates/mcp/http/.mcp.json') as {
+      mcpServers: Record<string, unknown>;
+    };
+    expect(config.mcpServers).toHaveProperty('codebase-context');
+  it('server entry points to the local HTTP endpoint', () => {
+    const config = readJson('templates/mcp/http/.mcp.json') as {
+      mcpServers: Record<string, { url?: string; type?: string }>;
+    };
+    const entry = config.mcpServers['codebase-context'];
+    expect(entry.url).toBe('http://127.0.0.1:3100/mcp');
+    expect(entry.type).toBe('http');
+  });
+    const entry = config.mcpServers['codebase-context'];
+    expect(entry.url).toBe('http://127.0.0.1:3100/mcp');
+  });
+});
+
+// ---------------------------------------------------------------------------
+// README references templates and all four target clients
+// ---------------------------------------------------------------------------
+
+describe('README.md client setup documentation', () => {
+  const readme = readText('README.md');
+
+  it('references the stdio template path', () => {
+    expect(readme).toContain('templates/mcp/stdio/.mcp.json');
+  });
+
+  it('references the HTTP template path', () => {
+    expect(readme).toContain('templates/mcp/http/.mcp.json');
+  });
+
+  it('mentions Claude Code', () => {
+    expect(readme).toContain('Claude Code');
+  });
+
+  it('mentions Cursor', () => {
+    expect(readme).toContain('Cursor');
+  });
+
+  it('mentions Codex', () => {
+    expect(readme).toContain('Codex');
+  });
+
+  it('mentions Windsurf', () => {
+    expect(readme).toContain('Windsurf');
+  });
+
+  it('includes the HTTP endpoint URL', () => {
+    expect(readme).toContain('127.0.0.1:3100/mcp');
+  });
+});
+
+// ---------------------------------------------------------------------------
+// docs/capabilities.md transport notes
+// ---------------------------------------------------------------------------
+
+describe('docs/capabilities.md transport documentation', () => {
+  const caps = readText('docs/capabilities.md');
+
+  it('references the stdio template path', () => {
+    expect(caps).toContain('templates/mcp/stdio/.mcp.json');
+  });
+
+  it('references the HTTP template path', () => {
+    expect(caps).toContain('templates/mcp/http/.mcp.json');
+  });
+
+  it('mentions the HTTP endpoint URL', () => {
+    expect(caps).toContain('127.0.0.1:3100/mcp');
+  });
+
+  it('covers all four target clients', () => {
+    expect(caps).toContain('Claude Code');
+    expect(caps).toContain('Cursor');
+    expect(caps).toContain('Codex');
+    expect(caps).toContain('Windsurf');
+  });
+});